Jingfeng Yang @JingfengY

Applied Scientist @AmazonScience #LLMs #NLProc Formerly @SALT_NLP @Georgia_Tech @PKU1898 @Google @MSFTResearch. Opinions are my own. jingfengyang.github.io Joined April 2019

483 Tweets 2K Followers 618 Following 2K Likes

Rohan Paul @rohanpaul_ai

3 days ago

The self-extend paper is really becoming important - "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" 🔥 📌 Extend existing LLMs’ context window without any fine-tuning 📌 One feasible way to avoid the O.O.D. (out-of-distribution) problems caused by unseen…

2 32 95 11K 99

Jacob Pfau @jacob_pfau

4 days ago

Do models need to reason in words to benefit from chain-of-thought tokens? In our experiments, the answer is no! Models can perform on par with CoT using repeated '...' filler tokens. This raises alignment concerns: Using filler, LMs can do hidden reasoning not visible in CoT🧵

40 179 1K 248K 907

Junyang Lin @JustinLin610

4 days ago

I guess you might have tried the demo (huggingface.co/spaces/Qwen/Qw…). Now the weights of Qwen1.5-110B are out! For now only the base and chat models are available; AWQ and GGUF quantized models will be released very soon! Blog: qwenlm.github.io/blog/qwen1.5-1… Hugging Face:…

14 51 183 127K 49

Yijia Shao @EchoShao8899

4 days ago

Thanks for implementing our paper! But actually, you only need to modify 5 lines of code to configure STORM with Claude models. ZERO line of change is needed now because I just added an example script to our repo! github.com/stanford-oval/…

Alex Albert @alexalbert__

5 days ago

4 11 116 67K 186

4 18 142 45K 180

Ansong Ni @AnsongNi

5 days ago

Excited to share our work at @GoogleDeepMind! We propose Naturalized Execution Tuning (NExT), a self-training method that drastically improves the LLM's ability to reason about code execution, by learning to inspect execution traces and generate chain-of-thought rationales 🧵👇

15 122 554 53K 409

Wenhu Chen @WenhuChen

5 days ago

Great results on extending Llama-3 to long context reasoning!

Jingfeng Yang @JingfengY

6 days ago

0 23 72 17K 31

0 4 25 8K 5

Kyle Mistele 🏴‍☠️ @0xblacklight

5 days ago

Self-Extend is one of the coolest and most underrated techniques for extending context length in a stable manner. I've personally used it with llama.cpp (it feels much more stable than RoPE) and it blows my mind that more projects don't support it since it does not require…

Jingfeng Yang @JingfengY

6 days ago

0 23 72 17K 31

0 2 4 1K 4

Jingfeng Yang @JingfengY

6 days ago

New results on Llama-3's long-context abilities. Equipping Llama-3-8b/70b with SelfExtend (arxiv.org/pdf/2401.01325…), we test their in-context learning abilities on two long tasks: DialogRe and FewNerd from the LongCIL benchmark (arxiv.org/pdf/2404.02060…) @WenhuChen @TianleLI123.…

0 23 72 17K 31

HongyeJ @serendip410

6 days ago

We tested our SelfExtend (arxiv.org/pdf/2401.01325…) for Llama-3-8B/70B-Instruct on the new challenging long-context benchmark Ada-Eval (arxiv.org/abs/2404.06480). The task is selecting the best answer from candidates. The results are pretty good! 🌟 Highlights: 1: Equipped with…

0 8 14 805 1

Jingfeng Yang @JingfengY

7 days ago

Impressive results using self-extend in embedding models! Refer to table 3.

Aran Komatsuzaki @arankomatsuzaki

a week ago

4 35 210 40K 123

0 0 6 1K 1

Yijia Shao @EchoShao8899

7 days ago

Since launching STORM code & web preview, thousands have tried it & offered feedback. - Can I run STORM with open LMs? - Can I change its report style? - Can I contribute to new info source support? Yes! We refactored our codebase for smoother running, customization & dev! 🔗🧵

6 16 103 26K 67

Dawei Zhu @dwzhu128

2 weeks ago

🚀Excited to share our new paper "LongEmbed: Extending Embedding Models for Long Context Retrieval". We introduce the LongEmbed benchmark, explore context extension of existing embedding models, and release E5-Base-4k & E5-RoPE-Base. Paper: arxiv.org/abs/2404.12096

1 7 25 2K 8

elvis @omarsar0

2 weeks ago

How Faithful are RAG Models? This new paper aims to quantify the tug-of-war between RAG and LLMs' internal prior. It focuses on GPT-4 and other LLMs on question answering for the analysis. It finds that providing correct retrieved information fixes most of the model…

11 118 573 75K 561

lmsys.org @lmsysorg

3 weeks ago

🔥Exciting news -- GPT-4-Turbo has just reclaimed the No. 1 spot on the Arena leaderboard again! Woah! We collect over 8K user votes from diverse domains and observe its strong coding & reasoning capability over others. Hats off to @OpenAI for this incredible launch! To offer…

54 210 1K 619K 278

Jingfeng Yang @JingfengY

3 weeks ago

Verified that our Rufus could code :) , as I stated in my original tweet thread x.com/jingfengy/stat… you would expect even better coding and general-purpose models and agents from us :)

Mert @mertdumenci

3 weeks ago

41 669 6K 506K 621

1 0 6 1K 1

Yijia Shao @EchoShao8899

3 weeks ago

Thrilled to announce we’ve received IRB approval to launch our web demo of STORM at storm.genie.stanford.edu! 🌪️ While we’ve analyzed its limitations in our paper, we’re eager to kick off a real-world exploration. Try it out, and give us your feedback directly through the demo!

Yijia Shao @EchoShao8899

2 months ago

41 205 1K 307K 1K

7 30 186 40K 173

Jingfeng Yang @JingfengY

3 weeks ago

Totally agree with this, as I raised the question in my earlier blog post jingfengyang.github.io/alignment : “How to improve language agents’ capabilities as a whole, considering there is no moat for current LLM-driven agent frameworks? The moat is still the fundamental LLM capability,…

Jim Fan @DrJimFan

3 weeks ago

27 156 968 178K 525

2 10 31 7K 22

AK @_akhaliq

3 weeks ago

Social Skill Training with Large Language Models People rely on social skills like conflict resolution to communicate effectively and to thrive in both work and personal life. However, practice environments for social skills are typically out of reach for most people.

2 54 240 117K 169

Xiaotian (Max) Han @XiaotianHan1

3 weeks ago

SelfExtend, without further training, upgrades Mistral-inst-v0.1 to match the performance level of its successor, v0.2, in qa tasks. therefore, the value of SelfExtend is at least equivalent to the training cost of Mistral-inst-v0.2?

0 4 15 2K 6

Anthropic @AnthropicAI

4 weeks ago

New Anthropic research paper: Many-shot jailbreaking. We study a long-context jailbreaking technique that is effective on most large language models, including those developed by Anthropic and many of our peers. Read our blog post and the paper here: anthropic.com/research/many-…

83 348 2K 500K 872

Akari Asai @AkariAsai

11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃‍♀️🧗‍♀️🍳

Bill Yuchen Lin 🤖 @billyuchenlin

6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Yu Su @ysu_nlp

6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biological

Yao Fu @Francis_YAO_

14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Tao Yu @taoyds

3K Followers 815 Following @XLangNLP lab, asst. prof. @HKUniversity. prev. postdoc @uwnlp; phd @Yale; intern @MSFTResearch, @SFResearch. he/him 🌈

Weijia Shi @WeijiaShi2

5K Followers 968 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvym

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Xi Ye @xiye_nlp

2K Followers 304 Following CS PhD student @UTAustin. I study NLP, particularly explanations. I sometimes make memes.

Michi Yasunaga @michiyasunaga

3K Followers 867 Following CS PhD @Stanford working on language models and multimodal models. Previously @Meta @GoogleDeepMind @Yale

Diyi Yang @Diyi_Yang

14K Followers 2K Following Assistant Professor @Stanford CS @StanfordNLP @StanfordAILab. Formerly @GeorgiaTech. Computational Social Science & NLP

Wenhu Chen @WenhuChen

11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.

Vivek Gupta @keviv9

2K Followers 5K Following PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml

Yizhong Wang @yizhongwyz

3K Followers 1K Following CS PhD student @uwcse @uwnlp. NLP/ML

Jie Huang @jefffhj

4K Followers 568 Following Ph.D. Candidate at UIUC🌽; Formerly @GoogleDeepmind @NVIDIAAI @AmazonScience. #NLProc Large Language Models

Hanjie Chen @hanjie_chen

2K Followers 365 Following Incoming Assistant Professor @RiceCompSci, Postdoc @jhuclsp, working on Trustworthy AI/NLP/ML, PhD @CS_UVA, former intern @allen_ai, @MSFTResearch, @IBM

Dinghuai Zhang 张鼎.. @zdhnarsil

2K Followers 1K Following PhD student at @Mila_Quebec. Ex intern at FAIR Labs @MetaAI. Previous math undergraduate at @PKU1898.

Canwen Xu @XuCanwen

2K Followers 393 Following Member of Technical Staff @ https://t.co/60CJY0lbJL; PhD @UCSanDiego 🏄; Formerly @Microsoft @GoogleAI @huggingface 🤗. RT & like ≠ endorsements. Views are my own. He/him

Qingxiu Dong @qx_dong

841 Followers 592 Following PhD student @PKU1898. Research Intern @MSFTResearch Asia.

Chao Zhang @chaozhangcs

466 Followers 393 Following Assistant Professor @ Georgia Tech CSE LLM, Uncertainty, AI for science

Song Mei @Song__Mei

1K Followers 571 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher working on foundations of AI and deep learning.

Saurabh Srivastava @_saurabh

830 Followers 375 Following Research in reasoning for better program synthesis (PhD, Postdoc, YC)

Allen pang @Allenpang123456

1 Followers 15 Following

Akshat Gupta @akshatgupta57

194 Followers 721 Following CS PhD, UC Berkeley

Rohan Paul @rohanpaul_ai

13K Followers 1K Following ML Engineer (e/acc) 📌 https://t.co/x0IIWfnOt8 🚀 https://t.co/QEO4CKRl1b Open LLMs is Happiness 💡 Ex Deutsche & HSBC. DM for collaboration.

Lenna Register @RegistLenn

58 Followers 5K Following

huansong @huansong514

7 Followers 172 Following

Fatih saidi @fatih_said87797

2 Followers 42 Following

Yasmine @bdwy240434

1 Followers 119 Following

James Chan @JamesChan736527

120 Followers 47 Following Seize opportunities with speed! DM for exclusive promotions. Swift action breeds success. Limited time: get 2, Pay for 1.

0xlambdaZ @r2kluh

64 Followers 2K Following Trade / Defi / Gamefi / Metaverse

nisten @nisten

10K Followers 5K Following fullstack-dev democratizing intelligence @skunkworks_ai | 🦝.ai | prev https://t.co/68jAlAVBKR |

Roel Van de Paar @RoelVandePaar

710 Followers 302 Following

Kyle 'esSOBi' Stone @essobi

6K Followers 3K Following Hyperlexical Polymath Savant – GenTech / AI Consultant / CTO @ https://t.co/s7KzUOWpY5 - EX-Heroku Trust and Security. Bringing AGI to the public. GPT-5

𝕋𝕒𝕥𝕤𝕦�.. @tatsuru_kikuchi

365 Followers 3K Following Research Officer at Faculty of Economics, The University of Tokyo. Keywords: Entrepreneur/OpenAI/Quantum/Crypto/Analytics/Consulting. Views are my own. ⁡

Guy Swann ⚡️| Act.. @TheGuySwann

81K Followers 3K Following Liberty is a technology problem • Host of @BitcoinAudible, @Ai_Unchained • Pro Memecraft • Audiobook Narrator

Peng Wu @DPZH2527

49 Followers 640 Following INFJ-A 终身学习丨数字营销丨认知行为丨机器学习丨思维模型丨Ai商用探索

Yin-Hong Cao @caoyinhong

141 Followers 1K Following Postdoc in Jiayang Li Lab, Institute of Genetics and Developmental Biology, CAS. Focus on the Multi-omics of dandelions & rice🌱🌾Recruiting Top-Tier Talents👇

aqqq_hush! @AqqqHush

32 Followers 77 Following For fun

Parallel College @HenryWang550879

109 Followers 1K Following a new method to learn AI and Prompt Engineering

emanon @JianSuji

67 Followers 1K Following

Markus Junginger @greenrobot_de

1K Followers 408 Following Distributed and on-device data/AI. Cofounder/CTO @objectbox_io.

Nandan Shettigar @mfflnando

228 Followers 1K Following #TAMU2020

HyperLiberalism @hyperliberalism

317 Followers 1K Following There is only one sex: the human sex.

Gorden Sun @Gorden_Sun

17K Followers 1K Following 产品经理，只发AI相关信息，个人维护的AI资讯日报↓

zirui @zirui3

36 Followers 948 Following

Siyuan Yu @cadaleyu

35 Followers 763 Following MSc @UAlberta @AmiiThinks Economics and Computation, Algorithmic Game Theory, Decision Making under Uncertainty

Perry @kosh516

275 Followers 3K Following

Zirui Liu @ziruirayliu

53 Followers 69 Following Final year PhD student at Rice

Yifei Hu @hu_yifei

311 Followers 375 Following Ph.D. Candidate @LifeAtPurdue | NLP | LLM | UX | Programmer On job market for any AI related industry/academia roles

Ward Plunet @StartupYou

129K Followers 110K Following Phd in Neuroscience looking at the intersection between machine learning and neuroscience #machinelearning #AI #neuroscience

Daniel Israel @danielmisrael

240 Followers 2K Following PhD Student Studying AI/ML @UCLA

Jiefeng Chen @jiefengchen1

336 Followers 529 Following Research Scientist at Google | Working on LLM Research.

𝔽_un @FF_un1

648 Followers 8K Following have fun

Alo @Hal90910

0 Followers 2K Following

Pensé FFun @inftyCategory

100 Followers 6K Following

Alyssa, Yi CHENG @YiCheng77783310

86 Followers 207 Following Ph.D. student, working on NLP for social good and conversational AI.

Shay Zavala @ShayZavala36610

75 Followers 5K Following

Arjun Srivastava @arjunsriv

63 Followers 1K Following AI, reinforcement learning, distributed systems something new @Woven_ToyotaJP prev - discovery @bookmyshow, cs @IITIOfficial

Cheng Jiayang @jchengaj

22 Followers 142 Following PhD student @HKUST

AugerDecay @augerdecay

133 Followers 3K Following 生活在两个世界之间，在旧的世界，它已经过去，但我们还清楚地记得它；新的世界，它正在到来，但我们还不完全理解它

jackmtlee @jackmtleee

48 Followers 536 Following 🇨🇦

Charly Wargnier @DataChaz

112K Followers 31K Following 🥑 DevRel @Streamlit @SnowflakeDB 🪶 𝕏 about #AI, #LLMs, #DataScience, #WebApps, #SEO 💕 My heart is open source 🌍 Nature Lover 👀 My views!

Shunyu Yao @ShunyuYao12

7K Followers 857 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)

Chetan Dhembre @ichetandhembre

1K Followers 4K Following CTO, co-founder @getloconow, ex @unacademy, @crowdfire

Tyne宇 @Tyne03720826082

110 Followers 3K Following

Kunal Katre @kunalkatre1995

81 Followers 1K Following Designer

Akari Asai @AkariAsai

11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃‍♀️🧗‍♀️🍳

AK @_akhaliq

310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Percy Liang @percyliang

49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist

Bill Yuchen Lin 🤖 @billyuchenlin

6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc

William Wang @WilliamWangNLP

14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Jason Wei @_jasonwei

57K Followers 491 Following ai researcher @openai

Yi Tay @YiTayML

29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼

Yann LeCun @ylecun

711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Yu Su @ysu_nlp

6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biological

Yao Fu @Francis_YAO_

14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

(((ل()(ل() 'yoav))).. @yoavgo

46K Followers 2K Following

Graham Neubig @gneubig

31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.

Aran Komatsuzaki @arankomatsuzaki

95K Followers 78 Following @TeraflopAI

Tao Yu @taoyds

3K Followers 815 Following @XLangNLP lab, asst. prof. @HKUniversity. prev. postdoc @uwnlp; phd @Yale; intern @MSFTResearch, @SFResearch. he/him 🌈

PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵

Kayo Yin @kayo_yin

8K Followers 560 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵

Weijia Shi @WeijiaShi2

5K Followers 968 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvym

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Xi Ye @xiye_nlp

2K Followers 304 Following CS PhD student @UTAustin. I study NLP, particularly explanations. I sometimes make memes.

Michi Yasunaga @michiyasunaga

3K Followers 867 Following CS PhD @Stanford working on language models and multimodal models. Previously @Meta @GoogleDeepMind @Yale

Song Mei @Song__Mei

1K Followers 571 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher working on foundations of AI and deep learning.

Rohan Paul @rohanpaul_ai

13K Followers 1K Following ML Engineer (e/acc) 📌 https://t.co/x0IIWfnOt8 🚀 https://t.co/QEO4CKRl1b Open LLMs is Happiness 💡 Ex Deutsche & HSBC. DM for collaboration.

Alex Albert @alexalbert__

19K Followers 401 Following DevRel + Prompting @anthropicai

nnnnnnzy @nnnnnnzy

23 Followers 502 Following XXXX

Jiefeng Chen @jiefengchen1

337 Followers 529 Following Research Scientist at Google | Working on LLM Research.

Alyssa, Yi CHENG @YiCheng77783310

86 Followers 207 Following Ph.D. student, working on NLP for social good and conversational AI.

Sicong (Sheldon) Huan.. @sicong_huang

715 Followers 1K Following Human, only human, infinitely human. Pretrained by evolution, finetuned by experience, prompted by situations. PhD student @UofT. Sharing ideas in AI&Psychology

Jiaxin Lin @jxlin_lock

3 Followers 3 Following UT Austin PhD

Chuang Gan @gan_chuang

4K Followers 456 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/oXP6pqXCpo

Vaibhav @vaibhav_p1234

429 Followers 898 Following Unraveling AI complexities, crafting user-friendly innovations. Bridging the gap between intricate tech and practical applications.

seshu bonam @seshubon

1K Followers 1K Following r/🔁 Reinforcement loops make everything better. building 🤹 Collaborative Ai spaces @ 🤖 https://t.co/jbHHOlOLTY

Amanda Askell @AmandaAskell

26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.

Han Fang @Han_Fang_

709 Followers 102 Following Research Scientist Manager at @meta GenAI, leading the LLM development of Meta AI

Esin Durmus @esindurmusnlp

3K Followers 383 Following Research Scientist @anthropicai. Previously Postdoc @stanfordnlp and PhD @cornellcis. Working on LLMs & evaluating their safety and impact on society. she/her.

Zac Kenton @ZacKenton1

1K Followers 1K Following Research Scientist in AI safety at DeepMind. Views are my own and don't represent DeepMind.

Wanjia Zhao @WanjiaZhao1203

201 Followers 299 Following Incoming CS PhD @Stanford; Math Undergrad @ZJU_CHINA; Research intern @MSFTResearch Asia | ML/AI4Sci

Song Jiang @songjiang24

439 Followers 698 Following CS PhD student at @UCLA. Machine Learning, LLM, Causality and Graph.

Xiang Yue @xiangyue96

2K Followers 434 Following Postdoc @LTIatCMU. PhD from Ohio State @osunlp. Training & evaluating foundation models. Pushing the boundaries of AI🤖. Previously @MSFTResearch.

Binyuan Hui @huybery

6K Followers 318 Following 🐚 Core maintainer at Qwen and OpenDevin. || Code Generation, Text-to-SQL, Large Language Models.

Zexuan Zhong @ZexuanZhong

1K Followers 562 Following PhD student @PrincetonCS, @princeton_nlp

Roger Grosse @RogerGrosse

10K Followers 751 Following

Aakanksha Chowdhery @achowdhery

7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change

Yuan Cao @caoyuan33

172 Followers 2K Following Google DeepMind

Zhehao Zhang @Zhehao_Zhang123

98 Followers 390 Following Graduate student at @Dartmouthcs ; Visiting Research Intern @SALT_NLP; Prev. Research Intern @MSFTResearch; Formerly undergrad from @sjtu1896; NLP&ML #NLProc

Dan Hendrycks @DanHendrycks

17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/ImageNet-C/MMLU/safety groundwork • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/YtGtDh1aAV

Collin Burns @CollinBurns4

11K Followers 276 Following Superalignment @OpenAI. Formerly @berkeley_ai @Columbia. Former Rubik's Cube world record holder.

Ellen Wu @zeqiuwu1

593 Followers 430 Following PhD student at UWNLP

Zhaocheng Zhu (on the.. @zhu_zhaocheng

2K Followers 287 Following Final-year PhD @Mila_Quebec. BSc @PKU1898. Intern @Google. Reasoning, large language models, knowledge graphs and ML systems. Photographer held back by CS/ML.

Nathan Lambert @natolambert

25K Followers 689 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentials

Haotian Liu @imhaotian

6K Followers 397 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearch

Qinyuan Cheng @cheng_qinyuan

265 Followers 380 Following Alignment researcher, PhD student at FNLP Lab @FudanUniv; MOSS team; Intern at Shanghai AI Lab; True Dota2 fans

Lichang Chen @LichangChen2

214 Followers 486 Following LLM PhD @umdcs | Student Researcher @GoogleAI & @GoogleDeepmind| Building the AGI | BS @ZJU_China | Opinions are my own.

Banghua Zhu @BanghuaZ

2K Followers 804 Following PhD @Berkeley_EECS, statistics, info theory, LLM, RL, Human-AI Interactions.

Tianbao Xie @TianbaoX

1K Followers 1K Following Ph.D. student of @XLangNLP lab and @HKUNLP group 2022. Advised by @taoyds and @ikekong . e/ia

Tristan Thrush @TristanThrush

3K Followers 761 Following PhD-ing @StanfordAILab @stanfordnlp. Advisor @PlaytestAI. Past: @ContextualAI, @huggingface, @Meta FAIR, @mitbrainandcog, @MIT_CSAIL, @NASAJPL

Weizhu Chen @WeizhuChen

2K Followers 199 Following VP in Microsoft

Yining Chen @cynnjjs

489 Followers 90 Following Alignment@OpenAI

Ari Holtzman @universeinanegg

3K Followers 2K Following PI @UChicagoCS & @DSI_UChicago, leader of Conceptualization Lab https://t.co/BVCT3zdaNV, Post-doc @Meta. We don’t really know much about language models...yet.

Yang Song @DrYangSong

10K Followers 887 Following Leading the Strategic Explorations team @OpenAI. Score-Based Models. Diffusion Models. Consistency Models.

Colin Raffel @colinraffel

30K Followers 655 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlp

Chenhao Tan @ChenhaoTan

4K Followers 902 Following Assistant professor @UChicagoCS @UChicago. Working on human-centered AI, NLP, CSS at @ChicagoHAI, also part of @ChicagoNLP. DM for Postdoc/PhD opportunities.

Le Hou @Hou_Le

1K Followers 135 Following Computer Sciencer, Transformer, StarCrafter.

James Villarrubia @james_mtc

24K Followers 8K Following Tech, AI, and education startup nerd. Former @WhiteHouse, @DeptofDefense, @TheJusticeDept wonk. Now an Innovation Fellow in AI @NASA. Tweets are my own.

Taiwei Shi @taiwei_shi

511 Followers 262 Following Ph.D. student @nlp_usc. Formerly @GeorgiaTech @USC_ISI. NLP & Computational Social Science.

Ella Minzhi Li @EllaMinzhiLi

145 Followers 105 Following CS PhD student at NUS @wing_nus 🇸🇬, incoming visiting PhD at Stanford @stanfordnlp🌲, NLP researcher📒

Yutong Bai @YutongBAI1002

3K Followers 397 Following EECS Rising Star, 2023 Apple Scholar, Visiting PhD @berkeley_ai, Intern @GoogleAI Brain team @MetaAI (FAIR Labs), CS PhD @JHUCompSci

Kai-Fu Lee @kaifulee

1.5M Followers 658 Following #AI Expert, CEO of @01ai_yi and Chairman of 创新工场 @sinovationvc, former President of Google China, Author of AI 2041 and NYT Bestseller AI Superpowers

Cheng Lu @ChengLu05671218

1K Followers 85 Following Member of technical staff @OpenAI. PhD @Tsinghua_Uni. Interested in diffusion models.

Heng-Tze Cheng @HengTze

2K Followers 119 Following Director of Gemini Bard Research @GoogleDeepMind | Lead of LaMDA LLM & Conversation AI | Worked on Duplex, TensorFlow, Wide & Deep Learning | We're hiring!

dilara @dilarafsoylu

128 Followers 2K Following nowadays a cs phd @stanford

Alex Reibman 🖇️ @AlexReibman

a day ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Ever since OpenInterpreter, we've all been wondering just how effective agents can be if you give them a computer. Now we have a proper benchmark. Let's take a look (🧵):

11 76 500 91K 705

Jason Wei @_jasonwei

17 hours ago

Enjoyed this paper that plots emergent abilities with pretraining loss on the x-axis, which is actually a suggestion that @OriolVinyalsML also made a few years back: arxiv.org/abs/2403.15796 The paper uses intermediate checkpoints to plot a variety of pretraining losses. For some…

7 41 296 39K 255

Rohan Paul @rohanpaul_ai

21 hours ago

Nice paper from Microsoft - "LongEmbed: Extending Embedding Models for Long Context Retrieval" 🔥 ✨ While the context limit of LLMs has been pushed beyond 1 million tokens, embedding models are still confined to a narrow context window not exceeding 8k tokens, refrained from…

2 8 26 2K 18

Saurabh Srivastava @_saurabh

2 months ago

More than 50% of the reported reasoning abilities of LLMs might not be true reasoning. How do we evaluate models trained on the entire internet? I.e., what novel questions can we ask of something that has seen all written knowledge? Below: new eval, results, code, and paper.…

51 237 1K 476K 1K

Qian Huang @qhwang3

2 days ago

A video on how an agent can improve an ML model on MLAgentBench: youtube.com/watch?v=s9NANr…

1 1 10 1K 2

AK @_akhaliq

a month ago

LLM Agent Operating System The integration and deployment of large language model (LLM)-based intelligent agents have been fraught with challenges that compromise their efficiency and efficacy. Among these issues are sub-optimal scheduling and resource allocation of agent

7 84 410 62K 245

Yongfeng Zhang @yongfengzhang9

4 weeks ago

Introducing #AIOS, the world's first LLM Agent Operating System. AIOS embeds LLM into the OS as the brain, enabling an operating system "with soul". Paper1: arxiv.org/abs/2403.16971 Paper2: arxiv.org/abs/2312.03815 GitHub: github.com/agiresearch/AI… Discord: discord.gg/aUg3b2Kd

3 5 21 2K 9

yi 🦛 @agihippo

3 days ago

> phi-3 claims: better than mixtral 8x7B on benchmarks > phi-3 reality: worse than mistral 7b on lmsys you cannot cheat the scaling gods. very exciting 49 place. 🥲

1 8 117 8K 14

Nathan Lambert @natolambert

3 days ago

Snowflake's Arctic LLM team must literally be cooking

4 6 117 14K 18

Adam Jermyn @AdamSJermyn

4 days ago

Some small updates from the Anthropic Interpretability team: transformer-circuits.pub/2024/april-upd…

2 16 117 74K 91

Chris Olah @ch402

3 days ago

Scaling laws for dictionary learning! transformer-circuits.pub/2024/april-upd…

2 19 212 50K 131

Sam Bowman @sleepinyourhat

3 days ago

Very excited to see this come out:

Jacob Pfau @jacob_pfau

4 days ago

40 179 1K 248K 907

0 14 143 29K 93

Alex Albert @alexalbert__

2 weeks ago

@mattshumer_ Hey Matt, appreciate you bringing this to our attention. We haven't modified any of the Claude 3 models since we launched them. On claude.ai, there are currently two layers that may contribute to perceived model performance: our T&S measures (standard mechanisms…

30 22 383 87K 64

Junyang Lin @JustinLin610

4 days ago

Great to be On HN!

3 6 105 8K 5

Yijia Shao @EchoShao8899

4 days ago

Alex Albert @alexalbert__

5 days ago

STORM by @angelina_magr @MehdiAllahyari Implementation of the paper STORM (Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking) -- uses Claude + sub-agents to write long-form articles. github.com/angelina-yang/…

4 11 116 67K 186

4 18 142 45K 180

Yao Fu @Francis_YAO_

5 days ago

From Claude100K to Gemini10M, we are in the era of long context language models. Why and how can a language model utilize information at any input location within a long context? We discover retrieval heads, a special type of attention head responsible for long-context factuality