Aman Madaan @aman_madaan

@xai, PhD Candidate @LTIatCMU madaan.github.io Pittsburgh Joined February 2010

Tweets

402
Followers

1K
Following

481
Likes

1K

xAI @xai

3 weeks ago

👀 x.ai/blog/grok-1.5v

599 1K 7K 22.9M 879

Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year: Reflection, Tool use, Planning and Multi-agent collaboration. Instead of having an LLM generate its final output directly, an agentic workflow prompts…

88 578 3K 435K 3K

Ruohong Zhang @RuohongZhang

4 weeks ago

[p1] 🐕Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward🐕 Paper link: arxiv.org/pdf/2404.01258… page: github.com/RifleZhang/LLa… How to effectively train video large multimodal Model (LMM) alignment with preference modeling?

2 15 65 18K 34

Download Image

Yuhuai (Tony) Wu @Yuhu_ai_

a month ago

Our latest reasoning update. 24%->50% on MATH from Grok 1 to 1.5.

xAI @xai

a month ago

Our latest reasoning update. 24%->50% on MATH from Grok 1 to 1.5.

685 1K 7K 20.7M 563

29 35 355 28K 13

xAI @xai

a month ago

x.ai/blog/grok-1.5

685 1K 7K 20.7M 563

Jimmy Ba @jimmybajimmyba

a month ago

based and 🔓 wanna help accelerate the next Grok? looking for builders: — Rust/Jax/Kube infra engineers — front-end/full-stack engineers x.ai/careers

Grok @grok

a month ago

based and 🔓 wanna help accelerate the next Grok? looking for builders: — Rust/Jax/Kube infra engineers — front-end/full-stack engineers x.ai/careers

1K 2K 16K 9.7M 812

39 96 446 90K 53

Yuhuai (Tony) Wu @Yuhu_ai_

a month ago

Alright people check it out

Grok @grok

a month ago

Alright people check it out

1K 2K 16K 9.7M 812

4 7 125 13K 2

Igor Babuschkin @ibab

a month ago

x.ai/blog/grok-os

159 385 3K 347K 273

Aman Madaan @aman_madaan

2 months ago

Really nice work! Part of it is also quite simple to implement in just a few (<100) lines with torch/hf. Here is a notebook that implements and runs algorithm 1 in the paper, and correctly guesses 4096 as one of the candidates for `h` for `mistralai/Mistral-7B-v0.1`. Works…

Aran Komatsuzaki @arankomatsuzaki

2 months ago

17 151 972 237K 662

Download Image

0 2 35 5K 10

Download Image

Shrimai @shrimai_

2 months ago

🚀Introducing Nemotron-4 15B by @nvidia! 🎉 With 15B parameters and trained on 8T tokens, it's impressive in multilingual AI. Outperforms all similarly-sized models and dominates in multilingual tasks, even surpassing models 4x larger! #NVIDIA #Nemotron4 arxiv.org/pdf/2402.16819…

2 29 124 14K 25

Download Image

Ge Zhang @GeZhang86038849

2 months ago

[1/n] 🚀 Excited to share our latest work on OpenCodeInterpreter! With a blend of execution results and human feedback, we've achieved significant advancements in code generation. Here are the key points: ✨ Introducing OpenCodeInterpreter - a leap in iterative code refinement.…

13 61 219 145K 133

Download Image

Yao Fu @Francis_YAO_

2 months ago

Frontier models all have at least 100k context length, Gemini 1.5 has even 1m context. What about research and open source? Introducing Long Context Data Engineering, a data driven method achieving the first 128k context open source model matching GPT4-level Needle in a…

8 70 473 83K 315

Download Image

Aman Madaan @aman_madaan

3 months ago

For many tasks, there is usually one correct answer, and *many* ways to be wrong. But mistakes can be informative, too! LEAP uses this idea to automatically draft a few "principles" for every task (e.g., two `not` operations cancel out in boolean algebra). These principles are…

Uri Alon @urialon1

3 months ago

5 21 111 21K 58

Download Image

0 4 28 5K 9

Swaroop Mishra @Swarooprm7

3 months ago

In-Context Principle Learning can potentially transform instruction-tuning 🔥. Here's how: 🧠 Long-form instructions are back! Instructions in its original form were longer and represented valuable task-specific knowledge, that's how they were different from prompts. For…

AK @_akhaliq

3 months ago

1 32 169 49K 100

Download Image

1 11 59 11K 35

Download Image

Uri Alon @urialon1

3 months ago

📢New paper : "In-Context Principle Learning from Mistakes" Instead of prompting using only *correct* few-shot examples, we intentionally make *mistakes*, and then learn "principles" or "lessons" from them. Lead by @tianjun_zhang @aman_madaan @luyu_gao arxiv.org/pdf/2402.05403…

AK @_akhaliq

3 months ago

1 32 169 49K 100

Download Image

5 21 111 21K 58

Download Image

AK @_akhaliq

3 months ago

In-Context Principle Learning from Mistakes paper page: huggingface.co/papers/2402.05… In-context learning (ICL, also known as few-shot prompting) has been the standard method of adapting LLMs to downstream tasks, by learning from a few input-output examples. Nonetheless, all…

1 32 169 49K 100

Download Image

Teknium (e/λ) @Teknium1

3 months ago

Today I have a huge announcement. The dataset used to create Open Hermes 2.5 and Nous-Hermes 2 is now PUBLIC! Available Here: huggingface.co/datasets/tekni… This dataset was the culmination of all my work on curating, filtering, and generating datasets, with over 1M Examples from…

112 316 2K 232K 876

Download Image

Jeremy Howard @jeremyphoward

3 months ago

I used to find writing CUDA code rather terrifying. But then I discovered a couple of tricks that actually make it quite accessible. In this video I introduce CUDA in a way that will be accessible to Python programmers, and I even show how to do it all in @GoogleColab!

36 408 3K 206K 3K

Download Video

Sean Welleck @wellecks

3 months ago

Teaching a new course on Neural Code Generation with @dan_fried! cmu-codegen.github.io/s2024/ Here is the lecture on pretraining and scaling laws: cmu-codegen.github.io/s2024/static_f…

3 74 408 37K 252

Download Image

Zhiqing Sun @EdwardSun0909

3 months ago

Our paper on ✨ Self-Aligning Language Models via RLAIF ✨ has been officially accepted at @iclr_conf 2024! We're thrilled to share our insights in Vienna. Stay tuned for self-aligning advancements in LLMs. #ICLR2024 See you there! 🌍🚀

Zhiqing Sun @EdwardSun0909

7 months ago

5 89 295 92K 173

Download Image

1 17 95 13K 22

@NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

* Research Scientist @GoogleDeepMind
* #NLProc research
* PhD from @LTIatCMU
* Amateur woodworker, scuba diver, foosball player

Shruti Rijhwani @shrutirij

4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player

Graham Neubig @gneubig

31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.

Founder/CEO @codegen. Tweets about AI, computing, and their impacts on society. Previously did startups, @palantir, @stanford. Not a pseudonym.

Jay Hack @mathemagic1an

37K Followers 3K Following Founder/CEO @codegen. Tweets about AI, computing, and their impacts on society. Previously did startups, @palantir, @stanford. Not a pseudonym.

Harrison Chase @hwchase17

54K Followers 410 Following @LangChainAI, previously @robusthq @kensho MLOps ∪ Generative AI ∪ sports analytics

PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Yao Fu @Francis_YAO_

14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Uri Alon @urialon1

2K Followers 510 Following Research Scientist @GoogleDeepMind

Sam Whitmore @sjwhitmore

12K Followers 2K Following building @newcomputer. not a cat (or a man) in real life. I like to run a lot! @kensho @harvard @StuyNY

Stanford CS PhD student @stanfordnlp @StanfordAILab. Master's from Carnegie Mellon @LTIatCMU. NLP, Computer Vision, Machine Learning, and AI research.

Steven Feng @stevenyfeng

1K Followers 275 Following Stanford CS PhD student @stanfordnlp @StanfordAILab. Master's from Carnegie Mellon @LTIatCMU. NLP, Computer Vision, Machine Learning, and AI research.

Luyu Gao @luyu_gao

1K Followers 241 Following PhD candidate @CarnegieMellon @LTIatCMU On the job market for full-time industry position.

Head of NLP, CTO office, @Bloomberg. (he/him)

Generating natural language, one word at a time. Also making sense of that language afterwards. views my own

Sebastian Gehrmann @sebgehr

5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my own

Gabriel Ilharco @gabriel_ilharco

4K Followers 1K Following Building cool things @xAI. Prev. PhD at UW, Google AI

I’m Alex Graveley, creator of GitHub Copilot, AI Tinkerers, Dropbox Paper, MobileCoin, and Hackpad.
Building @ai_minion
Hiring https://t.co/nsHar8OLPC

Alex Graveley @alexgraveley

31K Followers 933 Following I’m Alex Graveley, creator of GitHub Copilot, AI Tinkerers, Dropbox Paper, MobileCoin, and Hackpad. Building @ai_minion Hiring https://t.co/nsHar8OLPC

Aakanksha Chowdhery @achowdhery

7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change

PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml

Vivek Gupta @keviv9

2K Followers 5K Following PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml

Stephan Baasch @stbaasch

86K Followers 7K Following Interested in investments and technology.

Tyman Mayo @tymanmayo2

871 Followers 2K Following Doesn't matter. I'm fulfilled and relaxed in Colorado.😆

Shashank Sangar @ShashankTesla

16 Followers 204 Following Recruiting at Tesla AI for Core Autonomy (Autopilot & Optimus)

PollyWylde @tsj0NsqlRB91B

0 Followers 111 Following

Michi Yasunaga @michiyasunaga

3K Followers 869 Following CS PhD @Stanford working on language models and multimodal models. Previously @Meta @GoogleDeepMind @Yale

Cheng-Kuang Wu @brianckwu

4 Followers 37 Following

ElmaSenior @Ty1t20vYhl0ReD

0 Followers 181 Following

jj @punchgod_7

275 Followers 3K Following always guard up. ( ง︡'-'︠)ง

🍁 Wissenschaft ist der neueste Stand bewiesener Irrtümer! 🕴️Autodidakt ⚕️Cannabispatient & -Sommelier ✨ 𝕏Ɖ 🧬 #teamscience 🔬Do Only Good Everyday 🐕

Sir Mo van da Weed �.. @can420nabis

421 Followers 1K Following 🍁 Wissenschaft ist der neueste Stand bewiesener Irrtümer! 🕴️Autodidakt ⚕️Cannabispatient & -Sommelier ✨ 𝕏Ɖ 🧬 #teamscience 🔬Do Only Good Everyday 🐕

Civocim @civocim

220 Followers 481 Following 🇺🇸🇺🇸🇺🇸

super intelligence @eacc72

12 Followers 688 Following GPT6 is a Level 2 AGI and will be released in 2025

Virgil Meridith @VirgMerid

71 Followers 5K Following

Andrew Thompson @AndrewT65390500

312 Followers 374 Following Christian Conservative 🍊#1a + #2a = God-given non-negotiable rights to reject totalitarianism and tyranny.

phd student @Mila_Quebec | ms @CILVRatNYU @NYU_Courant | previously @GoogleDeepMind @AIatMeta @GoogleAI @labsdotgoogle @MSFTResearch @AdobeResearch

Abhinav Gupta @backpropper

793 Followers 5K Following phd student @Mila_Quebec | ms @CILVRatNYU @NYU_Courant | previously @GoogleDeepMind @AIatMeta @GoogleAI @labsdotgoogle @MSFTResearch @AdobeResearch

Sahil Antil @oxshitantil

16 Followers 804 Following Founder @kavachbuilders @foodkavach @arqaifashion

Essence @nick88886666

140 Followers 251 Following of life $TSLA #KpopIdol #JoyofCompoundInterest

MAB氏 @MAB1791652

1 Followers 36 Following

Ads ads @Adsads252800

0 Followers 16 Following

Weloop @Weloop_official

17 Followers 72 Following Download “Weloop” to be a part of your friends circle

⌗ Innovator-in-Chief ⇢ ❍ne World ✍︎ Investigative Journalist & Director of Open Records Strategy ⇢ AtNight Media ⌇ The New Way ® | One World 🌍

nik t. hatziefstathio.. @nikthehat

40K Followers 4K Following ⌗ Innovator-in-Chief ⇢ ❍ne World ✍︎ Investigative Journalist & Director of Open Records Strategy ⇢ AtNight Media ⌇ The New Way ® | One World 🌍

Product of progressive public policy; raised by public libraries and public education that produced a passion for politics.

and apparently alliteration

Omair Shahid @OmairShahid

382 Followers 959 Following Product of progressive public policy; raised by public libraries and public education that produced a passion for politics. and apparently alliteration

YESHUA Ha'Mashiach (LORD Jesus Christ) is The Creator and The King of the Universe!

- For Elon Musk: I have monetization idea for X. Game changer! -

Ben A. Goldberg ™ �.. @BenAnaven

953 Followers 1K Following YESHUA Ha'Mashiach (LORD Jesus Christ) is The Creator and The King of the Universe! - For Elon Musk: I have monetization idea for X. Game changer! -

Connor Skalitzky @Connor_Ska

65 Followers 604 Following CS specializing in AI

Sahil Antil @oxshitantil1

43 Followers 642 Following

INGABO @lingaboh

53 Followers 109 Following

Building self-learning, multi-modal conversational AI w/ a lean team of A-players (exploring millions of hrs of call data + self-play + game theory principles)

Thomas Lancer @LancerThomas

441 Followers 1K Following Building self-learning, multi-modal conversational AI w/ a lean team of A-players (exploring millions of hrs of call data + self-play + game theory principles)

paul @wanggnoy

34 Followers 1K Following

Doran Date @Siafu_Krotan

26 Followers 3K Following ドラン伊達氏 🇺🇸 𝐘𝐨𝐮𝐫 𝐚𝐫𝐦𝐬 𝐭𝐨𝐨 𝐬𝐡𝐨𝐫𝐭 𝐭𝐨 𝐛𝐨𝐱 𝐰𝐢𝐭𝐡 𝐆𝐨𝐝.

Dana Mahmood @deordered

24 Followers 731 Following Fine-tuning AI models oftentimes & practicing philosopher at other times.

Jannifer chigbu @riva_edgew11272

30 Followers 809 Following ELITE Business coach 1st female Fx trader & Educator 7 figure forex trader & mentor (mindset) peak parformance coach

ラムジーセネカ @Nandekore84

32 Followers 3K Following アメリカ人 🇺🇸 𝐀𝐜𝐭𝐢𝐨𝐧𝐬 𝐬𝐩𝐞𝐚𝐤 𝐥𝐨𝐮𝐝𝐞𝐫 𝐭𝐡𝐚𝐧 𝐰𝐨𝐫𝐝𝐬

runway model, cybersecurity CEO, dad, Supreme Court paralegal, escort, CIA consultant, TV director, landlord, poet, HERETIC.
engineered products used by 1% 🌎.

Mars (parody) @marknadal

6K Followers 369 Following runway model, cybersecurity CEO, dad, Supreme Court paralegal, escort, CIA consultant, TV director, landlord, poet, HERETIC. engineered products used by 1% 🌎.

Charles @Charlie10tang

38 Followers 72 Following

AMSARAJ N @amsaraj_n

439 Followers 2K Following

Terry Yue Zhuo @terryyuezhuo

215 Followers 663 Following No HumanEval. We have a better answer @BigCodeProject @sgSMU @seaAIL @Data61news @Monashinfotech

Yash Darji @YashDarji_

50 Followers 375 Following Dreamer

coffee & AI @realcoffeeAI

52 Followers 740 Following Sitting on a park bench scattering random seeds for the LLMs. I never bet against Elon.

Hassan Ghandour @h_ghandour96

9 Followers 73 Following 🇧🇷🇱🇧🇵🇾 Lead Principal Software Engineer

Claire Korea @theclairekorea

82 Followers 123 Following making friends @Character_AI | prev Data Engine @Tesla_AI | opinions are my own

Aditi @aditigaur_

106 Followers 421 Following

Spiderman 🇮🇳 @returnspiderman

1K Followers 6K Following Seek the truth | Everybody talks, very few listen | Watch out here comes the Spider-Man 😁 https://t.co/qwmEhH45SY

BreezyC50 @BreezyC50

128 Followers 404 Following Barça

Will Mac @ca_dryclean

6 Followers 122 Following

Cryptocracyyy @cryptocracyyy

87 Followers 307 Following

Kodom John @kodomm__

8 Followers 70 Following

Pablo Ubilla @pablo_ubilla7

724 Followers 4K Following I will tell you enough to keep you intrigued... but you shall never truly know me

@FBI Target #TwitterFiles For Censorship, Meteorologist, AI, Data Scientist, @USArmy Ret, #IC, Fmr TX Elected Official.
Seen @AmThoughtLeader
Heard @SeanHannity

John Basham @JohnBasham

80K Followers 13K Following @FBI Target #TwitterFiles For Censorship, Meteorologist, AI, Data Scientist, @USArmy Ret, #IC, Fmr TX Elected Official. Seen @AmThoughtLeader Heard @SeanHannity

SOT @SoloOrTroll

10K Followers 2K Following 22 | smite pro | twitch streamer | i love movies, tesla, robots, and technology 🦾🤖

Currently PhD at the University of Toronto. Fall 2023 student researcher at Google. Training sequence models. Recent: APE, STEVE-1, OpenWebMath, Llemma.

Keiran Paster @keirp1

1K Followers 638 Following Currently PhD at the University of Toronto. Fall 2023 student researcher at Google. Training sequence models. Recent: APE, STEVE-1, OpenWebMath, Llemma.

Dr. Jason Bourne @DR_BOURNE

4K Followers 7K Following Chief Information Security Officer (CISO)

Andrej Karpathy @karpathy

980K Followers 905 Following 🧑‍🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥

Sam Altman @sama

2.8M Followers 892 Following AI is cool i guess

Danish Pruthi @danish037

7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.

Professor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.

Yann LeCun @ylecun

712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Shruti Rijhwani @shrutirij

4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player

Graham Neubig @gneubig

31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.

AK @_akhaliq

310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Jay Hack @mathemagic1an

37K Followers 3K Following Founder/CEO @codegen. Tweets about AI, computing, and their impacts on society. Previously did startups, @palantir, @stanford. Not a pseudonym.

Harrison Chase @hwchase17

54K Followers 410 Following @LangChainAI, previously @robusthq @kensho MLOps ∪ Generative AI ∪ sports analytics

Greg Brockman @gdb

667K Followers 51 Following President & Co-Founder @OpenAI

Divyansh Kaushik @dkaushik96

4K Followers 3K Following Emerging tech and national security. DC/PGH. “An imported Indian immigrant,” @BreitbartNews.

Yao Fu @Francis_YAO_

14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Uri Alon @urialon1

2K Followers 510 Following Research Scientist @GoogleDeepMind

Jason Wei @_jasonwei

57K Followers 491 Following ai researcher @openai

hardmaru @hardmaru

285K Followers 1K Following Building Collective Intelligence @SakanaAILabs 🧠

Assistant Professor (he/him) @LTIatCMU/@SCSatCMU - embodied #NLProc

Stealing ideas from @_Hao_Zhu @viddivj @_Yingshan @SoYeonTiffMin, @FernJared @abitha___

Yonatan Bisk @ybisk

3K Followers 883 Following Assistant Professor (he/him) @LTIatCMU/@SCSatCMU - embodied #NLProc Stealing ideas from @_Hao_Zhu @viddivj @_Yingshan @SoYeonTiffMin, @FernJared @abitha___

Sam Whitmore @sjwhitmore

12K Followers 2K Following building @newcomputer. not a cat (or a man) in real life. I like to run a lot! @kensho @harvard @StuyNY

Aran Komatsuzaki @arankomatsuzaki

95K Followers 78 Following @TeraflopAI

Greg Durrett @gregd_nlp

6K Followers 752 Following CS professor at UT Austin. I do NLP most of the time. he/him

Colin Flaherty @colin__flaherty

338 Followers 248 Following AI researcher

Making GPUs go brrrr @augmentcode 🤖 Past: Research Scientist at Google Brain 🧠 IMO Silver Medalist 🥈 waiting for LLMs to beat me. Tweets are my own opinions.

Hieu Pham @hyhieu226

2K Followers 41 Following Making GPUs go brrrr @augmentcode 🤖 Past: Research Scientist at Google Brain 🧠 IMO Silver Medalist 🥈 waiting for LLMs to beat me. Tweets are my own opinions.

Zaid Sheikh @zdshkh11

31 Followers 103 Following Senior Research Programmer at Carnegie Mellon University

Mars (parody) @marknadal

6K Followers 369 Following runway model, cybersecurity CEO, dad, Supreme Court paralegal, escort, CIA consultant, TV director, landlord, poet, HERETIC. engineered products used by 1% 🌎.

Keiran Paster @keirp1

1K Followers 638 Following Currently PhD at the University of Toronto. Fall 2023 student researcher at Google. Training sequence models. Recent: APE, STEVE-1, OpenWebMath, Llemma.

Shunyu Yao @ShunyuYao12

7K Followers 858 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)

Devin Kim @devindkim

1K Followers 116 Following a real human bean. building intelligence @xAI

CS PhD cand @ucsbNLP 🌊🌴 @NSF GRFP
🧐analyzing semantics in generative lang/img AI models🤖
Big tech ex-intern. BS/MS @ASU 🌵🏜
🔜 @AMD opensrc GenAI RS intern

Michael Saxon @m2saxon

2K Followers 1K Following CS PhD cand @ucsbNLP 🌊🌴 @NSF GRFP 🧐analyzing semantics in generative lang/img AI models🤖 Big tech ex-intern. BS/MS @ASU 🌵🏜 🔜 @AMD opensrc GenAI RS intern

Jiayi Pan @pan_jiayipan

575 Followers 1K Following First year PhD student @Berkeley_AI/@BerkeleyNLP

Ze Liu @zeliu_

267 Followers 311 Following @xAI. Previously PhD @MSFTResearch (MSRA) & USTC.

Gil-Martin @RobertWringhim

2K Followers 583 Following Play is the exultation of the possible.

Nicolas Wörmann @NWormann

57 Followers 237 Following mathematics and computer science @lmu_muenchen ceo speedscale

Shehzaad Dhuliawala @shehzaadzd

343 Followers 909 Following PhD student at @ETH_en | Previously Research Engineer @MSFTResearch Montréal | Master's at @UMassCS. He/Him

Joe Fenton @JoeFenton

1K Followers 2K Following AI and investing 🤖📈 PM @MicrosoftAI Prev. Founding Product Manager @InflectionAI and PM @GoogleDeepMind and @GoogleAI

Ali Behrouz @behrouz_ali

914 Followers 846 Following Ph.D. Student @cornell, interested in machine learning.

Lunjun Zhang @ZhangLunjun

366 Followers 537 Following cs phd student @uoft, student researcher @GoogleDeepMind singularity requires singular focus

A somewhat-intelligent three-dimensional being at @xAI. Writer: https://t.co/pisunzyEVv. AI Filmmaker. Musician. Upcoming book: https://t.co/rBk0AMk1mF

Katia Karpenko @KatiaEarth

811 Followers 546 Following A somewhat-intelligent three-dimensional being at @xAI. Writer: https://t.co/pisunzyEVv. AI Filmmaker. Musician. Upcoming book: https://t.co/rBk0AMk1mF

Chris Zheng @ChrisZheng001

12K Followers 609 Following Creative content creator I Team player I Love and kindness I CTO ：）

Jaime Alonso @JaimeAlns

287 Followers 126 Following @xAI

Qian Huang @qhwang3

2K Followers 277 Following @xai | CS PhD student @StanfordAILab

Jiawei Liu @JiaweiLiu_

2K Followers 957 Following Simplifying the making of great software. PhD Student @plfmse @IllinoisCS.

Ethan Knight @eknight

7K Followers 219 Following 🏋️‍♂️ @xai | prev e2e @tesla, rl @openai

Haotian Liu @imhaotian

6K Followers 397 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearch

Manuel Kroiss @makro_ai

14K Followers 60 Following

Kyle Kosic @kylekosic

13K Followers 66 Following @xAI Previously @OpenAI

• Director of the Center for AI Safety (https://t.co/ahs3LYCpqv)
• GELU/ImageNet-C/MMLU/safety groundwork
• PhD in AI from UC Berkeley
https://t.co/rgXHAnYAsQ
https://t.co/YtGtDh1aAV

Dan Hendrycks @DanHendrycks

17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/ImageNet-C/MMLU/safety groundwork • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/YtGtDh1aAV

Fabio Aguilera-Conver.. @Faruletes

1K Followers 187 Following

Ting Chen @tingchenai

5K Followers 365 Following Bump up intelligence in all bit streams @xai. Previous @GoogleDeepmind, @GoogleBrain.

Sergey Ioffe @Sergey_xai

749 Followers 6 Following https://t.co/E7YNgwpalf

Xuechen Li @lxuechen

2K Followers 901 Following Building intelligence @xai. PhD @Stanford. Undergrad @UofT. Worked at @GoogleAI @MSFTResearch @Vectorinst. I go by Chen.

Prasanna Lahoti @_PrasannaLahoti

585 Followers 94 Following @xAI. previously @scale_AI.

Rutvik Makwana @rutvikwrites

908 Followers 772 Following AI Tutor @xai • Grokking @grok • Pharmaceutical Science • Cricket, Movies, Voracious Reader

omar @therealomaralfy

3K Followers 2K Following Mostly just having conversations with myself 🤷🏽‍♂️ @X

Gabriel Ilharco @gabriel_ilharco

4K Followers 1K Following Building cool things @xAI. Prev. PhD at UW, Google AI

Saeed Maleki @MalekiSaeed

474 Followers 110 Following

Lianmin Zheng @lm_zheng

4K Followers 439 Following CS Ph.D. @ UC Berkeley. Creator of Alpa, Vicuna, and Chatbot Arena. @lmsysorg

Harshita Diddee @ihsrahedid

642 Followers 698 Following LTI PhD @SCSatCMU | Prev: RF at @MSFTResearch | Interested in Data Quality Estimation

xiao sun @xiaosun86

2K Followers 93 Following

Jesik Min @jesikmin

416 Followers 347 Following validating 42 @xAI

Aditya Paliwal @VastoLorde95

527 Followers 85 Following I only read books that have pictures in them

ex-Google Brain, OpenAI, Meta
Scholar: https://t.co/iVycFw5dSX
New Blog: https://t.co/SLix8HqVeY
Old Blog: https://t.co/Ur3GWKoOzy

Yaroslav Bulatov @yaroslavvb

6K Followers 703 Following ex-Google Brain, OpenAI, Meta Scholar: https://t.co/iVycFw5dSX New Blog: https://t.co/SLix8HqVeY Old Blog: https://t.co/Ur3GWKoOzy

Yongchao Zhou @Yongchao_Zhou_

537 Followers 301 Following Build Intelligence @xai | ML PhD @UofT @VectorInst | Prev. @GoogleAI @GoogleDeepMind | Working on LLMs

Roger Grosse @RogerGrosse

10K Followers 751 Following

Mustafa Suleyman @mustafasuleyman

131K Followers 536 Following CEO, Microsoft AI | Author: The Coming Wave | Past: Co-founder, @InflectionAI & @GoogleDeepMind

Connor Leahy @NPCollapse

23K Followers 554 Following Hacker - CEO @ConjectureAI - Ex-Head of @AiEleuther - I don't know how to save the world, but dammit I'm gonna try

Biao Zhang @BZhangGo

621 Followers 279 Following Research Scientist @ Google. Past: PostDoc at UoE. PhD in NLP/MT @edinburghnlp. All opinions are my own.

Toolmaker. Software creator, optimizer and harmonizer.

Makes things work and fly at @ContextualAI

Training LLM/RAG/Generative AI/Machine Learning/Scalability

Stas Bekman @StasBekman

7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/Scalability

Pratyush Maini @pratyushmaini

1K Followers 340 Following Trustworthy ML | PhD student @mldcmu | Founding Member @datologyai | Prev. Comp Sc @iitdelhi

xAI @xai

997K Followers 36 Following

Yao Fu @Francis_YAO_

10 hours ago

Have to disagree with this point. I tend to view the needle in haystack as an **entry barrier**: if you cannot pass it, you are not even in the game. To be able to perform complex reasoning over long context, you should able to first be able to retrieve the information at any…

Ofir Press 🖋 @OfirPress

12 hours ago

There is no such thing as "long context performance". It just has no meaning. The needle in a haystack thing is almost a complete waste of time. End-to-end evaluation is always the answer.

1 0 12 9K 1

4 4 58 8K 16

Amir Yazdanbakhsh @ayazdanb

a day ago

Jeff Clune @jeffclune

a day ago

0 0 22 6K 1

Download Image

0 0 1 360 0

Download Image

Yao Fu @Francis_YAO_

2 days ago

In the age of large language models, I realized the only sentence I ever talked to Siri is "five minutes timer"

8 1 45 7K 1

Gytis Daujotas @gytdau

4 days ago

How does self-correction affect problem solving? In a toy transformer model that was trained to solve mazes, I found that performance reliably improved (!) by inserting mistakes and self-corrections into the training data.

14 33 200 22K 142

Download Gif

Hieu Pham @hyhieu226

6 days ago

One year ago, I left Google Brain (now DeepMind) to join a very early startup. We had fewer than 10 people at that time, and have grown many times since. Today, I am extremely proud to share our milestone. We are Augment. You can read about us here. techcrunch.com/2024/04/24/eri…

24 40 716 442K 353

Graham Neubig @gneubig

6 days ago

We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons

Frank Xu @frankxu2004

a week ago

On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io

1 17 110 39K 28

4 29 178 44K 61

Pei Zhou @peizNLP

6 days ago

@aman_madaan Thanks Aman!!

0 0 1 215 0

Pei Zhou @peizNLP

6 days ago

PhDone!!!! 👨‍🎓 08/2019-04/2024 What a journey 🥳🚞 I especially feel lucky to share this once-in-a-life-time moment with people I love ❤️ . And seeing my passion-driven research efforts being acknowledged by researchers I deeply admire 🌞!! Special thanks to my awesome committee…

34 10 191 13K 10

Download Image

Kaixin Ma @KaixinMa9

a week ago

Turns out that even SOTA MLLMs achieve near random accuracy on these visual IQ questions 🧐

yifan jiang @yifanji24618785

a week ago

How good are MLLM at solving IQ (abstract visual reasoning) problems? Check our new benchmark paper! MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning Paper: arxiv.org/pdf/2404.13591… Website: marvel770.github.io

0 1 2 438 0

0 0 5 266 0

Chunting Zhou @violet_zct

a week ago

@WenhuChen @Teknium1 I think the motivation of LIMA is not to quantify the number of SFT examples that is needed but to highlight (1) how important high quality SFT data is and (2) the superficial alignment hypothesis where pretrained LLM stores all the knowledge and can be easily tuned into an…

0 4 34 2K 1

Swaroop Mishra @Swarooprm7

2 weeks ago

The super exciting TED talk on the SixthSense technology by @pranavmistry 14 years back inspired me a lot in many ways over the years 🔥. Finally got a chance to meet him and discuss research 😍. The TED talk video which I have watched a thousand times: youtube.com/watch?v=YrtANP……

5 1 43 3K 1

Download Image

Aran Komatsuzaki @arankomatsuzaki

2 weeks ago

Can Language Models Solve Olympiad Programming? - Uses self-reflection and retrieval over episodic knowledge to boost the perf of GPT-4 on USACO from 8.7% pass@1 to 20.2% - Giving a small number of targeted hints solves most of the questions repo: github.com/princeton-nlp/… abs:…

7 43 194 23K 112

Download Image

Jing Yu Koh @kohjingyu

2 weeks ago

Honored to receive the 2024 Jane Street Graduate Research Fellowship! Thank you @JaneStreetGroup for the award and for organizing an amazing workshop! The best part of this was getting to meet PhD students working on algebraic geometry, cosmology, quantum algorithms, and more!

12 5 126 11K 8

Download Image

Hyeonbin Hwang @ronalhwang

2 weeks ago

🚨 New LLM Reasoning Paper 🚨 Q. How can LLMs self-improve their reasoning ability? ⇒ Introducing Self-Explore⛰️🧭, a training method specifically designed to help LLMs avoid reasoning pits by learning from their own outputs! [1/N]

8 55 295 35K 281

Download Image

Devendra Chaplot @dchaplot

2 weeks ago

We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…

27 188 1K 141K 287

Download Image

🇺🇦 Alex Polozov @Skiminok

2 weeks ago

Benchmarks are useful while they still provide signal. Even though every SOTA model has seen the involved PRs, their performance on the task is still under 20%. We can worry about leakage when we start succeeding at extracting task-related knowledge out of the model.

Aman Sanger @amanrsanger

2 weeks ago

SWE-bench is probably contaminated for frontier models (gpt-4/claude-3-opus). Given only the name of a pull request in the dataset, Claude-3-opus already knows the correct function to modify.

14 59 606 100K 155

Download Image

1 0 16 3K 3

Andrej Risteski @risteski_a

2 weeks ago

The folks at @OpenAI and @ericschmidt were kind enough to give @AdtRaghunathan and me a generous gift to better understand supervision with weak models. We are honored to be awarded, and are looking forward to the exciting work that will come out of this !

4 2 131 14K 14

Download Image

Greg Yang @TheGregYang

3 weeks ago

Looking for top engineers and designers passionate about harnessing our AI capabilities to create never-before-seen consumer products. 🛼 come roll w us! x.ai/careers

xAI @xai

3 weeks ago

👀 x.ai/blog/grok-1.5v

599 1K 7K 22.9M 879

13 34 383 45K 23

Toby Pohlen @TobyPhln

3 weeks ago

Some early results of our first vision model. It'll be integrated into the Grok chat in the medium term. A few other features will ship before that (likely very soon). Props to {@tingchenai, @gabriel_ilharco}. x.ai/blog/grok-1.5v

28 43 418 48K 20

Devin Kim @devindkim

3 weeks ago

excited to share that ive joined @xai! its only been 2 weeks, but the team is insanely stacked and the rate of progress is astounding. 📈📈 im looking forward to learning a lot and sharing everything i know about data and post-training with the team 🥳