Sang Michael Xie @sangmichaelxie

PhD student @StanfordAILab @StanfordNLP @Stanford advised by Percy Liang and Tengyu Ma. Prev: visiting @GoogleAI Brain; BS, MS Stanford '17.
cs.stanford.edu/~eix
Stanford, CA
Joined May 2019

Tweets: 358
Followers: 3K
Following: 709
Likes: 2K

Sang Michael Xie @sangmichaelxie

a month ago

Connect Later, our targeted fine-tuning method for robust+accurate models, tops the WILDS leaderboard for iWildCam and Camelyon17 and achieves SoTA on astronomical time-series tasks (3 very different domains)! arxiv.org/abs/2402.03325

Helen Qu @_helenqu

2 months ago

0 0 3 6K 1

1 1 8 6K 2

Helen Qu @_helenqu

2 months ago

today, gen AI performance is surprisingly robust to new data/tasks, even beating specialized models! the secret: training on large-scale unlabeled data. what can we as scientists learn from this? some thoughts on robustness & the power of the unlabeled data you already have:

1 1 8 2K 0

Aaron Lou @aaron_lou

2 months ago

Announcing Score Entropy Discrete Diffusion (SEDD) w/ @chenlin_meng @StefanoErmon. SEDD challenges the autoregressive language paradigm, beating GPT-2 on perplexity and quality! Arxiv: arxiv.org/abs/2310.16834 Code: github.com/louaaron/Score… Blog: aaronlou.com/blog/discrete-… 🧵1/n

19 132 673 152K 391

rishi @RishiBommasani

2 months ago

We need to rigorously reason about the benefits and risks of open foundation models. There is plenty of debate and speculation, animating lively policy conversations. To make progress, we have put out new work on the societal impact of open FMs crfm.stanford.edu/open-fms/

1 20 68 17K 19

Sang Michael Xie @sangmichaelxie

2 months ago

Great effort led by @AlbalakAlon to corral the wild west of LM data selection! A meta-issue: how do we make data work (esp. for pretraining) more accessible? Not everyone can train 7B LMs, but a first bar is to show that the benefits don't shrink with scale, at smaller scales.

Alon Albalak @AlbalakAlon

2 months ago

9 76 303 100K 266

1 1 19 3K 4

Sang Michael Xie @sangmichaelxie

2 months ago

Interestingly, pretraining on unlabeled source/target+finetuning doesn’t improve much over just supervised learning on source in iWildcam-WILDS. Correspondingly, the connectivity conditions on the success of contrastive pretraining for UDA (arxiv.org/abs/2204.00570) also fail!

Helen Qu @_helenqu

2 months ago

1 5 20 11K 13

1 5 26 9K 15

Ruoxi Jia @ruoxijia

3 months ago

The paper submission deadline has been extended to 2/11 AoE. Look forward to your submissions! https://t.co/CE4UPFZ9A5

1 23 108 45K 30

3 10 22 8K 5

Yuhuai (Tony) Wu @Yuhu_ai_

3 months ago

Euclidean geometry problems have been my favorite math puzzles since middle school. The most intriguing part of it is the creation of auxiliary lines, which opens a space for imagination and the freedom to explore various diagrams. Once a proof is found, these auxiliary lines…

trieu @thtrieu_

3 months ago

37 163 781 2.3M 364

178 178 919 1.0M 258

Sang Michael Xie @sangmichaelxie

3 months ago

Excited to co-organize this ICLR 2024 workshop! I think better data will be crucial for the next big advances in foundation models. The submission date is Feb 3 - details at sites.google.com/view/dpfm-iclr…

Ruoxi Jia @ruoxijia

3 months ago

1 23 108 45K 30

0 4 30 10K 5

Lucio Dery Jnr Mwinm @derylucio

4 months ago

Excited to announce the 2nd ME-FoMo workshop on understanding foundation models will be at ICLR 2024 , Vienna! Topics include pretraining, adaptation and emergence amongst many others. Paper deadline: Feb 3 Website : sites.google.com/view/me-fomo20… Open Review : tinyurl.com/2p6hzybr

1 6 40 9K 8

Sadhika Malladi @SadhikaMalladi

4 months ago

Announcing the 2nd Workshop on Mathematical and Empirical Understanding of Foundation Models (ME-FoMo) at ICLR 2024! Improving our understanding helps us advance capabilities and build safer, more aligned models. Paper deadline is Feb 3! Website: sites.google.com/view/me-fomo20…

0 15 106 31K 15

Darek Kłeczek @dk21

4 months ago

I’m a big fan of exploring data mixtures as a key ingredient for training better models - happy to find this poster presented by @sangmichaelxie on the topic!

1 15 68 15K 29

Volodymyr Kuleshov 🇺🇦 @volokuleshov

4 months ago

Loved this nice and simple idea for better data selection in LMs. First, use high level features to describe high-value data (eg textbook chunks). Then use importance sampling to prioritize similar data in a large dataset. ⁦@sangmichaelxie⁩

2 30 231 41K 148

Stephan Xie @stephofx

4 months ago

I'm at #NeurIPS2023 workshops today! Check out our work on high dimensional prediction and applications to online combinatorial optimization, extensive form games, and prediction sets! ⏱️ Talk 2:30-3, poster 3-4 📍OPT for ML w/ Georgy, Ramya, @Aaroth arxiv.org/abs/2310.17651

0 3 15 1K 3

Tri Dao @tri_dao

5 months ago

Transformers power most advances in LLMs, but its core attention layer can’t scale to long context. With @_albertgu, we’re releasing Mamba, an SSM architecture that matches/beats Transformers in language modeling, yet with linear scaling and 5x higher inference throughput. 1/

Albert Gu @_albertgu

5 months ago

53 424 2K 790K 1K

42 348 2K 528K 1K

Percy Liang @percyliang

49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Jason Wei @_jasonwei

56K Followers 490 Following ai researcher @openai

Ananya Kumar @ananyaku

4K Followers 469 Following Researcher at @openai Previously PhD at Stanford University (@StanfordAILab) advised by Percy Liang and Tengyu Ma

Behnam Neyshabur @bneyshabur

18K Followers 689 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpacking

rishi @RishiBommasani

4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModels

Dan Roy @roydanroy

45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)

Shane Gu @shaneguML

28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)

Akari Asai @AkariAsai

11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃‍♀️🧗‍♀️🍳

Sewon Min @sewon__min

7K Followers 642 Following PhD student at @uwcse @uwnlp

Kayo Yin @kayo_yin

8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵

Yann Dubois @yanndubs

4K Followers 1K Following PhD student @stanfordAILab | Prev: AI resident @metaai, @vectorinst, @CambridgeMLG

Ethan Caballero is bu.. @ethanCaballero

8K Followers 2K Following ML PhD student @Mila_Quebec ; previously @GoogleDeepMind

Miles Brundage @Miles_Brundage

43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.

Sharon Y. Li @SharonYixuanLi

7K Followers 657 Following Assistant Professor @WisconsinCS. Formerly postdoc @StanfordAILab, Ph.D. @Cornell. Making AI safe and reliable for the open world.

Jesse Mu @jayelmnop

5K Followers 581 Following Computational linguistics @AnthropicAI

Sara Hooker @sarahookr

39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.

Tim Dettmers @Tim_Dettmers

29K Followers 819 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.

Ofir Press @OfirPress

10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.

Alex Ratner @ajratner

5K Followers 548 Following @SnorkelAI @uwcse / prev @StanfordAILab – Interested in data management systems for machine learning, weak supervision, and impactful applications.

Umair Abid @umairabid_999

4 Followers 186 Following Data Scientist

Avinash Dwivedi @Avinash21396649

1 Followers 283 Following insanely Curious

Nick Cannon @inkymaze

6K Followers 2K Following vp growth @gauntlet_xyz. @aerafinance. poker → crypto ⇄ fintech.

G K @gauravkaul

430 Followers 3K Following VLSI|RTL|Computer Architecture| AI

hkpacjlcyh @cjlcyh50367y8v

29 Followers 1K Following We first transfer USDT to you TRC20, you return 90% to BEP20, you get 10% , 2K per day Our co hv a large amt of USDT need to from TRC20 convert to BEP20 network

Chip Huyen @chipro

92K Followers 443 Following Data processing on GPUs @VoltronData Designing ML Systems: https://t.co/G81hL2dWmr @designmlsys #AI x #GPU

Mihara @mihara88869272

18 Followers 712 Following RL, NLP, LLM for Intelligent Education.

Melihcan Erol @hsme1986

3 Followers 24 Following

MesubsetofRunionC @mesubsetof

35 Followers 426 Following

Arif Ahmad @arif_ahmad_py

248 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAI

DelMorganCo-Review @DelMorganReview

34 Followers 525 Following We seek to reduce the female founders funding gap to protect others from deceptive business practices like those exhibited by DelMorgan & Co.

kovariance @kovariance

68 Followers 2K Following

hanncx @hanncx

63 Followers 4K Following perpetual learning

Woodrow @Woodrow12465631

6 Followers 340 Following

Muhammad Imran @Muhamma55183541

3 Followers 127 Following

Eva Louise Marie Gabr.. @e681554349

9 Followers 3K Following

Achyuta Rajaram @AchyutaBot

267 Followers 400 Following 17 | mech interp @mit_csail | @atlasfellow '23 | STS 2024

I'mDust @cs__Henry

2 Followers 87 Following Try not to become a man of success, but rather try to become a man of value.

bourbakis @bourbakii

785 Followers 8K Following ☞ Math Platonist ☞ Views Mine & RT/LIKES = bookmark

Aryan Pandey (Look fo.. @AryanPa66861306

1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINO

Katie Kang @katie_kang_

1K Followers 416 Following PhD student @berkeley_ai

gabedaramola @gabedaramola

0 Followers 5K Following

Makya @Makya12345678

6 Followers 962 Following

Mathieu Ravaut @MatRavox

371 Followers 2K Following PhD candidate in NLP at @ntunlpsg w @JotyShafiq and @astarhq. Ex @layer6ai | @uoftcompsci | @centralesupelec

Sichao Liu @ErikLiuSe

45 Followers 289 Following

bellamy @solacebellamy

46 Followers 114 Following

tradernews.ai @tradernewsai

2K Followers 1K Following AI + MARKETS + NEWS THIS IS NOT INVESTMENT ADVICE

Ruidong Wu @RuidongWu

57 Followers 279 Following Researcher at @HelixonBio. Prev: @UofIllinois @MIT_CSAIL @Tsinghua_Uni.

Winnie Yeung @mimo90918

2 Followers 51 Following MLE @ Square

Tolga @standard_ai

18 Followers 2K Following 𝓝𝓾𝓵𝓵𝓲𝓾𝓼 𝓘𝓷 𝓥𝓮𝓻𝓫𝓪 https://t.co/8d1tBxpvKG

Bob0409 @hzxhx111

36 Followers 79 Following

jay @seekerum

160 Followers 887 Following

Yun Fu @fuyun

86 Followers 409 Following

Zeqian Bao @BaoZeqian18347

5 Followers 300 Following

Clayton @cthorrez

1K Followers 1K Following LLM applied scientist by day, esports data scientist for fun. Working on rating systems and benchmarks for esports (and LLMs?) I ❤️ paired comparison data

Zhen Xu @NehzUx

54 Followers 191 Following Buried in arxiv daily papers

Cloud Twitt @Twitt2Cloud

156 Followers 380 Following

Aymane @aymane_arfaoui

67 Followers 96 Following I own and grow software e-commerce businesses

George Grigorev @iamgrigorev

2K Followers 532 Following formerly generative ml @ snap, global talent interested in llms

Gaole He @HeGaole

217 Followers 452 Following PhD student at TU Delft @wisdelft, Human-centered AI.

Zhiyong Wang @Zhiyong16403503

380 Followers 2K Following Visiting Ph.D. student at Cornell University. Ph.D. candidate at CUHK. Working on bandits and reinforcement learning theory.

Qasim Ali @QasimAliSidhu

168 Followers 1K Following AI First Tech Savvy Technical Customer Support Engineer #AI #GenerativeAI #GenAI #FutureAILeaders #AIFirst

kunal singh @ikunalsingh7

62 Followers 660 Following Lead AI Researcher https://t.co/z4idFlmggM (T2I), Lead AI Researcher @fractalai Prev: GSoC @CERN, Alumni @IITKgp, Intern @AmiiThinks Diffusion, VLMs, reasoning@LLM

pinktopus @pinktopus_

6 Followers 39 Following

Alexander Wan @alexwan55

472 Followers 944 Following CS at Berkeley; @BerkeleyML @BerkeleyNLP; NLP research

Jeff Nickerson @jvnickerson

155 Followers 821 Following

Pensé FFun @inftyCategory

113 Followers 6K Following

MoonRide @moonride303

78 Followers 4K Following Friend of AIs

Mathieu Alain @miniapeur

19K Followers 2K Following Researching @ai_ucl. Co-organises @uclcsml and @logconference. FR, EN, trying ES. 🇹🇼🇨🇦🇬🇳🇺🇸🇩🇴🇫🇷🇪🇸🇬🇧🇿🇦

Megan Richards @megan_richards_

124 Followers 288 Following AI Resident @AIatMeta, previously @DukeInnovate. Reliable/Responsible AI.

Percy Liang @percyliang

49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist

Jim Fan @DrJimFan

229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

hardmaru @hardmaru

285K Followers 1K Following Building Collective Intelligence @SakanaAILabs 🧠

Jason Wei @_jasonwei

56K Followers 490 Following ai researcher @openai

(((ل()(ل() 'yoav))).. @yoavgo

46K Followers 2K Following

Ananya Kumar @ananyaku

4K Followers 469 Following Researcher at @openai Previously PhD at Stanford University (@StanfordAILab) advised by Percy Liang and Tengyu Ma

Gautam Kamath @thegautamkamath

44K Followers 505 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.

Behnam Neyshabur @bneyshabur

18K Followers 689 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpacking

Jacob Andreas @jacobandreas

14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw

Kyunghyun Cho @kchonyc

61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).

Christopher Manning @chrmanning

126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋

rishi @RishiBommasani

4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModels

Eric Jang @ericjang11

69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0p

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Anthropic @AnthropicAI

261K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.

AI at Meta @AIatMeta

531K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.

Dan Roy @roydanroy

45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)

Sasha Rush @srush_nlp

52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz

Shane Gu @shaneguML

28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)

Yizhong Wang @yizhongwyz

3K Followers 1K Following CS PhD student @uwcse @uwnlp. NLP/ML

Marc Marone @ruyimarone

422 Followers 586 Following PhD student at Johns Hopkins @jhuclsp. Previously @microsoft Semantic Machines, @mstranslator, @GeorgiaTech

Neel Guha @NeelGuha

746 Followers 663 Following JD-PhD candidate in computer science @Stanford

rohan anil @_arohan_

12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.

Tianyu Gao @gaotianyu1350

3K Followers 686 Following CS PhD student @Princeton @Princeton_nlp working on NLP. Previously: @Tsinghua_Uni @TsinghuaNLP

Kangwook Lee @Kangwook_Lee

2K Followers 667 Following Assistant Professor, ECE, UW-Madison / Leading deep learning research @ KRAFTON

Dimitris Papailiopoul.. @DimitrisPapail

11K Followers 970 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez Lily

Bill Yuchen Lin 🤖 @billyuchenlin

6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc

Fuzhao Xue @XueFz

4K Followers 542 Following Ph.D. candidate@NUSingapore, Intern of GEAR @NVIDIA | Google PhD Fellow | LLM, Foundation Model Scaling | Ex-@GoogleBrain | Zero-shot Cooking Learner🧑‍🍳

Allan Zhou @AllanZhou17

1K Followers 443 Following Final-year AI PhD student @Stanford. NN architecture design, learned optimizers, and hparam optimization.

Yangsibo Huang @YangsiboHuang

1K Followers 726 Following PhD candidate @Princeton. Prev: @GoogleAI @AIatMeta.

John Schulman @johnschulman2

39K Followers 609 Following Cofounder @openai, lead post-training for ChatGPT and the API. Interested in reinforcement learning, alignment, birds, jazz music

Ryan Chi @ryanandrewchi

18 Followers 42 Following Student Researcher, LLM Reasoning Team @GoogleDeepMind. Led @stanfordnlp's Alexa Prize Team to 1st Place (Science) at @AmazonScience's Socialbot Challenge.

Greg Durrett @gregd_nlp

6K Followers 752 Following CS professor at UT Austin. I do NLP most of the time. he/him

Douwe Kiela @douwekiela

10K Followers 378 Following @ContextualAI CEO, @Stanford Adjunct Prof.

Weiyan Shi @shi_weiyan

3K Followers 683 Following Postdoc @StanfordNLP, incoming assistant professor @Northeastern, PhD @Columbia| Prev Intern @MetaAI |Co-created CICERO | persuasive chatbots + privacy #nlproc

Alon Albalak @AlbalakAlon

885 Followers 464 Following CS PhD candidate at @ucsbNLP. Research: Data-centric AI, Efficiency in ML, NLP.

Hanxiao Liu @Hanxiao_6

2K Followers 102 Following @inflectionAI, ex Google Brain & DeepMind

Jason Weston @jaseweston

9K Followers 568 Following Research @MetaAI+NYU. Pretrain+FT: NLP from Scratch (2011). Multilayer attention+position embed+LLM: MemNets (2015). Recent (2023+):Sys 2 Attn, Self-Rewarding..

Shital Shah @sytelus

10K Followers 8K Following Deep learning research and code. If universe is an optimizer, what is the loss function? All opinions are my own.

schen @schen_7

13 Followers 185 Following NLP research @ Stanford

Ahmed Ahmed @AhmedSQRD

407 Followers 795 Following CS PhD @Stanford - Funding @KnightHennessy @NSF- 🇸🇩 - tweets include history & politics

Together AI @togethercompute

27K Followers 303 Following The future of AI is open-source. Let's build together.

Tian Xie @tianxie233

75 Followers 296 Following Research Engineer @character_ai | previously @SFResearch

Yasaman Bahri @yasamanbb

5K Followers 954 Following Research Scientist @GoogleDeepMind // ML + physics + quantum materials // Ph.D. theoretical cond matt physics @UCBerkeley.

Maurice Weber @mauriceweberq

72 Followers 359 Following AI Research @togethercompute | ML PhD @ETH @DS3Lab

Jacob Springer @jacspringer

327 Followers 168 Following PhD student @mldcmu

Center for Research o.. @StanfordCRFM

2K Followers 3 Following Making foundation models more reliable and accessible.

Jun-Yan Zhu @junyanz89

9K Followers 582 Following Assistant professor at Generative Intelligence Lab @SCSatCMU @CarnegieMellon. Understanding and creating pixels (https://t.co/yvop9D3ftM).

Emily Huynh @_ehuynh

19 Followers 62 Following PhD student @pennbioeng | prev: engineer @czbiohub, @thermofisher, @ucberkeley '20 |👩🏻‍💻👩🏻‍🔬 (she/her)

Feiyang Kang @feiyang_ml

51 Followers 33 Following ML/AI Ph.D. student at Virginia Tech advised by Prof. @ruoxijia, passionate about #DataCentricAI and #TrustworthyML . All contacts are welcome :)

Voyage AI @Voyage_AI_

2K Followers 164 Following Building embedding/vectorization models, customized for your domain and company, for better retrieval quality https://t.co/MEAhTpBQqd

Francis Lewis @_francis_lewis

73 Followers 227 Following m.a.d. @anduriltech | prev robotics @stanfordsvl

Sadhika Malladi @SadhikaMalladi

798 Followers 104 Following CS PhD student at Princeton

Weijia Shi @WeijiaShi2

5K Followers 967 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvym

Ruiqi Zhong @ZhongRuiqi

2K Followers 698 Following 5th Year Ph.D. @BerkeleyNLP, Columbia'19. part time working for @AnthropicAI . Supervising machines to do what I can't do.

Lucio Dery Jnr Mwinm @derylucio

461 Followers 956 Following

Mengzhou Xia @xiamengzhou

3K Followers 618 Following PhD student @princeton_nlp, MS @CarnegieMellon, Undergrad at Fudan.

CS Faculty Jobs @csfacultyjobs

5K Followers 2 Following Faculty jobs in Computer Science worldwide. Mostly automated. Mention/DM openings & we'll retweet. Created by @emilianoucl, now run by @shaddih

Ethan Chi @ethanachi

297 Followers 147 Following NLP research at @wehrtyou. Previously at @stanfordnlp. Pianist/organist.

Irena Gao @irena_gao

380 Followers 219 Following PhD student @StanfordAILab

Alexis Conneau @alex_conneau

24K Followers 111 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer

Tony Lee @tonyh_lee

402 Followers 86 Following Incoming PhD Candidate @StanfordAILab @StanfordNLP @Stanford. Author of HELM + extensions (https://t.co/f9UOXPWkpR). Prev: Research Eng at @StanfordCRFM.

Mark Chen @markchen90

10K Followers 245 Following Head of Frontiers Research at OpenAI. Coach for the USA IOI Team.

Stephen McAleer @McaleerStephen

3K Followers 807 Following Postdoc at CMU researching LLM agents and AI alignment

Jonathan Ho @hojonathanho

4K Followers 151 Following

Jascha Sohl-Dickstein @jaschasd

19K Followers 623 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.

Hyung Won Chung @hwchung27

18K Followers 229 Following Research Scientist @OpenAI. Past: @Google Brain / PhD @MIT

Shayne Longpre @ShayneRedford

4K Followers 998 Following PhD @MIT. Prev: @Google Brain, @apple ML, @stanfordnlp. 🇨🇦 Interests: AI/ML/NLP, Data-centric AI, transparency & societal impact

Saurabh Garg @saurabh_garg67

862 Followers 579 Following Building next-gen AI at @MistralAI | prev/ PhD @mldcmu; CS @iitbombay (undergrad); Collab @GoogleAI @awscloud @apple

Hieu Pham @hyhieu226

3 days ago

One year ago, I left Google Brain (now DeepMind) to join a very early startup. We had fewer than 10 people at that time, and have grown many times since. Today, I am extremely proud to share our milestone. We are Augment. You can read about us here. techcrunch.com/2024/04/24/eri…

24 40 709 439K 349

Fahim Tajwar @FahimTajwar10

5 days ago

Happy to share our work on preference learning methods for LLMs. Key insights: 1. Use more on-policy samples > off-policy samples 2. Contrastive DPO > Pref-FT. Also we provide insights on DPO's training mechanism. 3. Theoretical unification under mode-covering/seeking KL

Aviral Kumar @aviral_kumar2

5 days ago

Many LLM fine-tuning methods. Unclear what you should use & why? In our new paper, we did an extensive study of on-policy RL, supervised & offline contrastive methods (DPO, IPO) to answer this... 🧵⬇️ On-policy > offline, mode-seeking > mode-covering understanding-rlhf.github.io

3 65 270 31K 246

2 9 45 5K 16

Tri Dao @tri_dao

6 days ago

It's a great week for open source AI! Data is among the highest impact work to push the field forward. Bravo to 🤗

Thomas Wolf @Thom_Wolf

6 days ago

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

24 300 2K 289K 965

1 6 127 30K 15

Guilherme Penedo @gui_penedo

6 days ago

We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!

38 327 1K 526K 723

Mike Lewis @ml_perception

a week ago

Yes, both the 8B and 70B are trained way more than is Chinchilla optimal - but we can eat the training cost to save you inference cost! One of the most interesting things to me was how quickly the 8B was improving even at 15T tokens.

Felix @felix_red_panda

a week ago

Llama3 8B is trained on almost 100 times the Chinchilla optimal number of tokens

8 7 187 68K 30

14 38 503 87K 78

Tengyu Ma @tengyuma

2 weeks ago

🆕📢 @Voyage_AI_'s new embedding model for legal and long-context retrieval and RAG: voyage-law-2! 1.🥇 # 1 on MTEB legal retrieval benchmark with a large margin 2.📜 Best quality for long-context (16K) 3.✨ Improved quality across domains 4.🛒 On AWS Marketplace #RAG #LLMs

3 25 84 20K 37

Tony Z. Zhao @tonyzzhao

2 weeks ago

Introducing 𝐀𝐋𝐎𝐇𝐀 𝐔𝐧𝐥𝐞𝐚𝐬𝐡𝐞𝐝 🌋 - Pushing the boundaries of dexterity with low-cost robots and AI. @GoogleDeepMind Finally got to share some videos after a few months. Robots are fully autonomous filmed in one continuous shot. Enjoy!

57 324 1K 296K 400

Helen Qu @_helenqu

a week ago

I’m a Dr now!! so grateful to my advisor and all my collaborators, friends, and family for supporting me every step of the way 🥰

51 11 542 26K 15

Andrew Gordon Wilson @andrewgwils

2 weeks ago

I went to the Lincoln center yesterday for Rachmaninoff's No 2 concerto. It felt so refreshing to completely unplug and give this extraordinary piece of music my undivided attention. My favourite recording of this work is the 1963 Ashkenazy performance: youtube.com/watch?v=xyPDWa….

4 1 40 6K 9

Andrea Montanari @Andrea__M

3 weeks ago

I am partial to the original version, but then again, if this is what it takes…

Ben Golub 🇺🇦 🕸️ @ben_golub

3 weeks ago

I took a famous paper and asked Claude to rewrite its introduction in the style of Malcolm Gladwell, while preserving the mathematical content

21 55 548 105K 290

1 3 75 18K 24

Sadhika Malladi @SadhikaMalladi

3 weeks ago

Dataset choice is crucial in today's ML training pipeline. We (@xiamengzhou and I) introduce desiderata for "good" data and explain how our recent algorithm, LESS, fits into the picture. Huge review of data selection algs for pre-training and fine-tuning! cs.princeton.edu/~smalladi/blog…

2 54 203 35K 119

Tony Lee @tonyh_lee

4 weeks ago

Excited to share that I will be joining @StanfordAILab @Stanfordnlp @Stanford as a PhD student, working on foundation models 📈, robotics 🤖, and, of course, evaluation 🧐! Thankful for my health and the opportunity to do research in this field 🙏

12 4 280 39K 32

Jim Fan @DrJimFan

a month ago

Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning. The GR00T model will enable a robot to understand multimodal…

197 1K 5K 838K 2K

David Hall @dlwh

a month ago

Built with Levanter!

Vaibhav (VB) Srivastav @reach_vb

a month ago

Anticipatory Music Transformer by @StanfordCRFM 🎶 > A foundation model for symbolic music. > Supports generating accompaniments (enrich music) and infill (fill in musical details). > 780 Million parameters, trained for 800 Thousand steps. > Trained on Lakh, MetaMIDI and…

2 30 137 24K 101

0 4 19 8K 1

Zachary Lipton @zacharylipton

a month ago

When we looked for a final strategic partner to fulfill our dreams for series C, only one name came to mind: @nvidia. We are so thrilled to welcome our newest investors, collaboration partners, and long-time research buddies to the @AbridgeHQ family: abridge.com/press-release/…

10 11 145 17K 10

Boaz Barak @boazbaraktcs

a month ago

A commendable paper that make a versatile use of innovative and meticulous analysis to reach its notable conclusion.

Science of Science @MishaTeplitskiy

a month ago

Lots of people in CS are (almost surely) GPT-ing their peer reviews arxiv.org/abs/2403.07183

53 1K 6K 1.6M 1K

3 4 117 13K 12

Anastasios Nikolas Angelopoulos @ml_angelopoulos

a month ago

U give me: a bunch of unlabeled data. I give u: AI-generated labels. Result: a massive, but biased, val set. We use PPI to correct the bias, giving unbiased evaluations with better precision 🚀 arxiv.org/abs/2403.07008 Experiments on GPT-4 and ResNets, using @lmsysorg :)

Pierre Boyeau @pierreboyeau

a month ago

What if we could use AI to evaluate AI? 🧐 This would save thousands of human-hours---e.g., on platforms like @lmsysorg. But it introduces bias! Enter AutoEval Done Right: producing unbiased evaluations of models with synthetic data! arxiv.org/abs/2403.07008

7 15 78 41K 54

8 30 175 44K 94
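
The bias correction described above can be illustrated with the simplest form of prediction-powered inference for a mean. This is a minimal sketch and not the procedure from the linked paper, which may add refinements: it assumes a large set of AI-generated labels plus a small set of examples that have both an AI label and a human label. The function name `ppi_mean` and the toy numbers are hypothetical.

```python
from statistics import mean


def ppi_mean(preds_unlabeled, preds_labeled, labels):
    """Basic prediction-powered estimate of a mean.

    preds_unlabeled: AI-generated labels on the large unlabeled set
    preds_labeled:   AI-generated labels on the small human-labeled set
    labels:          human labels for that same small set
    """
    # Average error of the AI labels, measured where human labels exist.
    rectifier = mean([y - p for y, p in zip(labels, preds_labeled)])
    # The cheap large-sample estimate, shifted by the measured bias.
    return mean(preds_unlabeled) + rectifier


# Toy data (made up): the AI judge marks 75% of outputs as correct,
# but on the two human-checked examples it was too generous once.
estimate = ppi_mean(
    preds_unlabeled=[1, 1, 0, 1],
    preds_labeled=[1, 0],
    labels=[0, 0],
)
print(estimate)
```

The large set of AI labels keeps the variance low, and the small human-labeled set supplies the correction term that removes the systematic bias.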

Christopher Manning @chrmanning

a month ago

I do not believe human-level AI (artificial superintelligence, or the commonest sense of #AGI) is close at hand. AI has made breakthroughs, but the claim of AGI by 2030 is as laughable as claims of AGI by 1980 are in retrospect. Look how similar the rhetoric was in @LIFE in 1970!

117 393 2K 378K 564

Suraj Nair @SurajNair_1

2 months ago

Thrilled to be starting a new adventure at Physical Intelligence with some amazing colleagues and friends! Learn more: physicalintelligence.company

Karol Hausman @hausman_k

2 months ago

🚨 Big news 🚨 Together with a set of amazing folks we decided to start a company that tackles one of the hardest and most impactful problems - Physical Intelligence In fact, we even named our company after that: physicalintelligence.company or Pi (π) for short 🧵

45 43 539 119K 122

7 4 121 20K 3

Sergey Levine @svlevine

2 months ago

Since cat is out of the bag, it’s time I share: I’ll be starting a new adventure with an incredible team of friends and long-time collaborators to take on the big challenge of robot learning at scale! It's called Physical Intelligence (Pi… or π, like the symbol). 🧵👇