Cody Blakeney @code_star
Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5w Redwood City, CA Joined August 2011-
Tweets16K
-
Followers5K
-
Following1K
-
Likes38K
It's happening, guys! > @arcee_ai AFM-4.5B on AMD Instinct MI300X VF with bs 64 (* 4 grad accu * 1024 seq len). > Full fine-tuning on medical data from @OpenMed_AI thanks to @HotAisle! Thank you, @Shekswess, for saving me hours on setup 🙏
Many of us are doing lots of interesting other things now. Training stacks and infra have come a long way since the 2023 MosaicML days. Now you can swipe a credit card and get lots of nodes and find efficient configs to run on them. The last bit to crack was data. I’m hopeful…
Many of us are doing lots of interesting other things now. Training stacks and infra have come a long way since the 2023 MosaicML days. Now you can swipe a credit card and get lots of nodes and find efficient configs to run on them. The last bit to crack was data. I’m hopeful…
+1 this, in virtually every multi-modal codebase I worked on it was always a post-hoc addition to the text training stack. Developing our infra and codebases from scratch allowed us to really nail down the proper data design.
+1 this, in virtually every multi-modal codebase I worked on it was always a post-hoc addition to the text training stack. Developing our infra and codebases from scratch allowed us to really nail down the proper data design.
We're trying something a bit new. Making sense of the massive expenditure and noisy headlines can be challenging for those who aren't closely following the industry's daily pulse. There's room for clearer and more concise analysis. Lmk what you think.
We're trying something a bit new. Making sense of the massive expenditure and noisy headlines can be challenging for those who aren't closely following the industry's daily pulse. There's room for clearer and more concise analysis. Lmk what you think.
in retrospect, the 2023 mini-wave of pretraining as a service was directionally correct, just three years too early.
Big token hated him
How many tokens would it take to express everything you have ever learned in your life?
Research teams have complexity budgets. Simple foundations allow teams to spend on more novelty and do it more efficiently. VLM training recipes often have complex specs (many stages, permodule LRs, MLP warmup). We wanted to find the minimum recipe that maximizes performance.
Research teams have complexity budgets. Simple foundations allow teams to spend on more novelty and do it more efficiently. VLM training recipes often have complex specs (many stages, permodule LRs, MLP warmup). We wanted to find the minimum recipe that maximizes performance.
Language models forced to train on random slop from the internet
Language models forced to train on random slop from the internet
No codex, Claude code, or cursor? What?
everybody worried about FOOM when they should be worried about DOOM
It makes sense why Anthropic spends so much time working on safety. Their model is a supervillain.
.@stochasticchasm is indeed hibernating as we have a big gpu reservation coming online at 4:30am tomorrow morning. More to come :)
.@stochasticchasm is indeed hibernating as we have a big gpu reservation coming online at 4:30am tomorrow morning. More to come :)

Sebastian Raschka @rasbt
358K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
AK @_akhaliq
428K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Jonathan Frankle @jefrankle
20K Followers 733 Following Chief AI Scientist @databricks via MosaicML.
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Databricks Mosaic Res... @DbrxMosaicAI
41K Followers 120 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.
Aran Komatsuzaki @arankomatsuzaki
145K Followers 305 Following Looking for a cofounder. Sharing AI research. Early work on AI (GPT-J, LAION, scaling, MoE). Ex ML PhD (GT) & Google.
Naveen Rao @NaveenGRao
33K Followers 880 Following VP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.
Rosanne Liu @savvyRL
46K Followers 1K Following (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
Jeremy Howard @jeremyphoward
261K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
elvis @omarsar0
266K Followers 680 Following Building with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
Sara Hooker @sarahookr
50K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Gautam Kamath @thegautamkamath
57K Followers 568 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant September 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
Abhi Venigalla @ml_hardware
7K Followers 1K Following Researcher @Databricks. Former @MosaicML, @CerebrasSystems. Addicted to all things compute.
Davis Blalock @davisblalock
15K Followers 168 Following Research scientist @GoogleDeepMind. Past: @Databricks, first hire @MosaicML, @MIT PhD. I post about AI technical progress + sometimes the business side.
Matthew Leavitt @leavittron
3K Followers 970 Following Chief Science Officer, Co-Founder @datologyai. Former: Head of Data Research @MosaicML; FAIR. 🧠 and 🤖 intelligence // views are from nowhere
Hamel Husain @HamelHusain
39K Followers 2K Following Evals evals evals https://t.co/Zrmp6LRd9c About Me: https://t.co/P6WyeKkyTa
merve @mervenoyann
80K Followers 5K Following open-sourceress at @huggingface 🧙🏻♀️proud Aegean, I work on computer vision, VLMs & agents | gençleri serbest bırakın
Chairman Birb Bernank... @Bonecondor
36K Followers 6K Following technoyapitalist ms frizzle @secretsoupco
Ax Tan @ax_tan
8 Followers 75 Following
K Aayush Mazumdar @Tweeting_Aayush
800 Followers 6K Following Founder Founding Foundations. I tweet to learn better. Platforms are interesting I guess.
Sourabh Medapati @activelifetribe
201 Followers 2K Following MLE @ Netflix Research, previously RE @ Google Deepmind, AlgoPerf contributor
nixpiper @nixpiper
63 Followers 2K Following
Omair Shahid @OmairShahid
824 Followers 7K Following Product of progressive public policy; raised by public libraries and public education that produced a passion for politics. and apparently alliteration
Sai Vignan @vignan_sai
111 Followers 2K Following ML Engineering @Microsoft, prev ML @sprinklr, CS @iitdelhi, Interested in ML, Bio Informatics
Jinghua Zhong @zhongjinghua
0 Followers 4K Following
RockyParadox @RockyParadox44
126 Followers 7K Following
Luca Baggi @baggiponte
533 Followers 2K Following 📈 AI Engineer @ https://t.co/Du2lQ9AFgU 🗞 Ho scritto spiegoni @ilpost 🎓 MSc Econ & Stats @LaStatale 🎓 BA Filosofia @UniBergamo & @SorbonneParis1
Alvin Vinod @alvinvinodc
183 Followers 6K Following Masters in Machine Learning @UniofNottingham Former Data Scientist @KPMG
Jasper @zjasper666
15K Followers 2K Following Co-founder and CEO @Hyperbolic_Labs. ex-@avax & ex-@citsecurities. Finished Math PhD in 2yrs @UCBerkeley. Math Olympiad Gold Medalist. Highest honor @PKU1898
Earl Dennsion Tan @EarlDennisonTan
30 Followers 397 Following AI Engineer tinkering with LLMs to make 'em actually useful Skeptical of benchmarks, pragmatic builds.
Alex Danilowicz @alexdanilowicz
6K Followers 729 Following co-founder @magicpatterns — where the best teams build products. Try our AI design tool for free. Building with @teddarific
Daniel San @dani_avila7
15K Followers 2K Following co-founder and CTO, building @aitmpl_com + @codegptAI + @deepgraphMCP | Powered by TypeScript & Pumpkin Spice Lattes ☕️
harmon @_harm0n
139 Followers 647 Following exploring mlsys/compiler optimization | research @securebio/@mit | prev @WisconsinCS
Erica Ward @ward_erica84113
62 Followers 2K Following
RobertaMarlowe @Gt34YN5e5zQAR8O
13 Followers 582 Following
RorroArt @rorroart_code
10 Followers 210 Following
Lina Defi 🦋 @LinaDefi__
6K Followers 8K Following Web3 Girl👠👠| NFT/Alpha Threads | Tweet is NFA & DYOR| Early Access → DM ✉️
Brooler @Brooler3029
80 Followers 3K Following
Manu Gaur @gaur_manu
534 Followers 890 Following used to do physics, now multiplying matrices @CMU_Robotics | prev @IIIT_Hyderabad
shwetu (luca) @_shwetu
349 Followers 4K Following organic general intelligence | jack of all trades, master's from @NYUDataScience prev: Research @NYTimesRD @precog_iiitd; Manipal grad | he/him
ConstantWriter @loonacy58
796 Followers 4K Following Canadian. Proud leftist. 🇨🇦♏️🦂❄️ Text-based lifeform. She/her. Elbows Up.
Jamie Bloxham @__jamie_b
60 Followers 48 Following Co-founder @ Sphinx. Previously early SWE at Scale AI + MosaicML
Supreet Sahu @supreet_sahu
19 Followers 847 Following IIT Kharagpur @IITKgp '26 | 4th Year Undergrad @ ECE( Dual degree spl- Vision & Intelligent Systems) | AI/ML/DL/Computer Vision | Also on X : @SupreetSahu
Alsiarjar @Alsiarjar44623
114 Followers 2K Following
Gretel, Vega. @Vwarsuis939
26 Followers 872 Following
finest.eth @kyoungrok_jang
749 Followers 6K Following
Michael @scharf_michael
389 Followers 813 Following Account focused on exploring web3. EP, TV show creator, writer, character designer. https://t.co/WNuaxgGEfJ
GDP @bookwormengr
8K Followers 9K Following AI @amazon. AI infrastructure, Open Source, RL, Agents, China. Strictly my views.
Yann LeCun @ylecun
954K Followers 765 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
François Chollet @fchollet
575K Followers 816 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Sebastian Raschka @rasbt
358K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
AK @_akhaliq
428K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Jonathan Frankle @jefrankle
20K Followers 733 Following Chief AI Scientist @databricks via MosaicML.
Delip Rao e/σ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Emad @EMostaque
291K Followers 24 Following Distributing Intelligence @ii_posts. Founder @StabilityAI.
Databricks Mosaic Res... @DbrxMosaicAI
41K Followers 120 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.
Aran Komatsuzaki @arankomatsuzaki
145K Followers 305 Following Looking for a cofounder. Sharing AI research. Early work on AI (GPT-J, LAION, scaling, MoE). Ex ML PhD (GT) & Google.
Google DeepMind @GoogleDeepMind
1.2M Followers 279 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Naveen Rao @NaveenGRao
33K Followers 880 Following VP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.
Andrew Ng @AndrewYNg
1.3M Followers 1K Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
Lucas Beyer (bl16) @giffmana
110K Followers 523 Following Researcher (now: Meta. ex: OpenAI, DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://t.co/xe2XUqkKit ✗DMs → email
Soumith Chintala @soumithchintala
252K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Rosanne Liu @savvyRL
46K Followers 1K Following (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
Jeremy Howard @jeremyphoward
261K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Gabriele Berton @gabriberton
7K Followers 1K Following Postdoc @Amazon working on VLM - ex @CarnegieMellon @PoliTOnews @IITalk
fal @fal
34K Followers 6 Following the generative media cloud. hiring https://t.co/JrbUk989MN. for support/discounts, e-mail us at [email protected].
Jamie Bloxham @__jamie_b
60 Followers 48 Following Co-founder @ Sphinx. Previously early SWE at Scale AI + MosaicML
Dom @dominik_scherm
2K Followers 2K Following Kardashev accelerator @PrimeIntellect ● Founder Aurea Berlin ● Europe AI ecosystem
Songlin Yang @SonglinYang4
14K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
Mike Solana @micsolana
378K Followers 1K Following billionaire media tycoon and former mayor of san francisco. disinformation researcher. cmo @foundersfund. editor-in-chief @piratewires 🏴☠️
David Stutz @davidstutz92
4K Followers 1K Following Research scientist @DeepMind working on robust and safe AI, previously @maxplanckpress, views my own.
OpenPipe @OpenPipeAI
4K Followers 4 Following OpenPipe: Fine-tuning for production apps. Train higher quality, faster models. (YC S23)
Aleksa Gordić (水�... @gordic_aleksa
25K Followers 229 Following getting us to singularity with friends x @GoogleDeepMind @Microsoft tensor core maximalist
Christina Baek @_christinabaek
2K Followers 560 Following PhD student @mldcmu | intern @datologyai @GoogleAI | Robust ML
Chris 🇨🇦 @llm_wizard
1K Followers 489 Following Working on cool open-source AI stuff @ NVIDIA Views my own.
Jason Weston @jaseweston
13K Followers 723 Following @Meta+NYU. NLP from scratch(Pretrain+FT LLM) 2008,MemNet (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+, Self-Rewarding+more!
Taylor W. Killian @tw_killian
3K Followers 891 Following Senior Research Scientist @MBZUAI @a16z, interested in Decision Making & Generalization // @BYU '13; @Harvard '17; @UofT '24
Unsloth AI @UnslothAI
32K Followers 458 Following Open source LLM fine-tuning & RL! 🦥 https://t.co/2kXqhhvLsb
Anna @AnushkaDeshpan8
34 Followers 1K Following
Mira Joyce @mira__joyce
13K Followers 2K Following
Eric Hu @_EricHu
34K Followers 4K Following vp of design @cohere member @agigraphic previously @nike @ssense
Mark McQuade @mmcquade_ai_u
551 Followers 790 Following CEO and founder of @arcee_ai | @huggingface 🤗 alum. AI and Data Obsessed. Fitness Fanatic. Tattoo Enthusiast.
Shaurya Rohatgi @shauryr
1K Followers 2K Following Burning GPUs at Institute of Foundation Models @mbzuai, PhD @ISTatPENNSTATE Ex @allen_ai @SemanticScholar @UChicago @AbvIiitm
Chris Offner @chrisoffner3d
3K Followers 3K Following Student Researcher @rai_inst, CS MSc student @ETH_en. visual computing, 3D vision, spatial AI, machine learning, robot perception.
aashay sachdeva @AashaySachdeva
3K Followers 473 Following I tweet about ML,data, investing and startups | ML @SarvamAI | Ex- Invest @RebrightVC |Ex-Senior Data Scientist at @PlayMPL | Built https://t.co/hWenaRkujG
hud @hud_evals
1K Followers 6 Following RL environments + evals for agents | @ycombinator | we're hiring!
Jay @jayendra_ram
2K Followers 918 Following founder @hud_evals, prev cs+physics @columbia, @ycombinator
vLLM @vllm_project
18K Followers 20 Following A high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
Mango @MangoSweet78
384 Followers 481 Following Post-training @ https://t.co/jQT9G3hHUc, See what i've cooked on my HF @ https://t.co/QZABvVi2P0
mads campbell @martyrdison
38K Followers 4K Following funny girl in tech. founded @ledahealthco. wannabe florist 💐
Zephyr @zephyr_z9
31K Followers 491 Following Tech, AI, Semiconductors, Stocks, Finance. DMs are open
𝔊𝔴𝔢𝔯𝔫 @gwern
64K Followers 106 Following Internet besserwisser; pedantic, mean reply guy. 𝘞𝘢𝘵𝘢𝘴𝘩𝘪 𝘬𝘪𝘯𝘪𝘯𝘢𝘳𝘪𝘮𝘢𝘴𝘶! (Follow requests ignored due to terrible UI.)
Z.ai @Zai_org
17K Followers 153 Following The AI lab behind GLM models, dedicated to inspiring the development of AGI to benefit humanity. https://t.co/b6zGxJvzzS
Daniel Han @danielhanchen
28K Followers 2K Following Building @UnslothAI. Finetune train LLMs faster. LLMs bug hunter. OSS package https://t.co/aRyAAgKOR7. YC S24. Prev ML at NVIDIA. Hyperlearn used by NASA.
thebes @voooooogel
15K Followers 898 Following "peaceful, albeit ominous" ꙮ website → https://t.co/aykxqKippW ꙮ games → https://t.co/3Pz19vHOwd ꙮ 💞💍📝 @holotopian ꙮ she/they 🏳️⚧️
Zachary Nado @zacharynado
13K Followers 750 Following Research eng @GoogleDeepMind on Gemini pretrain. Personal acct. Past: swe intern @SpaceX, ugrad researcher in @tserre lab @BrownUniversity. All opinions my own.
Junhua Mao @junhuamao
919 Followers 84 Following Lead personality and model behavior research @OpenAI; Previously built the object understanding system and foundation models for self-driving @Waymo