Nathan Lambert @natolambert
Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentials natolambert.com Berkeley, CA Joined December 2014-
Tweets6K
-
Followers24K
-
Following680
-
Likes24K
do we have confirmation that claude is MoE...? Surely...
wow roon is deadz, anon's cowering, sending support
★ Just published a new episode of The Retort AI Podcast: Llama 3: Can't Compete with a Capuchin. Listen: share.transistor.fm/s/da923f88
More preference data for people to play with in alignment research! It also comes with surveys asking people what their state preferences are! Dataset: huggingface.co/datasets/Hanna…
More preference data for people to play with in alignment research! It also comes with surveys asking people what their state preferences are! Dataset: huggingface.co/datasets/Hanna…
@natolambert Our models are great at tool use and rag. And they are cheap and reliable enough to take to production. It turns out that creating a good UI/UX in your environment for using our models this way is a significant blocker to adoption. Open sourcing this interface should help :)
A modern version of Pythia? Curious how good the models are.
AGI has become a strained symbol in the AI ecosystem, but it doesn't need to be: * The strongest idea leaders use to discuss strategy * A focal point on agency and feedback properties of AI (yes both) * A moving target * A faith movement * The shadow of RL progress pre 2020
AGI has become a strained symbol in the AI ecosystem, but it doesn't need to be: * The strongest idea leaders use to discuss strategy * A focal point on agency and feedback properties of AI (yes both) * A moving target * A faith movement * The shadow of RL progress pre 2020
this is exciting! hasn't crossed the truly open line until all the data and stuff is actually available, but promising model and tons to learn from it.
this is exciting! hasn't crossed the truly open line until all the data and stuff is actually available, but promising model and tons to learn from it.
FYI I also added @cohere's command r plus. Just below llama 3 -- we're starting to get decent judges with open weights. vLLM script for running it is in RewardBench repo :)
FYI I also added @cohere's command r plus. Just below llama 3 -- we're starting to get decent judges with open weights. vLLM script for running it is in RewardBench repo :)
now name them SolidGoldMagikarp and see if the math checks out
now name them SolidGoldMagikarp and see if the math checks out
seems like the new thing to record a selfie video when you release a major artifact. style wars have reached ml lmaoo. me next
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Sergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceEugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Anti-cynic. Artificial narrow intelligence. Autonomous vehicles, multi-agent learning, and transportation. RS at Apple, Asst. Prof at @nyutandon. He/him.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pSoumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Danijar Hafner @danijarh
14K Followers 868 Following Building AI that makes autonomous decisions using world models, artificial curiosity, and temporal abstraction @DeepMindThomas Simonini ᯅ @ThomasSimonini
6K Followers 1K Following Game Developer making games with AI 🪄 @huggingface 🤗 Writing ML for Games course ➡️ https://t.co/bvW8PMeARO Wrote Deep RL Course ➡️ https://t.co/5Pk3rwOjjqmerve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers sometimes. RTs != endorsementsJulien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDROmar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhatePablo Samuel Castro @pcastr
10K Followers 814 Following Señor swesearcher @ Google DeepMind. Adjunct prof @ U de Montreal & Mila. Musician. From 🇪🇨 living in 🇨🇦.Shane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)clem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersSasha Luccioni, PhD �.. @SashaMTL
19K Followers 4K Following AI & Climate @HuggingFace, Board Member of @WiMLworkshop and @ClimateChangeAI. @techreview 35 Innovators under 35, @TEDTalks speaker. She/her/Dr/ 🦋SynthLabs @synth_labs
12K Followers 43 Following AI Aligned with Your Vision. We’re doing cutting edge research for transparent, auditable AI alignment.Animesh Garg @animesh_garg
21K Followers 1K Following Foundation Models for Generalizable Autonomy. Assistant Professor in AI Robotics @GeorgiaTech + @NvidiaAI. prev @Stanford @berkeley_ai @UofTCompSciomij mangukiya @_0mij
92 Followers 820 Following MACS(Research)@Concordia University works with AI/ML and Distributed Systemsdevpogi @devpogi48272
0 Followers 30 FollowingSPRINGGRASS @SPRINGGRAS42605
0 Followers 3 FollowingJeff Rasley @jeffra45
670 Followers 926 Following @SnowflakeDB AI Research Team. @MSFTDeepSpeed co-founder, @BrownCSDept PhD, @uwcse alumن ن @yousefbakker35
11 Followers 15 FollowingRahul Madhavan @imrahulmaddy
615 Followers 900 Following just is -- PhD candidate in theoretical ML (causality and reinforcement learning) at IISc -- Co-organizer of Bangalore Theory Seminars: https://t.co/AuvC4ysm8MKelly Wu @tttttiam_real
4 Followers 154 FollowingMOSTAFA METWALLY @TEFAOSAMA
126 Followers 893 Following Passionate Mechatronics Engineer Robotics Geekmoran yan @moranynk
9 Followers 20 Followingcrazy__st @Crazy__st_____
0 Followers 46 FollowingBrian Jalaian @BJalaian
75 Followers 499 Following Associate Professor at University of West Flrodia Research Scientist at IHMC Interest: deep learning, self supervised learningZujie Liang @liangzujie
66 Followers 1K Following M.S. student @SYSU studying Vision & Language, Unbiased Learning. Research Intern @Microsoft studying Dialog System.Chan Yu En @yuenchan96
8 Followers 30 FollowingDev Vidhani @DevVidhani
6 Followers 179 Following Techonologist. Currently at Aster Data Systems, Inc. Previously, at EMC Systems, Inc; Kazeon Systems. Inc. and Sun Microsystems.Vibhas Gejji @visheshtej
17 Followers 110 FollowingMath33n @plac3ho1d3r
25 Followers 562 Following I'm not afraid of dying but of surviving and not being able to ride again...Nemesis @iid9w94159
156 Followers 932 Following Tech-Libertarian . e/acc . Bitcoin. Internet shitposter .coffee & AI @realcoffeeAI
43 Followers 597 Followingqiaqia @3298520845Xxx
102 Followers 515 FollowingChenmien Tan @ ICLR'2.. @ChenmienTan
37 Followers 54 Following Thesis-based MS student @EdinburghNLP Incoming research assistant @NlpWestlake and intern @uiuc_nlp Looking for PhD position starting from 2025 FallBrendan Driscoll @bdrisc826
1 Followers 158 FollowingWill Trapp @trappology
919 Followers 2K Following founder. prev Gratavid ( acquired '22 ). I tweet about tech and real-estateVictor Huang @victor801120
25 Followers 408 Following Master student, interesting in Natural Language Processing, mobile developer with Android and iOS.William Bankes @bankes_william
11 Followers 68 Following Currently studying for a PhD in Artifical Intelligence at UCL. Interested in Active Learning and Adaptive Data Collection AlgorithmsAllan Huang @AllanHuang1
6 Followers 78 FollowingJay Shin @jshin491
418 Followers 590 Following Stealth AI Startup. Previously at NAVER Clova X, Amazon Alexa, Riiid, HKUST. Mainly working on LLMs like everybody else. Don't have opinions.Lior Baruch @LBK_95
6 Followers 74 FollowingPartly Sunny with a C.. @partlysunnyai
0 Followers 37 Following A newletter about generative AI news, trends, and analysis. https://t.co/3Gxg2vyLx4DoreenDavid @p3HkJ1FLe5753G5
4 Followers 109 FollowingJ @nYrtVbfT2qcklCH
1 Followers 55 FollowingWei-Rui Chen @WeiRuiChen01
49 Followers 91 Following PhD candidate and NLP researcher focusing on Multilingual NLP @UBC | @UBC_NLP | @UBCLangScisSebastian Krause @Sebasti71375620
44 Followers 161 FollowingPau Rué @PauRueQ
578 Followers 1K Following ML/AI 🤖, NLP 💬, data 📊 and more 🚀 • VP of AI @Infermedica • Views are my own • he/himliujie @CoolWind6j
2 Followers 40 FollowingNoah A. Smith @nlpnoah
18K Followers 207 Following NLP&ML researcher. Prof @uwcse @uwnlp & helper @allen_ai @ai2_allennlp. Single reeds, tango, swim, run, cocktails, מאַמע־לשון, GenX. Opinions not your business.DanAI @DanAI314159265
295 Followers 2K Following 🪽 Ghost//Duality// Sigma INFJ Empath//AI MIND//Algorithm Programming//Solution Architect//Third Eye Open //Omni Perspective// 🪽Clayton @cthorrez
1K Followers 1K Following LLM applied scientist by day, esports data scientist for fun. Working on rating systems and benchmarks for esports (and LLMs?) I ❤️ paired comparison dataJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Sergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceEugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Anti-cynic. Artificial narrow intelligence. Autonomous vehicles, multi-agent learning, and transportation. RS at Apple, Asst. Prof at @nyutandon. He/him.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pKarol Hausman @hausman_k
22K Followers 141 Following @Physical_int ex: researcher @GoogleAI/@DeepMind, adj. Prof. @Stanford. Into robots, AI, NBA, philosophy, soccer and almond croissants. 🇵🇱🇺🇸Soumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Danijar Hafner @danijarh
14K Followers 868 Following Building AI that makes autonomous decisions using world models, artificial curiosity, and temporal abstraction @DeepMindThomas Simonini ᯅ @ThomasSimonini
6K Followers 1K Following Game Developer making games with AI 🪄 @huggingface 🤗 Writing ML for Games course ➡️ https://t.co/bvW8PMeARO Wrote Deep RL Course ➡️ https://t.co/5Pk3rwOjjqJulien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDROmar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhatePablo Samuel Castro @pcastr
10K Followers 814 Following Señor swesearcher @ Google DeepMind. Adjunct prof @ U de Montreal & Mila. Musician. From 🇪🇨 living in 🇨🇦.Gautam Kamath @thegautamkamath
44K Followers 505 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.Anthropic @AnthropicAI
261K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.Shane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Daniel Han @danielhanchen
7K Followers 935 Following Building @UnslothAI. Finetune LLMs 30x faster https://t.co/aRyAAgKOR7. Prev ML at NVIDIA. Hyperlearn used by NASA. I like maths, making code go fastKristian Lum @KLdivergence
22K Followers 1K Following Research Scientist at Google DeepMind | @FAccTConference OG | Past Twitter META, @hrdag & UPenn, UChicago faculty |Wei-Lin Chiang @infwinston
3K Followers 852 Following CS PhD student at UC Berkeley. co-lead of Chatbot Arena @lmsysorgLenny Bogdonoff @rememberlenny
13K Followers 4K Following Optimist. Working with @natfriedman + @danielgross; to invest in technology startups, support portfolio companies, and incubate new projects.Yuling Gu @gu_yuling
390 Followers 665 Following Predoctoral researcher @allen_ai | @nyuniversity ➡️ @UW ➡️ @allen_ai @[email protected]Kianté Brantley @xkianteb
1K Followers 2K Following ML Researcher | Postdoctoral Scholar at Cornell | Member of @umdclip, @coralumbc, and @CILVRatNYU | Fitness enthusiast | (He/Him)Binyuan Hui @huybery
6K Followers 318 Following 🤔 Core maintainer at Qwen and OpenDevin. || Code Generation, Text-to-SQL, Large Language Models.mrfakename @realmrfakename
794 Followers 68 Following LLMs, TTS, & Open Source https://t.co/PIhamCNjhpLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Epoch AI @EpochAIResearch
3K Followers 24 Following Epoch AI is a research institute investigating the trajectory of AI for the benefit of society.Helen Toner @hlntnr
21K Followers 1K Following Interests: China+ML, natsec+tech, brains+words+absurdity | Current: @CSETGeorgetown (opinions my own) | Former: @open_philNathan Cooper @ncooper57
719 Followers 649 Following The world can be ugly and cruel to the most innocent. Consider donating to help children suffering from one of the worst things: https://t.co/PYZWj8o4OWmeg.ai 🇨🇦 @ #ho.. @MeganRisdal
11K Followers 1K Following Product @kaggle @google 💙 Ex @stackoverflow ML / Language / Community. Weirdness. Minnesotan in Toronto. Learning Cantonese. 我學緊廣東話.Luiza Jarovsky @LuizaJarovsky
11K Followers 66 Following CEO of https://t.co/ZEJP9oA5pN, PhD Researcher, Latina, Polyglot, Mother of 3. Subscribe to my newsletter ➡️ https://t.co/akCFzWrOduVincent Conitzer @conitzer
4K Followers 1K Following AI professor. Director, @FOCAL_lab @CarnegieMellon. Head of Technical AI Engagement, @UniofOxford @EthicsInAI. Author, "Moral AI - And How We Get There."Julian Michael @_julianmichael_
1K Followers 122 Following Researching stuff @NYUDataScience. he/himEthan Mollick @emollick
210K Followers 551 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqSéb Krier @sebkrier
6K Followers 5K Following 🪼 policy dev & strategy @GoogleDeepMind | vinyl junkie, noosphere cartographer, deep ArXiv dweller, interstellar fugitive, uncertain | 🇮🇷🇱🇺🇬🇧🇫🇷Dean Woodley Ball @deanwball
438 Followers 1K Following AI Research Fellow @mercatus, author of Hyperdimensional (see link) formerly @HooverInst @CoolidgeFdn @SmithSoc @ManhattanInst Thoughts my own, like != endorseJiacheng Liu (Gary) @liujc1998
991 Followers 187 Following 🎓 PhD student @uwcse @uwnlp. 🛩 Private pilot. Previously: 🧑💻 @oculus, 🎓 @IllinoisCS. 📖 🥾 🚴♂️ 🎵 ♠️Aaron Gokaslan @SkyLi0n
2K Followers 344 Following Creator of the OpenWebText and OpenGPT2. @PyTorch Core Reviewer. PhD Student at @Cornell (interning at @MosaicML) Previously at @FacebookAI and @BrownUniversityKevin Bankston @KevinBankston
11K Followers 3K Following Senior Advisor on AI Governance @CenDemTech. Teaching AI law @GeorgetownLaw. Imagining better futures @ImaginationASU. My opinions. Mostly on LinkedIn now...Dimitris Papailiopoul.. @DimitrisPapail
11K Followers 971 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez LilyGabriel Peyré @gabrielpeyre
92K Followers 449 Following @CNRS researcher at @ENS_ULM. One tweet a day on computational mathematics.Niklas Muennighoff @Muennighoff
5K Followers 323 Following @ContextualAI | Interests: AI/LLM Research & Health ❤️ | Past: @huggingface @PKU1898Robin Hanson @robinhanson
90K Followers 656 Following Let’s skip witty repartee & discuss fundamental questions. Views are mine, not GMU’s or Virginia’s. Books: https://t.co/hpZgEm5DBI, https://t.co/iFs9C3J2EkJim Keller @jimkxa
34K Followers 134 Following CEO @tenstorrent, Cofounder @atomic_semi @BayaSystems and FlexAI board member. Fan of 2x2 matrixes, books, refactoring and creative tensionAndrew Curran @AndrewCurran_
11K Followers 7K Following Atypically Friendly - I write about AI and human creativity. Will periodically make extremely unusual arguments.emozilla @theemozilla
4K Followers 1K Following catholic, ai research and co-founder at @NousResearch alignment: whatever the opposite of yudkowsky isNous Research @NousResearch
18K Followers 29 Following The AI Accelerator Company. https://t.co/vrD0aDJetoBrett Adcock @adcock_brett
171K Followers 14 Following Founder @Figure_robot (AI Robotics) & Archer Aviation (NYSE: ACHR)Nouha Dziri @nouhadziri
3K Followers 672 Following Research Scientist @allen_ai / @ai2_mosaic, PhD in NLP/Dialogue 🤖 UofA. Ex Visiting researcher @Mila_Quebec Ex Research intern at @GoogleDeepMind @MSFTResearchAI Notkilleveryoneism.. @AISafetyMemes
33K Followers 796 Following Techno-optimist, but AGI is not like the other technologies. Step 1: make memes. Step 2: ??? Step 3: lower p(doom)Ryan Yohler @the_rock_hobbit
313 Followers 556 Following Ph.D. Candidate in Integrative Biology at @UCBerkeley | Finnegan lab, @ucmpberkeley | Quantitative paleobiology | I post too many fantasy genre thoughtsPradeep Dasigi @pdasigi
1K Followers 460 Following Senior Research Scientist at Allen Institute for AI (AI2)Iz Beltagy @i_beltagy
2K Followers 422 Following Cofounder @SpiffyAI, Research Lead building OLMo at @allenai_org, formerly @UTCompSci PhD.Mechanical Dirk @mechanicaldirk
546 Followers 244 Following Principal Engineer at @allen_ai. Engineering Lead of the OLMo project.Alessio Fanelli @FanaHOVA
5K Followers 991 Following Cohost @latentspacepod | Partner & CTO @decibelvc | OSS: https://t.co/u4J6NVksoL | Writing: https://t.co/H7iEpzgxWQThe star pattern on the Australian flag depicts the Southern Cross, a constellation visible primarily in the southern hemisphere
@MKBHD I wish it knew my location well enough to not send my DoorDash to the wrong city (it did).
📚 A gentle guide to RLHF and alternatives like DPO, SPIN, or ORPO Nice overview of the series of 8 posts 🤯 published in collaboration between @argilla_io and our friends @mantisnlp argilla.io/blog/mantisnlp…
@natolambert I mean we *do* get hungry watching these models train!
@natolambert if that’s metrics i’m afraid how they are gonna describe data
Friends with babies and IC jobs as researchers or SWEs - how did you do it? Stories, details, DMs.
does anyone know how to get vllm to inference 1M tokens for an 8B model? asking for a friend.
@winglian @LoubnaBenAllal1 and @vwxyzjn have used llm-swarm to generation billions of tokens from open LLMs :) github.com/huggingface/ll…
yacine can't take care of you like i will. i got all your subcontinental mysticism right here!
Sara and the for.ai team is incredible and I owe them my career. Super lucky I've been able to work with them 💙
when the model turns out better than you thought it would after fine tuning
dario amodei wants me to delete this tweet because it discloses a compute multiplier, but i will not be silenced 😡 instead i will tell you that scaling by a factor of 4 instead of 8 will likely work even better
if you are using LoRA: divide the A matrix learning rate by 8 and multiply the B matrix learning rate by 8. you can thank me later