Yejin Choi @YejinChoinka
professor at UW, director at AI2, adventurer at heart · homes.cs.washington.edu/~yejin/ · Seattle, WA · Joined August 2017
Tweets 2K · Followers 18K · Following 329 · Likes 4K
We created reviewing guidelines for @COLM_conf. Not intended to automate the committee work, or dictate constraints. But, to inspire a thoughtful reviewing process, for an exciting and impactful program of the highest possible quality. We have a wonderful program committee ❤️
Yes, both the 8B and 70B are trained way more than is Chinchilla optimal - but we can eat the training cost to save you inference cost! One of the most interesting things to me was how quickly the 8B was improving even at 15T tokens.
We took this on Day2 of #TED2024. Some #AI ROCK⭐️'s...@drfeifei Daniela Rus @MIT_CSAIL @hlntnr @YejinChoinka @ruchowdh Niceaunties. And speaking today @CatieCuan + @AnimaAnandkumar ...oh...and then there's me🤣
Just tried the new GPT4+v on our New Yorker caption contest task (arxiv.org/abs/2209.06293). It does OK! (70%, good for second on leaderboard). But, w/ performance ~25% below human, it still doesn't quite "get the joke". Maybe your model does? :-) capcon.dev
Introducing our best OLMo yet. OLMo 1.7-7B outperforms LLaMa2-7B, approaching LLaMa2-13B at MMLU and GSM8k. High-quality data and staged training are key. I am so proud of our team making such significant improvement in a short period after our first release.
We released this new agenda on LLM-safety yesterday. This is VERY comprehensive covering 18 different challenges. My co-authors have posted tweets for each of these challenges. I am going to collect them all here! P.S. this is also now on arxiv: arxiv.org/abs/2404.09932
I’m super excited to release our 100+ page collaborative agenda - led by @usmananwar391 - on “Foundational Challenges In Assuring Alignment and Safety of LLMs” alongside 35+ co-authors from NLP, ML, and AI Safety communities! Some highlights below...
I will be talking about what differential privacy is, what it is not and what some common misconceptions are in privacy for generative AI in a couple hours @genlawcenter in DC! Join us on the live stream: tinyurl.com/genlaw-stream Slides: tinyurl.com/genlaw-dp-2024
🥰Excited to share that I will be joining AI2 @allen_ai @ai2_mosaic this September as a predoctoral young investigator!! So excited to continue working with amazing @YejinChoinka @nouhadziri @liweijianglw @kavel_r and can't wait to collaborate with others!
Can we uncover memorization of pre-training data in LLMs, using other LLMs? Our iterative prompt optimization method finds prompts that propel an LM to output training data using other LMs. We show higher avg. data reconstruction & extract 1.4X more PII! arxiv.org/abs/2403.04801
The infini-gram paper is updated with the incredible feedback from the online community 🧡 We added references to papers of @JeffDean @yeewhye @EhsanShareghi @EdwardRaffML et al. arxiv.org/abs/2401.17377 Also happy to share that the infini-gram API has served 30 million queries!
Welcome to the new era of AI: "Deep" was once the buzzword at AI conferences, but it's no longer the case in COLM.
Version II of the tutorial on neural theorem proving: github.com/cmu-l3/ntptuto… Some new additions - Train a model that gets 29.5% on miniF2F - Data extraction in Lean, based on lean-training-data - LLMLean tool (github.com/cmu-l3/llmlean)
Updates of ⚔️𝕎𝕚𝕝𝕕𝕍𝕚𝕤𝕚𝕠𝕟-𝔸𝕣𝕖𝕟𝕒: We added more models such as @AnthropicAI's Claude3 and @RekaAILabs! Also, many new features for improving user experience and collecting better evaluation data. E.g., we support selecting models for sampling and inputting reasons…
A few thoughts on using a DPO model as a reward model with reference model. It's a losing battle because if the texts are different lengths there's a fundamental signal-to-noise problem. Averaging, norm, sum, whatever, will make it hard to extract the signal from a list of…
Do you need free access to GPT-4? It's here 😎huggingface.co/spaces/yuntian… 🌟
@yoavartzi @COLM_conf Presenting: Breakin' It Down at COLM
I hear text-to-music is having its moment. Can someone generate a theme song for @COLM_conf ? I am not musical enough to even prompt it
LLM often could not correct its own mistakes. However, using a fine-grained feedback model, we could teach LLM how to correct its incorrect generation. Introducing LLMRefine: the power of simulated annealing on top of fine-grained feedback! Check out: arxiv.org/abs/2311.09336
DBRX-Base from @databricks also achieves the top position in the URIAL Bench, which tests Base LLMs on the MT-bench with URIAL prompts (3-shot instruction-following examples). Check out the full results here on @huggingface 🤗: huggingface.co/spaces/allenai… Related Xs: 1️⃣ [URIAL…
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Kyunghyun Cho @kchonyc
60K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-). ☕️ 🐕 🏃♀️🧗♀️🍳
Graham Neubig @gneubig
30K Followers 583 Following Associate professor at CMU, studying natural language processing and machine learning.
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Christopher Manning @chrmanning
126K Followers 114 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Jacob Andreas @jacobandreas
13K Followers 956 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC
William Wang @WilliamWangNLP
14K Followers 714 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.
Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR
Kayo Yin @kayo_yin
8K Followers 555 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵
Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI
Jia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.
Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai, open source science fan, @QueerInAI organizer 🤖☕️🍕they/them
Felix Hill @FelixHill84
9K Followers 776 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Ana Marasović @anmarasovic
4K Followers 603 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷
Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.
Nayan Saxena @SaxenaNayan
2K Followers 2K Following Brought artificial intelligence to @RBC, @Glowforge, @Wombo, @Bell & beyond.
Matt Ahmann @mattahmann
372 Followers 2K Following Finance @usouthflorida | MSF Candidate @vanderbiltu | Space🚀, Tech🖥️, AI 🤖, Biotech 🧬, Sustainable Energy ☀️, Finance 💹, College Sports Fan 🏀🏈🏟️
leoooX @leoooX10
15 Followers 206 Following
Sara Gemelli @saragemelli_
105 Followers 162 Following PhD student in Linguistics, @unipv & @UniBergamo 🔎 Discourse analysis in anti-feminist online communities 🔎 Representation of femicide cases in Italian news
Hannah @HEchenoz
1K Followers 375 Following Researcher & Faculty @UCBerkeley @CISPA @LIGLab @Inria @ncataggies;Alum @Columbia. NetSys| Wireless |5G| XR | HCI | Edge |Comp. Linguist |RL. Twin: @HaniaBP
gilangarisptr @Gilangarisptr
159 Followers 1K Following 🎯 My tweets are my own 🗒️ My Retweets are my notes 📊 IG : @jurnaldata.id
Phil C @philchen
179 Followers 323 Following
Danh Le Phuoc @danhlephuoc
4 Followers 73 Following
Danielle Perszyk @drperszyk
84 Followers 263 Following Artificial Intelligence. PhD in Cognitive Science.
poda3351 @poda335164881
29 Followers 449 Following
Danial Namazifard @IamDanialNamazi
70 Followers 463 Following MSc Student in AI, NLP Researcher @ UT #NLProc #MachineLearning
Rb @richhpalbangra4
5 Followers 3K Following
Jisang Park @jacejisangpark
0 Followers 27 Following Robotics Research Associate @ Korea Advanced Institute of Science and Technology. Interested in embodied NLP and HRI for personal robots.
Nikita @nikitavoloboev
4K Followers 5K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEK
Trina Stephens @TrinaSteph22445
0 Followers 5 Following
Dr. Catie Cuan @CatieCuan
1K Followers 695 Following Robot Choreographer | #Choreorobotics | Robotics/AI PhD @stanford | @IfThenSheCan | Formerly @theteamatx @tw_arts @The_RAD_Lab @TEDtalks
vishnu vs @log_root_
5 Followers 377 Following
Kim Yu-Ji @ug___k
10 Followers 61 Following
Agam A. Shah @shahagam4
354 Followers 490 Following Quantitative Finance and NLProc, Georgia Tech | DA-IICT'19. Tweets my own & CC BY-SA 4.0.
Becky @DD93565
71 Followers 257 Following I'm from Hong Kong, I travel a lot, I've been to every country in the world, I went to Iceland to see the Northern lights, I went to Denmark to see fairy tales.
Eva Maria Vecchi @emvecchi
175 Followers 373 Following NLP Researcher @ims_stuttgart & @Cambridge_NLP Argument Mining, e-Deliberation, Bias, Meaning representations, Cognitive Modeling, #NLProc methodology
Abdulrahman Tabaza @embed_dim
7 Followers 580 Following Enjoyer of various vector spaces, encoders and modalities
Carlos E. Mora @carlosemorama
67 Followers 106 Following Founder & CEO @Uptonmart, Applied Mathematics ITAM, Public Administration Columbia | SIPA
Measurer Star @feigaobox
144 Followers 1K Following
Hualong @ValonLee
14 Followers 59 Following
Prachi Garg @PrachiG68526104
109 Followers 320 Following Incoming CS Ph.D @UofIllinois | Masters Student @CMU_Robotics @CarnegieMellon
Roya @roya_kandalan
156 Followers 752 Following
Tessa Long @zhixuan_long
0 Followers 164 Following
Yu-Min Tseng @ym_tseng
2 Followers 56 Following
Anish Acharya @AnishAc10645870
69 Followers 278 Following PhD UT Austin || ex Applied Scientist Amazon Alexa AI || Research interns at Meta, MSR ; TTIC || Researcher in ML Theory, NLP, Optimization,
Helen Toner @hlntnr
21K Followers 1K Following Interests: China+ML, natsec+tech, brains+words+absurdity | Current: @CSETGeorgetown (opinions my own) | Former: @open_phil
Alberto Tono @albertotono3
1K Followers 1K Following PhD Candidate @Stanford | @StanfordHAI Graduate Fellow | AI Research Scientist Intern @Autodesk | Founder @CDInstitut
Harry Ho @Heruien
11 Followers 253 Following
AR0575 @ar057562841
0 Followers 464 Following
Amrith Krishna @krishnamrith12
678 Followers 497 Following Founder https://t.co/QBKyFF8Qo4 | Alum Postdoc: @cambridgenlp @ITUkbh | Phd: @IITKgp | 10+ years in AI Research | AE @ReviewAcl | consults for BharatGPT
Elrond1701 @elrond1701
6 Followers 70 Following
Mincho Ovesov @MinchoOvesov
5 Followers 12 Following
Xiwen Wei @XiwenWei_
11 Followers 56 Following
Lin Ai @_Lin_Ai_
0 Followers 37 Following
Raúl @raul_ap
155 Followers 806 Following Director for all things data & ML/AI @Plentific. Prev @intel. Views are my own.
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Kyunghyun Cho @kchonyc
60K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Graham Neubig @gneubig
30K Followers 583 Following Associate professor at CMU, studying natural language processing and machine learning.
Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Christopher Manning @chrmanning
126K Followers 114 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Jacob Andreas @jacobandreas
13K Followers 956 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaC
Yi Tay @YiTayML
28K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼
Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDR
Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAI
Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai, open source science fan, @QueerInAI organizer 🤖☕️🍕they/them
Felix Hill @FelixHill84
9K Followers 776 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Ana Marasović @anmarasovic
4K Followers 603 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷
Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.
Allen Institute for A.. @allen_ai
53K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfL
Tim Dettmers @Tim_Dettmers
28K Followers 819 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
Raphaël Millière @raphaelmilliere
10K Followers 2K Following Philosopher of Artificial Intelligence & Cog Science @Macquarie_Uni Past @Columbia @UniofOxford Also on other platforms Blog: https://t.co/2hJjfSid4Z
Seungju Han @SeungjuHan3
181 Followers 232 Following Incoming predoctoral researcher + now visiting student researcher @ai2_mosaic @allen_ai working on LLMs. Undergrad @SeoulNatlUni
lmsys.org @lmsysorg
35K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm
noahdgoodman @noahdgoodman
2K Followers 108 Following Professor of natural and artificial intelligence @Stanford. Research Scientist at @GoogleDeepMind. (@StanfordNLP @StanfordAILab etc)
Claire Lehmann @clairlemon
233K Followers 5K Following 🚀founded @quillette ✍️ writes for @australian 📧 subscribe: https://t.co/04OP3stmMN
Richard Dawkins @RichardDawkins
3.0M Followers 360 Following UK biologist & writer. Richard Dawkins Foundation donor: https://t.co/rZZdjPoMUe. For Details about the Upcoming Tour: https://t.co/sSo5FL6CWb
Pessimists Archive @PessimistsArc
91K Followers 64 Following Exploring technophobia and moral panic through the ages. A litany of shameful cynicism and spite. Curated by @louisanslow
Inflection AI @inflectionAI
49K Followers 3 Following We are an AI studio creating a personal AI for everyone. Our first is @pi, a supportive and empathetic conversational AI.
Kevin Patrick Murphy @sirbayes
42K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.
Yuntian Deng @yuntiandeng
3K Followers 3K Following #NLProc Postdoc @ai2_mosaic | Assistant Professor @UWaterloo '24 | Faculty Affiliate @VectorInst '24 | PhD @Harvard
Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)
Conference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024
Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽
Andrew Ng @AndrewYNg
1.0M Followers 909 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
Owain Evans @OwainEvans_UK
7K Followers 241 Following Research Associate @fhioxford, Oxford University. AI alignment. Prefer email to DM.
main @main_horse
8K Followers 467 Following AGI Believer. Haven't applied @OpenAI. Likes are not always endorsement.
Noam Brown @polynoamial
34K Followers 610 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus, the first superhuman no-limit poker AIs | Co-created CICERO | PhD from @SCSatCMU
David @DavidSHolz
53K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planck
Tri Dao @tri_dao
18K Followers 363 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.
Stella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/her
Nathan Lambert @natolambert
25K Followers 687 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentials
Rylan Schaeffer @RylanSchaeffer
3K Followers 973 Following CS PhD student with @sanmikoyejo at @stai_research @StanfordAILab
Ashwin Ramaswami @ashwinforga
3K Followers 2K Following Candidate for GA State Senate District 48. Contact me at [email protected]
Yao Fu @Francis_YAO_
13K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running
Shane Legg @ShaneLegg
51K Followers 57 Following Co-founder and Chief AGI Scientist, Google DeepMind
Guillaume Lample @GuillaumeLample
37K Followers 648 Following Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @Polytechnique
Jonathan Frankle @jefrankle
16K Followers 685 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAI
Databricks Mosaic Res.. @DbrxMosaicAI
30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.
Jürgen Schmidhuber @SchmidhuberAI
106K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
rishi @RishiBommasani
4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModels
Nouha Dziri @nouhadziri
3K Followers 670 Following Research Scientist @allen_ai / @ai2_mosaic, PhD in NLP/Dialogue 🤖 UofA. Ex Visiting researcher @Mila_Quebec Ex Research intern at @GoogleDeepMind @MSFTResearch
Daphne Koller @DaphneKoller
26K Followers 5 Following Founder and CEO of @insitro, Machine Learning pioneer, co-founder of Coursera, adjunct CS Professor at Stanford, avid traveler
Helen Toner @hlntnr
21K Followers 1K Following Interests: China+ML, natsec+tech, brains+words+absurdity | Current: @CSETGeorgetown (opinions my own) | Former: @open_phil
Emily Chang @emilychangtv
205K Followers 2K Following Host and executive producer of “The Circuit” on @Bloomberg Originals. Author of Brotopia. Proud mama and wife of @jonstull
Andrew Lampinen @AndrewLampinen
7K Followers 1K Following Interested in cognition and artificial intelligence. Research Scientist @DeepMind. Previously cognitive science @StanfordPsych. Tweets are mine.
Ate-a-Pi @8teAPi
38K Followers 2K Following self aware neuron; historian from 2130; epistemic polluter; 95 yr old man;
Sharif Shameem @sharifshameem
53K Followers 3K Following founder @LexicaArt • in pursuit of good explanations
Lilian Weng @lilianweng
94K Followers 147 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.
Oriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.
Christian Szegedy @ChrSzegedy
32K Followers 2K Following #deeplearning, #ai research scientist. Opinions are mine.
Jan Leike @janleike
44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.
Zachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷
Colin Raffel @colinraffel
30K Followers 655 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlp
Excited to share a preview of Llama3, including the release of an 8B and 70B (82 MMLU, should be the best open weights model!), and preliminary results for a 405B model (still training, but already competitive with GPT4). Lots more still to come... ai.meta.com/blog/meta-llam…
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
This is one of the most interesting papers I have seen in a while: arxiv.org/abs/2310.01929… These folks manage to convincingly conceptualize how text-to-image models capture cultural differences.
@jlibovicky @mor_ventura95 @bd_eyal @annalkorhonen @roireichart New relevant paper arxiv.org/abs/2404.10199 by @YejinChoinka @huihan_li (not citing the other one, but I still find the two relevant to each other)
The 2024 AI Index tacitly shows Natural Language Processing rising to be the central technology of AI 2004: NLP way off in the AI margins 2014: A little excitement over chatbots 2024: AI Index leads with impressive progress of LLMs aiindex.stanford.edu/report/ #NLProc #BiasedTakes
Usman deserves so much credit for leading and organizing this effort! It's been a long haul, but I'm really happy with the result!
Super excited about the release of this 🔥agenda paper on “Foundational Challenges in Assuring Alignment and Safety of LLMs!” that has been described as ‘particularly comprehensive' and 'epic piece of work' in private reviews. 😅
Excited about this for many reasons, but the biggest are 1. T5 is very very widely used IRL and better models are a good thing. 2. Checkpoints saved every 10,000 steps enabling research on learning dynamics and interp for s2s models like what Pythia has done for decoder models.
🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer. ✨ Featuring intermediate checkpoints and a significant boost in benchmark performance. Work done by @lintangsutawika, me…
So excited to announce an event @genlawcenter has been working on! We'll discuss the misconceptions b/w the technical capabilities of evaluating generative AI, and what policymakers and civil society want... April 15th @GtownTechLaw, and live on zoom: dc-workshop.genlaw.org
Thanks for the shout out! We'll be updating the course website with materials as the term progresses, and hope others find it useful. We're also thrilled to have guest lectures by @tydsh, @denny_zhou, @KaiyuYang4!
CS159: LLMs for reasoning lecture slides from Caltech are really good. Link: sites.google.com/view/cs-159-20… Thank you for making them public @yisongyue and @acbuller
Defended my thesis yesterday :) It's been a fantastic ride at @columbianlp and I am grateful to my advisor @SmaraMuresanNLP for believing in my work. Special thanks to @VioletNPeng who introduced me to Creative NLG which made a lot of the work in my thesis possible
I’m an Outstanding Senior Researcher @gtcomputing. Anyone who knows anything knows that it’s the members of my lab that really won this award. Congratulations, fam!