Antoine Bosselut @ABosselut
Helping machines make sense of the world. Asst Prof @ICepfl; Before: @stanfordnlp @allen_ai @uwnlp @MSFTResearch #NLProc #AI atcbosselut.github.io Joined March 2013
Tweets: 988 · Followers: 3K · Following: 602 · Likes: 2K
📣 CALL FOR PAPERS An @icmlconf workshop on LLMs 🤖and Cognition 💭 (LLMCog) 📍 Vienna July 27 Submit your 4-page papers due May 22 Attend to hear from our invited speakers, the amazing @MelMitchell1, @rao2z, @chelseabfinn, and @ABosselut ! llm-cognition.github.io
We released 🍷FineWeb: 15T high quality tokens from the web. It's the best ready-to-use AND the largest pretraining dataset. Outperforms all other datasets in our 350B token ablations but scales to much longer training runs due to its sheer size! hf.co/datasets/Huggi…
Beyond scaling, understanding human-AI interaction modes is the next frontier in producing systems that are most useful to human users. Led by the incredible @MinaLee__ and @katyilonka , we produced a design space for AI writing assistants!
For everyone asking for the solution, take 200 hours and generate a meaningful (500-1k entries) test set and never show it to anyone.
A tweak in the architecture of #Transformers can significantly boost accuracy! With direct access to all previous blocks’ outputs, a 48-block #DenseFormer outperforms a 72-block Transformer, with faster inference! A work with @akmohtashami_a,@francoisfleuret, Martin Jaggi. 1/🧵
I will be presenting our paper REFINER @EACL2024 today at 11:00 CET in Malta 🇲🇹 🧐Can small specialized LMs improve the CoT generated by LLMs? --> Yes! Paper Link: aclanthology.org/2024.eacl-long…
Our Geo-Regional Africa Group is looking forward to hosting @bkhmsi next week on Tuesday, March 18th. Badr will present "Investigating Cultural Alignment of Large Language Models" (arxiv.org/abs/2402.13231), be sure to join us! Learn more: cohere.com/events/c4ai-Ba…
NEW PAPER ALERT: We propose DiffuCOMET, a family of diffusion-based knowledge models that generate relevant contextual commonsense when presented with FULL narrative contexts, rather than just uncontextualized individual KG triples.
Check out our work on DiffuCOMET, our new method for generating context-grounded commonsense knowledge graphs using #diffusion ! Joint work with @Sony Research
I partially agree! - Retiring some old datasets is too harsh, and leakage is hard to quantify ❌ - Closed-source evals should be more welcome. 🔥 Shameless plug for our closed-source eval for commonsense reasoning 🐦⬛CROW 🔗mete.is/crow/ @mismayilsoy
Angelika Romano starts her talk about quantifying the strength of causal relationships in real-world data at our Causal Parrots Workshop @RealAAAI #causaltwitter #causality #aaai2024
🚀Introducing Nemotron-4 15B by @nvidia! 🎉 With 15B parameters and trained on 8T tokens, it's impressive in multilingual AI. Outperforms all similarly-sized models and dominates in multilingual tasks, even surpassing models 4x larger! #NVIDIA #Nemotron4 arxiv.org/pdf/2402.16819…
Excited to present our causal benchmark 🦀 CRAB at the Causality & LLMs Workshop (LLM-CP) @RealAAAI (llmcp.cause-lab.net)! If you're around, check out my talk today at 4 pm (PST)! You can also download and use our benchmark here: github.com/agromanou/CRAB
Delighted to announce Aleksander Madry @aleks_madry Head of Preparedness, @OpenAI, as our closing keynote speaker at #AMLDEPFL2024 on March 26. Join us. 🎟Tickets go.epfl.ch/AMLDEPFLGetTic…
🚨 New Paper What happens when Anthropologists and ML researchers work together? 1. Propose a framework for measuring Cultural Alignment of LLMs 2. Show that languages in the pretraining data and that of the prompt affect alignment 3. Introduce Anthropological Prompting (1/n)
If you are at @RealAAAI, please pass by our poster! We will show you ConVQG, our Visual Question Generation method, which generates questions with multimodal guidance! With @limi_rs, @javi_fcn, @ABosselut, X. Dai and S. Montariol arxiv.org/abs/2402.12846 limirs.github.io/ConVQG/
Meta presents Efficient Tool Use with Chain-of-Abstraction Reasoning huggingface.co/papers/2401.17… In mathematical reasoning and Wiki QA domains, we show that our method consistently outperforms previous chain-of-thought and tool-augmented baselines on both in-distribution and…
@naval It exists - it is called Meditron, and was developed at @EPFL_en - @ABosselut
New Paper!! 🚨 Partial Diacritization is often overlooked in #ArabicNLP despite its importance in improving reading speed and accuracy. Check the 🧵to read about our CCPD algorithm, new metrics, model and behavioral study that supports our work! Demo: huggingface.co/spaces/bkhmsi/…
Unlike any sane person who gets a PhD in NLP right now, afterwards I made a game. I just released it in early access talktomehuman.com Talk to NPCs who talk back at you, try to persuade your way out of sticky situations
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
According to the license, you must name all models that use llama 3 in any way “LLaMa 3 XXX” llama.meta.com/llama3/license/ They don't say that you can't give your models nicknames though... "LLaMa 3 Robert Archibald Percival Fortescue Language Model" aka "BobLM"
So proud of my brilliant spouse @vibhuti_ramach !!!
Congrats to Vibhuti Ramachandran, @UCIrvine global & international studies, who's received the @AIISIndia Joseph W. Elder Prize in the Indian Social Sciences for her forthcoming book, “Immoral Traffic”: An Ethnography of Law, NGOs, & the Governance of Prostitution (@CUPAcademic)!
Not the one who told Noam this, but I don't filter based on paper counts. I find publication record to often be more of a distraction than a helpful signal ¯\_(ツ)_/¯
Someone on the admissions committee for a top CS PhD program told me they no longer filter based on paper count because too many of the applicants already have multiple publications. Instead, they now filter by citation count. Not sure if he was joking but I believed it.
Thrilled to share a review on THE LANGUAGE NETWORK AS A NATURAL KIND—a culmination of ~20 yrs of thinking about+studying language from linguistic, psycholinguistic, and cog neuro perspectives. @NatRevNeurosci rdcu.be/dEylV With the amazing @neuranna @tamaregev 🥳 🧵1/n
Delighted to announce that Scarlet Schwiderski-Grosche will be joining the EPFL AI Center as Executive Director, starting May 1st. Her demonstrated leadership in cultivating collaborations across various sectors, will be invaluable to the Center. Welcome aboard Scarlet!
A Design Space for Intelligent and Interactive Writing Assistants #CHI2024 👩🏻✏️🤖 What writing assistants do you use? What else are out there and how do they differ? What do we need to consider when designing new writing assistants? 🔗 arxiv.org/abs/2403.14117 (1/6)
Honored to be on the list! 🙏 I’m actively recruiting students who are interested in AI and writing to understand how AI will change the way we communicate. ✍️ Please consider applying to @UChicagoCS and @DSI_UChicago! More info: minalee.info/prospective-st…
Here's a list of NLP superstars 🌟 who are beginning their journey 🚀 in the 2023/24 academic year. @jieyuzhao11 @GabrielSaadia @acbuller @Lianhuiq @ManlingLi_ @yuntiandeng @rajammanabrolu @YueDongCS @tanyaagoyal @MinaLee__ @yuntiandeng @alsuhr @wellecks @hllo_wrld @Xinya16
Since cat is out of the bag, it’s time I share: I’ll be starting a new adventure with an incredible team of friends and long-time collaborators to take on the big challenge of robot learning at scale! It's called Physical Intelligence (Pi… or π, like the symbol). 🧵👇
I’m really excited to be starting a new adventure with multiple amazing friends & colleagues. Our company is called Physical Intelligence (Pi or π, like the policy). A short thread 🧵
For a little bit more info, check out -- Our website: physicalintelligence.company Nice article by @ashleevance: bloomberg.com/news/articles/…