Allen Institute for AI @allen_ai
AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfL allenai.org Seattle, WA Joined September 2015-
Tweets2K
-
Followers53K
-
Following361
-
Likes1K
Our teammates consistently cite "the people" as their favorite part of AI2. We're currently looking to hire a superstar to help build out our team and culture! If you want to work with the best people solving the biggest problems, come join the fun! 🪩 boards.greenhouse.io/thealleninstit…
Starting today, Dolma is now using the ODC-BY license. We've made this change to better support our community — you can learn more about our decision here: blog.allenai.org/making-a-switc…
“I’m very happy to see any effort in openness. We need more of this.” Our CEO Ali Farhadi spoke with @willknight about openness in AI, like @databricks' #DBRX.
“I’m very happy to see any effort in openness. We need more of this.” Our CEO Ali Farhadi spoke with @willknight about openness in AI, like @databricks' #DBRX.
PS: if you are also attending GenLaw and are looking for opportunities to research at the intersection of AI, Law, and Policy, let's chat 😊
PS: if you are also attending GenLaw and are looking for opportunities to research at the intersection of AI, Law, and Policy, let's chat 😊
🎉Congratulations to Elaine Zhong, recipient of our 2024 Outstanding Engineer Scholarship! We’re impressed and inspired by Elaine’s determination and desire to make a real impact in AI. Get to know Elaine on the blog: blog.allenai.org/elaine-zhong-a…
AI is transforming climate forecasts — and our Climate Modeling team is leading the way. They recently developed a machine-learning emulator called ACE, which accurately forecasts atmospheric variables 6 hours ahead of a comparable physics-based model and runs 100 times faster…
We’re at an important inflection point for AI that requires a community of researchers, engineers and technologists to better understand it and drive meaningful innovation. We’re glad to be working with organizations like @allen_ai!
“We’re at an important inflection point for AI that requires a community of researchers, engineers and technologists to better understand it and drive meaningful innovation.” - @mechanicaldirk, AI2 principal research engineer, on #DBRX 👇This is meaningful innovation.
“We’re at an important inflection point for AI that requires a community of researchers, engineers and technologists to better understand it and drive meaningful innovation.” - @mechanicaldirk, AI2 principal research engineer, on #DBRX 👇This is meaningful innovation.
Meet #DBRX: a general-purpose LLM that sets a new standard for efficient open source models. Use the DBRX model in your RAG apps or use the DBRX design to build your own custom LLMs and improve the quality of your GenAI applications. dbricks.co/43xaCMj
@allen_ai @databricks Thanks, @allen_ai! We think people will find OLMo 7B to be great for finetuning for their custom applications. Finetuning a 7B #LLM for a specific use case can provide comparable quality to much larger general-purpose models, with improved latency and lower serving costs.
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSasha Rush @srush_nlp
51K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzKyunghyun Cho @kchonyc
60K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Rosanne Liu @savvyRL
32K Followers 965 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Jia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.Thomas Wolf @Thom_Wolf
67K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai, open source science fan, @QueerInAI organizer 🤖☕️🍕they/themNatasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIPasquale Minervini �.. @PMinervini
7K Followers 4K Following Researcher in ML/NLP at the University of Edinburgh (faculty @InfAtEd @EdinburghNLP), @ELLISforEurope, @UCL_NLP, PI for @Clarify2020, https://t.co/WydvfU8ugz he/theyFelix Hill @FelixHill84
9K Followers 776 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sAna Marasović @anmarasovic
4K Followers 602 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscOfir Press @OfirPress
9K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Leo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on #NLProc evaluation, fairness & culture. Usually ranting, often about research & DEI. 📚 @readsndrantsVince 🐦 @odtvince
198 Followers 448 FollowingLee Stanley @LStanley1013
245 Followers 845 Following Among the Seaweed and the slime. #Data Engineer, #Auburn alum. #UAB grad. #nature #golf #travel #WarEagle 🦅Priyanshu Kumar @kpriyanshu256
20 Followers 368 FollowingKatherine O'Toole @KatherineO13637
11 Followers 171 Following Ph. D. Candidate at Northwestern University studying the dynamics of co-creative behaviors within socio-technical systems and human-AI interactions.Educarte IA @EducarteIa
271 Followers 3K Following Desarrollador de soluciones con inteligencia artificial / Consultor Bussines IA / Researcher IA / especialista en SEO / Formulador de proyectosLivingstoneWu @livingstone_wu
1 Followers 18 FollowingAbdulrahman Tabaza @embed_dim
2 Followers 447 Following Enjoyer of various vector spaces and modalitiesLawrence @KisemboLawrenc1
990 Followers 5K FollowingMiguel Tejada @mite45
238 Followers 1K Following Estadístico y entusiasta de todo lo relacionado a datos.Jims Young @JimsYoung_
2K Followers 2K Following Views are my own, investment @YoubiCapital | Traveler, 22 countries|#saxxedDeacon Michael Hogan @Deaconmike53
5K Followers 6K Following Student of astronomy astrophysics artificial intelligence physics machine learning meteorologyAllan Huang @AllanHuang1
6 Followers 77 FollowingDMV AI GUILD (Powered.. @dmvAIguild
0 Followers 14 Following 🤖 A Unique AI Event Community for DC/MD/VA Professionals to Connect, Network & Explore Everything AI. Powered by IdeaFire™. #dmvAIguild #IdeaFireAISyed Waqas Zamir @waqas_zamir
116 Followers 211 Following Research Scientist at IIAI. Computer vision, Generative AI, image restoration and enhancement.Busingye Festo Kukund.. @BusingyeFesto
362 Followers 4K Following248chinmay Rkampli @crk1245
2 Followers 68 Following陈先生 @chen_xian7737
1 Followers 36 FollowingMK @michaelkazinda
143 Followers 718 Following God & Family | Energy & Earth Resources | Founder & CE at @Optimum_EarthMohammed Amine BEN CH.. @AmineLehocine
30 Followers 1K FollowingMugerwa @ArthurMugerwa
154 Followers 348 Following Christian. Husband. Father. Son. Entrepreneur. Community Leader. A Computer Engineer Learning @MakerereLaw @clet_ug John 1:16TrekWithTim @TrekkingTimmy
2K Followers 3K Following Tours, treks and expeditions! Real discovery is not in seeking new landscapes... it’s in seeing with new eyes.Iliya Tsekov @lytskv
74 Followers 502 FollowingMesubsetofRunionC @mesubsetof
23 Followers 327 FollowingMing Tan @MingTan83344874
7 Followers 79 FollowingEconomistas AI @EconomistasAi
2 Followers 22 FollowingHaoxuan (Steve) Chen @SC96004870
248 Followers 762 Following Ph.D. student at @ICMEStanford; B.S. in Math (PMA) & Data Science (CMS) @Caltech'22; Applied and Computational Math/Statistics/OptimizationTimothy Malche @tim3in
7 Followers 58 FollowingLynnee Ai @lynnee_ai
14 Followers 109 FollowingGihan Lakmal @Gihandsilva
10 Followers 203 Followingveeroll @veerollofficial
2 Followers 58 FollowingKANTPAT @KANPAT31070
4 Followers 42 Following钟辉 @zhnghu008049148
10 Followers 283 FollowingTrending AI @market_ai101
2 Followers 36 Following Your go-to source for expert insights, reviews, and exclusive deals on the latest AI tools. Join us as we navigate the world of AI innovation.Jeff Tatarchuk @jtatarchuk
1K Followers 2K Following Co-founder @tensorwavecloud - Pioneering the next wave of AI compute. Need GPUs? DM me.Necto @nectohq
5 Followers 10 Following Universal Artificial Intelligence/Machine Learning Models provider.fito diazgranados @fitodiazgranad1
0 Followers 218 FollowingNathaniel Burgdorfer @__burgdorfer__
25 Followers 126 Following Computer Science Ph.D. Student | Stevens Institute of Technology(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Christopher Manning @chrmanning
126K Followers 114 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Kyunghyun Cho @kchonyc
60K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai, open source science fan, @QueerInAI organizer 🤖☕️🍕they/themAna Marasović @anmarasovic
4K Followers 602 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Edward Grefenstette @egrefen
36K Followers 773 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscLeo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.Jonathan Frankle @jefrankle
16K Followers 685 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAIStanford NLP Group @stanfordnlp
144K Followers 178 Following Computational Linguists—Natural Language—Machine Learning @chrmanning @jurafsky @percyliang @ChrisGPotts @tatsu_hashimoto @MonicaSLam @Diyi_Yang @StanfordAILabEMNLP 2024 @emnlpmeeting
12K Followers 41 Following EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024 Hashtag: #EMNLP2024Prithviraj (Raj) Amma.. @rajammanabrolu
5K Followers 517 Following Interactive & grounded AI, RL, NLP. Assistant Prof @UCSanDiego. Research Scientist @DbrxMosaicAI. Prev: @allen_ai, @GeorgiaTechSebastian Ruder @seb_ruder
80K Followers 1K Following Multilingual LLMs @cohere • Prev: @GoogleDeepMind • Newsletter: https://t.co/7JGh2qpG98Nathan Lambert @natolambert
25K Followers 685 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsBeen Kim @_beenkim
23K Followers 453 Following Research Scientist at Google DeepMind, PhD from MIT. Make machines empower people. @[email protected]Suchin Gururangan @ssgrn
4K Followers 248 Following he/him Research scientist on Llama team, @meta GenAI prev: PhD @uwcse + @uwnlpBo Wang @BoWang87
8K Followers 2K Following Assistant Prof. CS,LMP @UofT; CIFAR AI Chair @VectorInst; Chief AI Scientist, @UHN; former PHD, CS @Stanford; opinions my own. #AI #healthcare #combioDaniel Liden @danjliden
185 Followers 630 Following Developer Advocate @Databricks | Former @bitdotioinc @GuinnCenterDatabricks @databricks
69K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.clem 🤗 @ClementDelangue
89K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform to build machine learningHugging Face @huggingface
340K Followers 188 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateInterconnects @interconnectsai
2K Followers 1 Following What you need to know about AI research trends, from @natolambert Wednesday mornings weekly, sometimes extra posts.Steven Overly @StevenOverly
8K Followers 3K Following Host of @POLITICO Tech, a daily podcast. Tell me how tech is disrupting politics and policy. Past: @washingtonpost. @Bagehots alum. @nlgjadc pres. He/Him.U.S. National Science.. @NSF
1.3M Followers 153 Following Explore #NSFfunded research that is transforming the world. Social media policy: https://t.co/IRuZ2l1oLWJulien Chaumond @julien_c
46K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueYann LeCun @ylecun
708K Followers 716 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Kempner Institute at .. @KempnerInst
1K Followers 90 Following The Kempner Institute for the Study of Natural and Artificial Intelligence at @Harvard University. RTs ≠ EndorsementsZichen "Charles" Zhan.. @ZCCZHANG
167 Followers 268 Following PYI @allen_ai | Specialist can generalize and Generalist can specializeYuntian Deng @yuntiandeng
3K Followers 3K Following #NLProc Postdoc @ai2_mosaic | Assistant Professor @UWaterloo '24 | Faculty Affiliate @VectorInst '24 | PhD @HarvardYue Yang @YueYangAI
301 Followers 237 Following PhD student @upennnlp, interested in vision and language.Yejin Kim @_YejinKim
67 Followers 359 Following A roboticist, a painter and a dog mom. insta: https://t.co/wf6qpOaLbHSean (Xiang) Ren @xiangrenNLP
6K Followers 576 Following Building @SaharaLabsAI | @USCViterbi Early Career Chair, Professor @nlp_usc | @MIT TR 35 , @ForbesUnder30 | Prev: @allen_ai, @Snapchat, @Stanford, @UofIllinoisWinson Han @WinsonHan
14 Followers 34 Following Game Designer @allen_ai, interested in Computer Vision, Art, Music, Anime, MMOs, Action GamesWei-Chiu Ma @weichiuma
1K Followers 138 Following Incoming Assistant Professor @Cornell @CornellCIS. Postdoc @Allen_AI @UWCSE. PhD @MIT_CSAIL. Prev @UberATG @Waabi_AI. #ComputerVision #Robotics Opinions my own.Valentina Pyatkin @valentina__py
2K Followers 1K Following Postdoc at the Allen Institute for AI @allen_ai and @uwnlpValentin Hofmann @vjhofmann
964 Followers 228 Following Young Investigator (Postdoc) @allen_ai @ai2_allennlp | Formerly @UniofOxford @CisLMU @stanfordnlp @GoogleDeepMindtusharkhot @tusharkhot
276 Followers 188 Following Senior Research Scientist, Allen Institute for AITal August @tal_august
558 Followers 189 Following Incoming assistant professor @IllinoisCS Fall 2024, current postdoc @allen_ai, former PhD student @uwcse. HCI + NLP. Designing language for different people.Sean Welleck @wellecks
3K Followers 222 Following Assistant Professor at CMU. Marathoner, @thesisreview.Ranjay Krishna @RanjayKrishna
5K Followers 414 Following I teach machines to see and interact with people. + Assistant Professor @UWcse - Prev. Research scientist @MetaAI - PhD @StanfordAILab - Instructor @StanfordPiper Wolters @piper_wolters
5 Followers 14 FollowingPeter Hase @peterbhase
2K Followers 684 Following Google PhD Fellow at @uncnlp. Interested in interpretable ML, natural language processing, AI Safety, and Effective Altruism.Pao Siangliulue @Siangliulue
285 Followers 630 Following 🎒 {Creativity, AI, People} | HCI researcher & software eng | @allen_ai | previously @B12, @Harvard, @Stanford | Also on Bluesky 🦋Nathaniel Weir @Nathaniel_Weir
507 Followers 847 Following PhD candidate @jhuclsp working on reasoning. Formerly @ai2_aristo, MS Semantic Machines, @MSFTResearch, @BrownCSDept. On the job market (industry/postdoc)Minyoung Hwang @robominyoung
243 Followers 209 Following Research Intern @carnegiemellon, Previously @allen_ai, MS Grad @SNU | Robotics | Preference-based Reinforcement Learning | Human-Robot InteractionLianhui Qin @Lianhuiq
4K Followers 393 Following Incoming Assistant Professor at UCSD CSE. Currently postdoc at AI2 Mosaic. NLP, ML, AI. I’m recruiting PhD students.Kolby Nottingham @kolbytn
210 Followers 225 Following CS PhD at @UCIrvine researching RL+NLP and interactive LLMs. Upcoming intern @riotgames. Previously @allen_ai, @AiDungeon, @unity, and @nvidia .Khyathi Chandu @khyathi_chandu
1K Followers 444 Following Research Scientist @AI2 | Previously at : @MetaAI @LTICMU @SCSatCMU @GoogleAI @Apple | RisingStars2020Kaitlyn Zhou @KaitlynZhou
459 Followers 315 Following Currently @allen_ai @ai2_mosaic PhD student @StanfordNLPJordan Steward @jordansteward
1K Followers 1K Following WI native, MN fan, @ChelseaFC supporter and Seattlite. Comms @earthrangertech and @skylightmarineJohann Dahm @jdahm
68 Followers 137 Following Software engineering and programming language aficionado. Former computational fluid dynamics nerd. Outdoors enthusiast.Joel Jang @jang_yoel
932 Followers 478 Following PhD student @uwcse. Research Intern at @nvidiaai robotics. Prev: @allen_aiJeremy McGibbon @jeremy_mcgibbon
633 Followers 501 Following Researcher and PhD, using machine learning and software engineering to improve climate models. mastodon: [email protected]Jacob Morrison @jacobcares
223 Followers 263 Following PYI & Policy @ @allen_ai @ai2_allennlp Political opinions are my own, and do not represent the views of AI2Ian Magnusson @IanMagnusson
250 Followers 294 Following Predoctoral Young Investigator on AllenNLP at @allen_ai. Working on domain adaptation, reproducibility, and evaluation in NLP.Hamish Ivison @hamishivi
489 Followers 595 Following Antipodean Abroad. he/him. I (try to) do NLP research. PhD student @uwcse, prev @Sydney_Uni @allen_ai 🇦🇺🇨🇦🇬🇧Emma Strubell @strubell
4K Followers 936 Following assistant professor @LTIatCMU & visiting scientist @allen_ai. natural language processing and efficient ML. she/her/dad (2 dogs). 🏳️🌈. hiking and food. BLM.Doug Downey @_DougDowney
277 Followers 174 Following Research Manager at @allen_ai, Prof at @northwesterncsDa Yin @Wade_Yin9712
770 Followers 420 Following PhD @uclanlp | Intern at AI2 Mosaic @ai2_mosaic | Amazon PhD Fellow in 2023 @AmazonScienceBrian Henn @BrianMHenn1
78 Followers 33 Following hydrologist, data scientist, engineer, new(ish) parent@yuzhaouoe I **loved** your paper Yu ❤️ When we started OLMo, we discussed similar approaches, but ran out of time to do research on it & write efficient code. I'm so glad y'all published this study–Llama 3 blogpost doesn't do the problem justice! Definitely on the table now 😉
🌻 Super excited about my first Computer Science publication at @naaclmeeting (main)! @mbodhisattwa and I study the language of deception and how language models fare at detecting them. And guess what we've found: arxiv.org/pdf/2311.07092… (1/n) 🧵 @EconUofU @allen_ai
Our team is incredibly proud to partner with @allen_ai and thrilled to see them cook! Achieving such a massive improvement in MMLU, while reducing the compute budget, is a fantastic win. And doing it fully open? Everyone wins. Congrats! Can't wait to see what's next 👀
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
In case folks are curious of what two-stage training gets you: x.com/soldni/status/…
@xlr8harder just dug it out of our log, hope it helps!
@mer__edith Literally better at the benchmark 😊 At @allen_ai, we work to demystify what actions LLM devs take to improve numbers. Our goal is to transparently explain what's going on w benchmarks! I guess we do things a bit differently than most shops out there?... 😅 perks of nonprofit
@mer__edith @allen_ai oh 100% agree with you, none of this MMLU hyper-optimization makes any sense. I personally think it's important to show the exact steps one has to follow to game the score, and we are planning follow-up work showing what compromises doing that entails!
Today we released a new version of OLMo 7B, which has significantly improved performance on MMLU. We also discuss a lot of how we got the improvements, big shoutout to the team! Check out that performance-efficiency tradeoff 🤩 this new model is on the Pareto frontier!
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
Amazing work as always by @soldni and and the folks at AI2. Major improvements on MMLU by tweaking the recipe on data cleaning and mixing. Read the blog and learn from the best. Open science is so important!
What does it take to get a good MMLU score? Turns out: decent data, instructions in pretraining, fuzzy dedup, and quality filtering. just dropped OLMo 1.7-7b… nice perf lift over 1.0! Blog: blog.allenai.org/olmo-1-7-7b-a-… Model: huggingface.co/allenai/OLMo-1… Data: huggingface.co/allenai/dolma
Shout-out to the team – this came together really quickly! Y’all thoroughly rocked it @soldni, @kylelostat, @natolambert, @gu_yuling, @HannaHajishirzi, and a bunch of people who don't have twitter because they know getting stuff done is more important!
We released OLMo 1.7 7B + Dolma 1.7 today 🔥 With the juiciness of Dolma 1.7 + staged training we have improved OLMo’s MMLU score by 24 pts, clearly better than Llama2 7B! Blog post: blog.allenai.org/olmo-1-7-7b-a-… Model: huggingface.co/allenai/OLMo-1… Dataset: huggingface.co/datasets/allen…
notable stuff: 🦉ton of perf boost from mixing instruct data at end (e.g., flan) 🐋anneal learning rate (Fig 9b in arxiv.org/abs/2403.08763) 🐞changing data mix boosts MMLU at some cost to other evals 🍇huggingface.co/allenai/dolma 🧀huggingface.co/allenai/OLMo-1…
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
Allen AI team is moving fast. They're on the ~Pareto frontier~ now, and it's built in the open, so we all get to see how it's done! gratz @mechanicaldirk @soldni @natolambert
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
Nice write-up on the path to higher data quality from @allen_ai. They released their dataset AND their FastText quality filter, which I really appreciate. Also, annealing per OpenBMB is well and truly SOP --I wonder if schedule-free optimizers will change this.
What does it take to get a good MMLU score? Turns out: decent data, instructions in pretraining, fuzzy dedup, and quality filtering. just dropped OLMo 1.7-7b… nice perf lift over 1.0! Blog: blog.allenai.org/olmo-1-7-7b-a-… Model: huggingface.co/allenai/OLMo-1… Data: huggingface.co/allenai/dolma
Introducing our best OLMo yet. OLMo 1.7-7B outperforms LLaMa2-7B, approaching LLaMa2-13B at MMLU and GSM8k. High-quality data and staged training are key. I am so proud of our team making such significant improvement in a short period after our first release.
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…
@allen_ai @allen_ai That's awesome news. Congrats on OLMo 1.7. Can't wait to see the improvements. #UpgradeComplete
more details in this announcement! fixed data link: huggingface.co/datasets/allen…
Announcing our latest addition to the OLMo family, OLMo 1.7!🎉Our team's efforts to improve data quality, training procedures and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:…