Sebastian Riedel (@[email protected]) @riedelcastro
Researcher in NLP/ML @deepmind, @ucl_nlp, @[email protected] on Mastodon riedelcastro.org London, England Joined September 2009-
Tweets2K
-
Followers15K
-
Following470
-
Likes6K
Today is a big day for our friends at @ProfluentBio! 🧬 We're announcing a $35M Series A, led by @sparkcapital, with significant participation from @airstreet. Here's why I'm psyched for what's to come:
I often feel that the main reason I made it through school was my quite extraordinary forgetfulness. I really had to rederive everything all the time. @yihong_thu showed how this trait can also help language models in her NeurIPS paper arxiv.org/abs/2307.01163. Now on Quanta!
I often feel that the main reason I made it through school was my quite extraordinary forgetfulness. I really had to rederive everything all the time. @yihong_thu showed how this trait can also help language models in her NeurIPS paper arxiv.org/abs/2307.01163. Now on Quanta!
Periodically resetting the embeddings may sound like a terrible idea when training language models, but it can make them easier to extend to new languages! Quanta magazine is now covering our NeurIPS paper on this topic (led by the amazing @yihong_thu).
Periodically resetting the embeddings may sound like a terrible idea when training language models, but it can make them easier to extend to new languages! Quanta magazine is now covering our NeurIPS paper on this topic (led by the amazing @yihong_thu).
Come work with Sida and us!
Do LLMs “secretly” (in its layers) perform some kind of multi-hop inference when facing multi-hop prompts? The amazing @soheeyang_ led our gang at @GoogleDeepMind to explore this question.
Do LLMs “secretly” (in its layers) perform some kind of multi-hop inference when facing multi-hop prompts? The amazing @soheeyang_ led our gang at @GoogleDeepMind to explore this question.
🚨 New Paper 🚨 LLMs excel at storing facts & in-context reasoning like CoT. But do they latently💭 reason over their parametric knowledge without answering step-by-step? We found positive evidence 👀 But it varies for different relation types, and scaling doesn't help much! 1/N
I am really excited to reveal what @GoogleDeepMind's Open Endedness Team has been up to 🚀. We introduce Genie 🧞, a foundation world model trained exclusively from Internet videos that can generate an endless variety of action-controllable 2D worlds given image prompts.
Very excited to see these guys launch! Been fantastic to work with @YuxiangJWu during his PhD. Now can’t wait for him to make AI take over my job. That’s only fair!
Very excited to see these guys launch! Been fantastic to work with @YuxiangJWu during his PhD. Now can’t wait for him to make AI take over my job. That’s only fair!
If you're deciding which #NeurIPS23 poster to check out tomorrow, don't forget our forgetting paper! Visit poster #328 Thursday morning to dive into the world of active forgetting. Discover how it enhances language models with greater language plasticity. See you there!
Years of government underinvestment in public services has left us with crumbling schools and record NHS waiting lists. These crises will get worse and worse. We need wealth taxes now, or we face the end of the welfare state. taxjustice.uk/blog/the-uk-fa…
👋 I'm excited to unveil @airstreet’s second fund of $121,212,121 as we accelerate our mission to back ambitious AI-first companies in North America and Europe! 🧵 My reflections on the journey, opportunity and what this means for our founders and community:
1/9 Excited to share "Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis". We've performed a unified evaluation & analysis on probability-based prompt selection methods, increasing the effectiveness from 87.79% to 99.44%! soheeyang.github.io/publication/ya…
obsolete methods for obsolete task
obsolete methods for obsolete task
Introducing ChatArena 🏟 - a Python library of multi-agent language game environments that facilitates communication and collaboration between multiple large language models (LLMs)! 🌐🤖 Check out our GitHub repo: github.com/chatarena/chat… #ChatArena #NLP #AI #LLM 1/8 🧵
Does One Large Model Rule Them All? maithraraghu.com/blog/2023/does… New post with @matei_zaharia and @ericschmidt on the future of the AI ecosystem. Our key question: does the rise of large, general AI models means the future AI ecosystem is dominated by a single general model? ⬇️
I'm excited to share our recent paper on "Ɛ KÚ [MASK]: Integrating Yorùbá cultural greetings into machine translation" that recently got accepted at AfricaNLP @ #ICLR2023 (non-archival) & C3NLP workshop @#EACL2023. paper: arxiv.org/abs/2303.17972 Project led by @alabi_jesujoba
🎉🌐 Big news from @samaya_AI. We have two shiny new offices in #London & #MountainView 🏢, staffed with an incredible team of brilliant minds💡🚀. Check out our freshly launched website at samaya.ai 🌟
Excited to announce that the entire Blueshift team has joined @DeepMind! We will be working with @OriolVinyalsML and others to advance capabilities of LLMs developed by DM / Alphabet! We hope to continue to grow DM's presence in Bay Area and New York in the coming months :-)
Today we release LLaMA, 4 foundation models ranging from 7B to 65B parameters. LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B. The weights for all models are open and available at research.facebook.com/publications/l… 1/n
Jakob Foerster (@j_foerst) and I are hiring a PhD student for our FAIR-Oxford program to work at the intersection of language and RL. The student will spend 50% of their time @UniofOxford and 50% @metaai (FAIR), while completing a DPhil (Oxford PhD). Deadline: 1st of March
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Pasquale Minervini �.. @PMinervini
7K Followers 4K Following Researcher in ML/NLP at the University of Edinburgh (faculty @InfAtEd @EdinburghNLP), @ELLISforEurope, @UCL_NLP, PI for @Clarify2020, https://t.co/WydvfU8ugz he/theyGraham Neubig @gneubig
31K Followers 585 Following Associate professor at CMU, studying natural language processing and machine learning.Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Danish Pruthi @danish037
7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.Jay Alammar @JayAlammar
35K Followers 1K Following Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themJacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIThomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceLeo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.En La Masmédula @EnLaMasmedula
56 Followers 235 FollowingPhillip Lindsay @EastLAPinche
59 Followers 385 FollowingWenzhao Qiu @WenzhaoQiu
2 Followers 138 FollowingDaddyXXXmas @AthomsG
12 Followers 173 FollowingNick Mumero @nickdee96
131 Followers 1K Following Cofounder at Continuum Ads. Focusing on NLP, Simulation Modelling and Optimization.gilangarisptr @Gilangarisptr
158 Followers 1K Following 🎯 My tweets are my own 🗒️ My Retweets are my notes 📊 IG : @jurnaldata.idJack FitzGerald @jgmfitz
4 Followers 187 Following Principal, Applied Scientist at Amazon AGI org; AI model and system builder; LLM researchXubin Ren @xubinrencs
583 Followers 1K Following Ph.D. student of @hkudatascience and @HKUniversity Data Intelligence Lab, fortunately advised by @huang_chao4969. Trying to be a good data science researcher.Simon Dobnik @SimonDobnik
119 Followers 287 Following Professor at University of Gothenburg, Sweden. NLP researcher and lecturer.Jeovane H. Alves @jeohalves
6 Followers 461 Following PhD in Computer Science Research Associate, SEDAN, SnT, University of Luxembourgحسن نجفی @Njfy36986H
10 Followers 209 FollowingKarl Stratos @karlstratos
84 Followers 21 Followingabderrahim zine @abderrahimzine6
25 Followers 593 FollowingAIQUEST @ProAiquest
108 Followers 454 Following Exploring the latest in AI tools and technologies. Join me on a journey into the future of innovation and automation. #AI #chatgpt #TechEnthusiastRafał Okuniewski @dartagnan_pl
35 Followers 679 Following I am interested in machine learning and AI, hope to be better at this every day! Sometimes I retweet politics - please forgive meaman chourasia @aman245_tweets
21 Followers 56 FollowingSgt. Amalia Gump @SgtAmalia45123
18 Followers 600 Following 🇺🇸 USA veteran ⭐ ⭐America First ⭐Secure the Border. I will never accept defeat. I will never quit. God bless U.S.A.🇺🇸Srijith P K @srijithpk
9 Followers 116 Following Machine Learning Researcher, Faculty at IIT HyderabadV @coderboys5
9 Followers 61 FollowingHAFSA SADAF @HAFSA10177938
40 Followers 996 Following Bridging AI and Code || Engineer by Day, AI Enthusiast AlwaysBecky @DD93565
64 Followers 255 Following I'm from Hong Kong, I travel a lot, I've been to every country in the world, I went to Iceland to see the Northern lights, I went to Denmark to see fairy tales.Abdulrahman Tabaza @embed_dim
3 Followers 716 Following enjoyer of various vector spaces, encoders and modalitiesShreya Kapoor @SKapoor_18
329 Followers 1K Following PhD @CogCoVi |Formerly Data Scientist @MPI-CBS| https://t.co/HWJLt7Jhwk. Life Science Informatics @UniBonnFelix Molitor @FelixMolitor
213 Followers 2K Followingyue wang @yuewang89985829
2 Followers 44 Followingraj das @rajdas1947790
15 Followers 295 FollowingXiwen Wei @XiwenWei_
15 Followers 60 FollowingXuhui Zhang @XuhuiZhangXHZ
3 Followers 218 FollowingFreund @sindfreund
46 Followers 183 FollowingJonas Bacci @jonasbacci
12 Followers 64 FollowingCaroline Craig @Carolin35456174
10 Followers 188 Followingララどり d/age IS.. @presklux49
147 Followers 547 Following シンギュラリタリアン。老化を治療し、永遠の若さを手に入れることを目指しています。老化研究を促進するツールとして、人工知能も重視しています。私の夢は、超知能が管理する色々な箱庭世界で、悠久の時を過ごすことです。TensorWave @TensorWaveCloud
572 Followers 613 Following Power up your AI with the leading GPU cloud, featuring AMD Instinct™ MI300X. First-to-market MI300X launch partner with GPUs available and ready to utilize now!Alo @Hal90910
0 Followers 2K FollowingPony @ponylavan
67 Followers 541 Following(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Christopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Pasquale Minervini �.. @PMinervini
7K Followers 4K Following Researcher in ML/NLP at the University of Edinburgh (faculty @InfAtEd @EdinburghNLP), @ELLISforEurope, @UCL_NLP, PI for @Clarify2020, https://t.co/WydvfU8ugz he/theyGraham Neubig @gneubig
31K Followers 585 Following Associate professor at CMU, studying natural language processing and machine learning.Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Yoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCJacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Tal Linzen @tallinzen
16K Followers 894 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIThomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceTim Dettmers @Tim_Dettmers
29K Followers 818 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sOlcan @olcan
4K Followers 867 Following Director of Product @GoogleDeepMind. Prev. Founder/CEO @ Scaled Inference, Engineer @Google (Search, Research, X, Brain). Creator of @EnjoyMindPage.Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Kelvin Guu @kelvin_guu
3K Followers 333 Following Senior staff research scientist @ Google DeepMind leading cross-functional teams of 40+ (research/eng/PM/UI/UX), turning our SOTA research into new AI products.Laurent Sifre @laurentsifre
1K Followers 411 Following Research Scientist @DeepMind since 2014. Worked on #AlphaGo #AlphaFold and #AlphaStar, now focused on #NLP at scale.Aida Nematzadeh @aidanematzadeh
2K Followers 261 Following Research scientist at @DeepMind. She/her.Sarah Catanzaro @sarahcat21
12K Followers 1K Following “All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)Marina Hyde @MarinaHyde
516K Followers 3K Following Guardian columnist. Co-host of The Rest Is Entertainment podcast.Alexander Holden Mill.. @alex_h_miller
759 Followers 441 Following Research Engineering Manager at @MetaAIGeoffrey Irving @geoffreyirving
8K Followers 258 Following Research Director at the UK AI Safety Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc. @[email protected]Keir Starmer @Keir_Starmer
1.4M Followers 415 Following MP for Holborn and St Pancras and Leader of the Labour Party. Former Director of Public Prosecutions.Sohee Yang @soheeyang_
1K Followers 427 Following PhD student/research scientist intern at @ucl_nlp/@GoogleDeepMind (50/50 split). Previously MS at @kaist_ai and research engineer at Naver Clova. #NLProc & MLYossi Adi @adiyossLC
669 Followers 322 Following Assistant Professor @ The Hebrew University of Jerusalem, CSE; Research Scientist @ Meta AI (FAIR); Drummer @ Lucille Crew 🤖🥁🎤🎧🌊Jane Dwivedi-Yu @JaneDwivedi
442 Followers 67 Following Researcher @MetaAI | Former PhD @UCBerkeley and @Cornell alumna.Joelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_QuebecMaria Lomeli @MariaLomeli_
318 Followers 256 Following Researcher and engineer @AIatMeta, FAIR labs | PhD from @GatsbyUCL and former postdoc @CambridgeMLGDario Amodei @Dario_Amodei
2K Followers 15 FollowingDaniela Amodei @DanielaAmodei
6K Followers 300 Following President @AnthropicAI. Formerly @OpenAI, @Stripe, congressional staffer, global developmentMaithra Raghu @maithra_raghu
17K Followers 475 Following Cofounder and CEO @Samaya_AI. Formerly Research Scientist Google Brain (@GoogleAI), PhD in ML @Cornell.Stephen Mayhew @mayhewsw
2K Followers 858 Following Following Ratinov and Roth (2009), we choose to use a Twitter BILOU instead of a Twitter BIO. @duolingoAdji Bousso Dieng @adjiboussodieng
17K Followers 335 Following AI Scientist: @Vertaix_ @GoogleDeepMind 🍒 Academic: @Princeton 🐯 Advocate: @theafricaiknow_🌱Maria Perez-Ortiz �.. @MPerezOrtiz_
501 Followers 760 Following Assist. Prof. @UCLCS & @AI_UCL | Director MSc on #AI for #Sustainability | Into #Ecology, #Education & #PolicyAnna Rogers 🇺🇦�.. @annargrs
9K Followers 863 Following Associate professor @ITUkbh: LLM interpretability, generalization, AI & society. Co-editor-in-chief @ACLRollingReviewMarco Baroni @kumaraja2000
15 Followers 3 FollowingJonas Pfeiffer @PfeiffJo
3K Followers 686 Following Research Scientist @GoogleDeepMind | @AdapterHub | previously @nyuniversity @TUDarmstadt @UKPLab @MetaAI @spotify | https://t.co/oPoAvcAx97 | (he/him)Robert L. Logan IV @rloganiv
232 Followers 506 FollowingLouis Kirsch @LouisKirschAI
2K Followers 776 Following PhD at IDSIA with @SchmidhuberAI. Working on self-improving AI that generalizes (MetaGenRL, VSML, GPICL). @DeepMind @GoogleAI intern, @UCL, @HPI_DE alumnus.Mark Riedl @mark_riedl
32K Followers 1K Following AI for storytelling, games, explainability, safety, ethics. Professor @GeorgiaTech. Associate Director @MLatGT. Time travel expert. Geek. Dad. he/himArthur Mensch @arthurmensch
40K Followers 871 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxAna Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Ming-Wei Chang @mchang21
1K Followers 509 Following Research Scientist @GoogleDeepMind. BERT co-author. Gemini project.David Ifeoluwa Adelan.. @davlanade
2K Followers 1K Following @DeepMind Academic Fellow @uclcs, incoming assistant Professor @mcgillu, Canada CIFAR AI Chair @CIFAR_News | interested in multilingual NLP | Disciple of JesusOreva Ahia @orevaahia
1K Followers 969 Following PhD student @uwcse | ex: AI/ML Research Intern @apple | Co-organizer @AISaturdayLagos | Researcher @MasakhaneNLPthomas @tom_gxt
91 Followers 124 FollowingFlorian Mai 🇺🇳 @_florianmai
1K Followers 1K Following Postdoc at the Machine Learning / Language Intelligence and Information Retrieval group @CW_KULeuven. PhD from @EPFL_en.Robert Stojnic @rbstojnic
3K Followers 488 Following Open source AI. ⌛Past: Llama 2 and Llama 3 technical leadership at Meta AI, Papers with Code co-creator.Jimmy Lin @lintool
13K Followers 842 Following I profess CS-ly at the @UWaterloo and gaze into the technological crystal ball at @Primal. I used to write code for @Twitter and slides for @Cloudera.So glad to share that I am one of the recipients of an @OpenAI Superaligment Fast Grant on the topic of #CoTfaithfulness 🥳🥳
The superalignment fast grants are now decided! We got a *ton* of really strong applications, so unfortunately we had to say no to many we're very excited about. There is still so much good research waiting to be funded. Congrats to all recipients!
Watching AIDE autonomously tackle ML problems by designing, implementing, and iterating on code, conducting experiments, and evaluating results has been nothing short of fascinating. Can't wait to see how it will revolutionize DS and ML workflows, enabling a broader range of…
We're excited to announce AIDE has become the first human-level AI agent for data science! AIDE outperforms half of human data scientists on a wide range of Kaggle competitions, surpassing conventional AutoML, LangChain agents, and ChatGPT with human assistance. 🏆
This seems like a good time to mention that I've taken a part-time role at @GoogleDeepMind working on AI Safety and Alignment!
So excited and so very humbled to be stepping in to head AI Safety and Alignment at @GoogleDeepMind. Lots of work ahead, both for present-day issues and for extreme risks in anticipation of capabilities advancing.
Update: I left Meta yesterday. After 7.5 years. I am sad, nervous, and excited. Sad because I'll miss Meta! I've felt tremendously valued my entire time at Meta (first in FAIR and recently in GenAI). I'll miss the people and being in the thick of things. Nervous because who in…
Today is a big day for our friends at @ProfluentBio! 🧬 We're announcing a $35M Series A, led by @sparkcapital, with significant participation from @airstreet. Here's why I'm psyched for what's to come:
Convince me I'm wrong: Generative AI is the new name for structured prediction. An interviewer asked for a def of GenAI & offhand: "an AI system that generates a complex output at once (vs a single prediction)" I later realized that's ≈identical to the def of SP I'd give ~2005
Anna Rogers joins the ARR as a new Editor-in-Chief! aclrollingreview.org/new-eic/
Glad to share that our AfriCOMET paper has been accepted at #NAACL2024 . See you in Mexico. Try out our model on @huggingface huggingface.co/models?sort=tr… with @jiayiwang0720, @swetaagrawal20 @RicardoRei7 @ebriakou @MarineCarpuat @zodiacJRH @MasakhaneNLP
Happy to share our new paper on developing COMET evaluation metric for African languages. Joint work with Jiayi Wang, Marek Masiak, @zodiacJRH and Pontus Stenetorp from UCL, @MarineCarpuat @swetaagrawal20 @ebriakou @RicardoRei7 and @MasakhaneNLP Paper: github.com/masakhane-io/a…
🥳🍾 It's official - I'm a tenured associate professor! This job is incredible luck and privilege, and @ITUkbh is an amazing place to work at!!! More PhD student and postdoc positions will be announced soon.
Thank you @CIFAR_News , it's official. I am happy to announce that I have been awarded a Canada CIFAR AI Chair 🥳🎉 I am very grateful to CIFAR, @Mila_Quebec and @mcgillu
David Ifeoluwa Adelani (@davlanade @mcgillu @Mila_Quebec) develops machine learning models for under-resourced languages, such as African, Latin-American and Indigenous languages, creating better automatic translations and text-to-speech services. cifar.ca/cifarnews/2024…
Something intensely funny about frontier AI labs spending > $100M to train models, but base public benchmarks on evals put together by poor grad students. (These benchmarks have been pivotal, but prob worth paying the marginal cost to clean them up.)
Frontier models capping out at ~90% on MMLU isn't a sign of AI hitting a wall. It's a sign that a lot of MMLU questions are busted. The field desperately needs better evals.
@riedelcastro @yihong_thu This resonates so much. Not being able to remember mate me rederive all the time too
@sirbayes I think we need to increase the data and its quality, and as in LMs do fine tuning (aka grounding). The model already has compression bottlenecks, which we could improve sure, but the concepts are already there.
I stand by it! Laziest review I've ever seen. I wish people used the public comment feature more to call out this kind of malpractice. (For context, the paper was 6 pages long out of a max page limit of 9)
Just found this OpenReview comment by @NeelNanda5
To learn more flexibly, a new machine learning model selectively forgets what it already knows. @settostun reports: quantamagazine.org/how-selective-…
Periodically resetting the embeddings may sound like a terrible idea when training language models, but it can make them easier to extend to new languages! Quanta magazine is now covering our NeurIPS paper on this topic (led by the amazing @yihong_thu).
To learn more flexibly, a new machine learning model selectively forgets what it already knows. @settostun reports: quantamagazine.org/how-selective-…
Work done with my amazing collaborators @elenagri_, @KassnerNora, @megamor2, @riedelcastro at @GoogleDeepMind ✨ Check out our paper for full details 👉 arxiv.org/abs/2402.16837 🧵🔚
Had a lot of fun working on this with @soheeyang_ and team! I think our results, together with recent works, show that there's still a lot to understand and improve on how LLMs capture and utilize knowledge dependencies.
🚨 New Paper 🚨 LLMs excel at storing facts & in-context reasoning like CoT. But do they latently💭 reason over their parametric knowledge without answering step-by-step? We found positive evidence 👀 But it varies for different relation types, and scaling doesn't help much! 1/N
Come work with Sida and us!
I'm hiring a PhD intern for the FAIR CodeGen (Code Llama) team. Do research on Code LLMs, execution feedback, evaluation, etc. Apply here: metacareers.com/jobs/170210647…