Pavel Izmailov @Pavel_Izmailov
Working on LLM reasoning @OpenAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ #StopWar 🇺🇦 izmailovpavel.github.io San Francisco Joined March 2010-
Tweets590
-
Followers6K
-
Following1K
-
Likes2K
If you're coming to @eccvconf consider our freshly accepted tutorial on: A Bayesian Odyssey in Uncertainty: from Theoretical Foundations 📝 to Real-World Applications 🚀 w/ the amazing @GianniFranchi10 @_olivierlaurent @a1mmer @Pavel_Izmailov More info coming soon #eccv2024
Another fascinating discussion this morning on the future of generative AI in physics, including the insight that "we'll be like cats...is that so bad?" (But in all seriousness, a lot of compelling back and forth on what it means to do science).
We are excited to be organizing a Symposium on the Impact of Generative AI in the Physical Sciences next Thursday, March 14 and Friday, March 15! Join us on the 8th Floor of @MIT_SCC for a great lineup of speakers and panelists. Zoom link available soon. iaifi.org/generative-ai-…
I'm excited to be speaking tomorrow at Boston University, as part of their distinguished speaker series. My talk will be on prescriptive foundations for building autonomous intelligent systems. Talk details: bu.edu/hic/air-distin…
I was very impressed by @martinmarek1999 in this project, look out for more exciting research from him!
I was very impressed by @martinmarek1999 in this project, look out for more exciting research from him!
Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.
I'm glad to see losslandscape.com is still going strong. @ideami has beautiful visualizations. The geometric properties of neural network training objectives, such as mode connectivity, make deep learning truly distinct.
We extend the deadline for one week, the new deadline is Feb 10, 2024, AOE! Looking forward to your submissions!
We extend the deadline for one week, the new deadline is Feb 10, 2024, AOE! Looking forward to your submissions!
Very cool experiment from the preparedness team ☣️
Very cool experiment from the preparedness team ☣️
SWA really is elite.
SWA really is elite. https://t.co/7t84WcPjNq
Weight averaging and model merging for LLMs seem to be the most interesting themes in 2024 so far. What are the benefits? Combining multiple models (or checkpoints) into a single one can improve training convergence, overall performance, and also robustness. I will probably do…
Can LLMs generalize from easy to hard problems? Models actually solve college test questions when trained on 3rd grade questions! 🚨New paper: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks” 🧵1/6
Excited to announce the Workshop on Reliable and Responsible Foundation Models at @iclr_conf 2024 (hybrid workshop). We welcome submissions! Please consider submitting your work here: iclr-r2fm.github.io (deadline: Fed 3, 2024, AOE) Hope to see you in Vienna or…
I'm teaching a new course on AI Alignment this term at the University of Toronto. The first half will cover idealized models of future AI systems (optimal planners, universal induction, etc.), and the second half will cover practical alignment techniques in the context of LLMs.
openai.com/blog/superalig… This group from OpenAI are among the smartest people i have ever met. I'm very pleased to be one of their supporters, please review and apply to work with them !!!!!!!!!!!!
Find us at burning man 2024 🔥 @gruver_nate @LotfiSanae @psiyumm @samuel_stanton_ @polkirichenko @Pavel_Izmailov @KuangYilun @m_finzi @timrudner @ShikaiQiu @yucenlily @andrewgwils
Very nice to see a proper paper from OpenAI once more, congrats to the team! Also really glad that there is this paragraph and citation, because if not, I was gonna go on a rant lol:
Very nice to see a proper paper from OpenAI once more, congrats to the team! Also really glad that there is this paragraph and citation, because if not, I was gonna go on a rant lol: https://t.co/hsvq0lJlUr
New direction for AI alignment — weak-to-strong generalization. Promising initial results: we used outputs from a weak model (fine-tuned GPT-2) to communicate a task to a stronger model (GPT-4), resulting in intermediate (GPT-3-level) performance.
New direction for AI alignment — weak-to-strong generalization. Promising initial results: we used outputs from a weak model (fine-tuned GPT-2) to communicate a task to a stronger model (GPT-4), resulting in intermediate (GPT-3-level) performance.
new paper! one reason aligning superintelligence is hard is because it will be different from current models, so doing useful empirical research today is hard. we fix one major disanalogy of previous empirical setups. I'm excited for future work making it even more analogous.
new paper! one reason aligning superintelligence is hard is because it will be different from current models, so doing useful empirical research today is hard. we fix one major disanalogy of previous empirical setups. I'm excited for future work making it even more analogous. https://t.co/JdMkm1xxm5
Excited to see this. Academics interested in alignment, take a look:
Excited to see this. Academics interested in alignment, take a look:
Kevin Patrick Murphy @sirbayes
42K Followers 333 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Michael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVAndreas Kirsch 🇮�.. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pyobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsSam Power @sp_monte_carlo
17K Followers 7K Following Lecturer in Maths & Stats at Bristol. Interested in probabilistic + numerical computation, statistical modelling + inference. (he / him)Polina Kirichenko @polkirichenko
3K Followers 1K Following PhD student at New York University, Visiting Researcher at @MetaAI FAIR Labs 🇺🇦Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Mathieu Alain @miniapeur
19K Followers 2K Following Researching @ai_ucl. Co-organises @uclcsml and @logconference. FR, EN, trying ES. 🇹🇼🇨🇦🇬🇳🇺🇸🇩🇴🇫🇷🇪🇸🇬🇧🇿🇦Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.Zachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Tom Goldstein @tomgoldsteincs
23K Followers 2K Following Professor at UMD. AI security & privacy, algorithmic bias, foundations of ML. Follow me for commentary on state-of-the-art AI.Kyle Cranmer @KyleCranmer
16K Followers 3K Following Director Data Science Institute @UWMadison @datascience_uw. EiC @MLSTjournal. Physics, stats/ML/AI, open science. same handle @sigmoid.social and bskyDustin Tran @dustinvtran
40K Followers 648 Following Research Scientist at Google DeepMind. I lead evaluation at Gemini / Bard. AI, Bayesian statistics, deep learning.Jascha Sohl-Dickstein @jaschasd
19K Followers 623 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.Alex Dimakis @AlexGDimakis
13K Followers 2K Following UT Austin Professor. Researcher in Machine Learning and Information Theory. National AI Institute on the Foundations of Machine Learning (IFML) Co-director.haroon thabit @haroon_thabit
4 Followers 45 FollowingPrithvijit @prithvijitch
371 Followers 1K Following On the industry job market | CS PhD student @ICatGT; CV / ML; Former Intern @allen_ai , @MSFTResearch, @virginia_techZezheng Song @ZezhengSong96
132 Followers 403 Following Ph.D. Candidate in Applied Mathematics at UMD | Scientific machine learning, dynamical systems, numerical linear algebra, etc.Matthew Clarke @Matthew05049818
0 Followers 2K FollowingMochigh @0bllz
6 Followers 74 FollowingYating Wu @YatingWu96
142 Followers 188 Following ECE Ph.D student @ UT Austin, advised by @jessyjli and @AlexGDimakis | テキサス大学に在籍する博士生aod @AOguzDogru1
15 Followers 1K Followingirfan1143 @gimseun82430329
1 Followers 20 FollowingQianyi Zhang @QianyiZhan95891
8 Followers 71 FollowingJohn @JohnKingJrLead
23 Followers 245 FollowingParallel @useparallel
1K Followers 218 Following The HiringOS for Modern Teams. Everything you need to build your team, powered by AI. Match with talent instantly, and save up to 70% on your next hire.AW @AW_415
0 Followers 1K FollowingIndrashis Das 🇮�.. @IndrashisDas98
31 Followers 211 Following Pursuing https://t.co/gx6JVed94Y. CS (AI) @UniFreiburg | Student Assistant @AutoML_org | Working Student at Greenventory | All about ML, DL, RL, AutoML, NLP, CVAakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changeJunsol Kim @JunsolK
214 Followers 516 Following Ph.D. student in Sociology @UChicago. Computational Social Science, Collective/Artificial Intelligence.Aditya Chetan @justachetan
1K Followers 2K Following PhD student at @CornellCIS (@cs_cornell); ex-Research Fellow at @MSFTResearch India (@IndiaMSR); @IIITDelhi alum; All opinions are my own. he/him/hisvinicius @v_riffel
21 Followers 25 FollowingNayantara Mudur @nmudur97
2 Followers 97 Following Physics PhD student @Harvard | Student Researcher @Google | IIT Delhi '19Srijith P K @srijithpk
9 Followers 116 Following Machine Learning Researcher, Faculty at IIT HyderabadLeland Rayner US @NVIDIARayner
3 Followers 48 Following彡 Nᗩᖇᒪi 彡 @BGnarli
12K Followers 2K Following ┃Hello fellow! 💜┃AI ART CONDUCTOR┃Tech-Art┃Deep within latent space.┃#Art #digitalart #AIart ┃Air Thirstytaozr @TaoZerui
5 Followers 58 FollowingJordan Plows @jordantplows
1K Followers 999 Following building a GPS for LLM's @stableagentsai + tinkering with AI Hardware.dlw @dlwj_contact
12 Followers 282 Following muse for the AI. a muse creates musing while a rival muse creates amusingMichael Spencer @Michael_AI_bro
143 Followers 462 Following AI Supremacy | AI Report | OK Robot | Semiconductor Things | + 8 others.Shuge Lei @lei_shuge
27 Followers 119 Following Fifth-year doctoral student in CS @UofSC Working on #AIforHealthcare #TrustworthyAI #MedicalImaging #AIforMedConsultingGautham Elango @gautham_elango
644 Followers 2K FollowingMariam.. @MaroAbdElRahman
273 Followers 1K FollowingCHILLQQQ @CHILLQQ
37 Followers 224 Following Mechanical engineering PhD student at Tufts University.Joe (COGSPA: Cognitiv.. @cogspa
5K Followers 5K Following Exploring generative art, human art sketches, and 3D design & printing. Pioneering new art innovations. Author of 'Beginning Design for 3D Printing'Amardeep Kumar @ad_6398
15 Followers 24 Following MSCS @NYU_Courant || CS'20 - @IITISM_DHANBAD Building and Scaling #LLM apps Interested in #NLPproc #ML #GenAICas (Stephen Casper) @StephenLCasper
3K Followers 1K Following #AI safety & responsibility. PhD Candidate @ #MIT_CSAIL.Wenyue Hua @HuaWenyue31539
96 Followers 265 Following Ph.D. candidate @RutgersU CS B.S. Math & B.A. Linguistics @UCLA ex-intern @AmazonScience, #Tencent Trustworthy AI, LLM, LLM-based agentdanish @danish30394793
92 Followers 5K Following Technology & entrepreneurship - build & inspire orgs - Computation for DL, Accelerators, Signal Pr, Systems & Bioinformatics; https://t.co/7Rs1gdYZSzBob @BlastingRayBob
7 Followers 78 FollowingKatya Klinova @klinovakatya
1K Followers 2K Following I lead data and AI efforts at the UN Secretary-General's Innovation Lab. Views my own.Vincent Weisser @vincentweisser
9K Followers 487 Following founder @primeintellect 🤖 / decentralized ai + science 🔬 / core @vita_dao @molecule_dao @bio_xyzMatteo Olivato @mttlvt93
16 Followers 116 FollowingKevin Patrick Murphy @sirbayes
42K Followers 333 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRAlfredo Canziani @alfcnz
86K Followers 268 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York UniversityKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).David Pfau @pfau
22K Followers 1K Following Knowledge manifests itself in radiant dreams that shimmer like the wild sun Views are my own pfau at sigmoid dot social on 🦣 https://t.co/xqtVHHVI17 on 🦋Michael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVNeurIPS Conference @NeurIPSConf
111K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].Andreas Kirsch 🇮�.. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkPyTorch @PyTorch
379K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundationFrançois Fleuret @francoisfleuret
31K Followers 456 Following Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.Behnam Neyshabur @bneyshabur
18K Followers 689 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpackingProf. Anima Anandkuma.. @AnimaAnandkumar
25K Followers 2K Following Bren Professor @caltech, Fmr Sr Director of #AI research @nvidia, Fmr Principal Scientist @awscloud, AI+Science, PDE, Neural operators. Views my own.Sindy Löwe @sindy_loewe
3K Followers 361 Following PhD Student with @WellingMax at the University of Amsterdam. Deep Learning with Structured Representations.Frank Nielsen @FrnkNlsn
23K Followers 1K Following Machine Learning & AI, Information Sciences & Information Geometry, Distances & Statistical models, HPC. "Geometry defines the architecture of spaces" @SonyCSLFerenc Huszár @fhuszar
40K Followers 1K Following Secular Bayesian. Associate Professor in Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey Alum of @Twitter, Magic Pony and @BaldertonSoumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pJia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changeKatya Klinova @klinovakatya
1K Followers 2K Following I lead data and AI efforts at the UN Secretary-General's Innovation Lab. Views my own.Vincent Weisser @vincentweisser
9K Followers 487 Following founder @primeintellect 🤖 / decentralized ai + science 🔬 / core @vita_dao @molecule_dao @bio_xyzRoss @rpoo
25K Followers 1K FollowingTom Brown @nottombrown
5K Followers 524 Following @AnthropicAI, GPT-3, AI alignment, robustness, etc. Cautiously optimistic.Grant Sanderson @3blue1brown
365K Followers 362 Following Pi creature caretaker. Contact/faq: https://t.co/brZwdQfdifElon Musk @elonmusk
181.2M Followers 584 FollowingVictoria X Lin @VictoriaLinML
3K Followers 761 Following Research Scientist @AIatMeta Foundational AI Research • ex-@SFResearch • PhD @uwcse 📜 https://t.co/j6QTac5q0rSamuel Marks @saprmarks
695 Followers 79 Following Postdoc studying interpretability for AI safety under @davidbau. PhD in math from @harvard. Previously director of technical programs at https://t.co/FxRv4QgERO.Xian Li @xl_nlp
2K Followers 242 Following Research Scientist @MetaAI. NLP, ML. Opinions are my own.Rohin Shah @rohinmshah
5K Followers 89 Following Research Scientist at DeepMind. I publish the Alignment Newsletter.Sooyeon Jeong @SooyeonJeong6
90 Followers 297 Following Assistant Professor at @PurdueCS, Director of @HAIPurdue Formerly @cbitshealth @medialab @MIT Design and deploy interactive agents for human flourishing!Eliot Eshelman @hpc_twit
564 Followers 1K Following I enable researchers & educators with leading HPC/AI technologies @NVIDIA. From muons to bacteriophages to astrophysics. Opinions my own. he/him #InclusionYuhuai (Tony) Wu @Yuhu_ai_
23K Followers 411 Following Co-Founder @xAI. Minerva, STaR, AlphaGeometry, AlphaStar, Autoformalization, Memorizing transformer.Dawn Song @dawnsongtweets
29K Followers 840 Following Professor in Computer Science at UC Berkeley; Research in AI, Security, Blockchain; Serial entrepreneurDevendra Chaplot @dchaplot
8K Followers 364 Following Building next-gen AI at @MistralAI. Past: Research Scientist at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.Teven Le Scao @Fluke_Ellington
2K Followers 549 Following Researcher @MistralAI, producer @ my bedroom, no BLOOM slander authorized on this accountMistral AI @MistralAI
90K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPnoahdgoodman @noahdgoodman
2K Followers 109 Following Professor of natural and artificial intelligence @Stanford. Research Scientist at @GoogleDeepMind. (@StanfordNLP @StanfordAILab etc)mrinank ⭐️ @MrinankSharma
817 Followers 435 Following alignment, poetry, soulmaking, devotion "live to the point of tears", camusPriya Goyal @priy2201
1K Followers 499 Following Founding member @datologyai, ex-Google Deepmind, ex-Facebook AI Research (FAIR).Greg Yang @TheGregYang
53K Followers 661 Following Cofounder https://t.co/SpHbO7FZNV. Morgan Prize Honorable Mention 2018. Developing the theory of #TensorPrograms and the practice of scaling #neuralnetworks.Erika Alden DeBenedic.. @erika_alden_d
3K Followers 2K Following PI @TheCrick and founder of @Align_Bio. Former astronomer 🌎 recovering computer scientist 🤖 current synthetic biologist 🧬🧪Stephen Wolfram @stephen_wolfram
149K Followers 4 Following Creating ideas, technology, science, companies, books, ... #WolfLang #WolframPhysics #WolframAlpha #Mathematica @WolframResearchAlexander Borzunov @sasha_borzunov
476 Followers 300 FollowingThea Klaeboe Aarresta.. @Thea_kaa
647 Followers 393 Following Particle physicist at @ETH. Like ML, FPGAs and quantum fields. Preferably together. Also at @CERN @CMSexperimentMarat Dukhan @MaratDukhan
1K Followers 219 Following Building AGI @OpenAI. Previously TLM for XNNPACK @GoogleAI, lead for QNNPACK @FacebookAI & author of NNPACK. Opinions are my ownDavid W Hogg @davidwhogg
11K Followers 802 Following peace, cosmology, stars, exoplanets, engineering, data analysis, emcee, wobble, The Cannon, https://t.co/GDgZayQiDJ, The Joker, #openscience, #otherpeoplesdataDivya Shanmugam @dmshanmugam
957 Followers 725 Following typing enthusiast (and PhD student @MIT_CSAIL)Denis Semenenko @ds_dsdsdsdsds
5 Followers 34 FollowingIAIFI @iaifi_news
1K Followers 328 Following The NSF AI Institute for Artificial Intelligence and Fundamental InteractionsIfigeneia Apostolopou.. @ifaposto
273 Followers 139 FollowingAnton Bakhtin @ SF @anton_bakhtin
2K Followers 126 Following MTS at @AnthropicAI, Ex @MetaAI, Ex @Google Three logicians walk into a bar ...Michael Hu @michahu8
260 Followers 291 Following PhD student @NYU. NLP & training data. @NSF GRFP fellow. Previously @princeton_nlp, @cocosci_lab.Ted Sanders @sandersted
6K Followers 730 Following Researcher at OpenAI. Be kind to others, and yourself.Alex Yanko 🇺🇦 @LeopolisDream
1K Followers 3K Following Former Head of Data and Analytics, working on SaaS projects. Data Science 🔭 Analytics 📈 Engineering.trieu @thtrieu_
2K Followers 241 Following thinking about thinking. created alphageometry, darkflow. prev: nyu, google brain/deepmindSophia @sopharicks
667 Followers 1K Following Former ballerina turned AI writer& communicator. OpenAI alumni. Fan of astrophysics, open-source, conversations about singularity. Founder of BuzzRobot.Peter Tong @TongPetersb
296 Followers 53 Following Berkeley 23', CS PhD Student in NYU Courant advised by Professor @ylecun and Professor @sainingxieMason Meyer @masonmeyer_
233 Followers 125 Following research @openai. but ever with the eternal goal of the true, the beautiful, and the good.Qinyuan Ye @qinyuan_ye
2K Followers 1K Following 👩💻 Ph.D. student @nlp_usc @CSatUSC @USC_ISI | 🐾 Teaching machines to be more versatile and curious.Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ @lmsysorg :)) 400B is still training, but already encroaching…
It was very nice to be able to meet and interact with the US House’s Bipartisan Task force in AI. Hope the conversation can lead to more reliable and trustworthy applications of AI
Thrilled to be invited to to meet and engage with the U.S. House’s Bipartisan Task Force on AI! Excited to dive into critical conversations about the future of artificial intelligence. democraticleader.house.gov/media/press-re…
I really respect @Meta's open approach to LLM research. Llama-3, an open model with first class performance. Congratulations to the team. I'm looking forward to the paper!
Llama 3 has been my focus since joining the Llama team last summer. Together, we've been tackling challenges across pre-training and human data, pre-training scaling, long context, post-training, and evaluations. It's been a rigorous yet thrilling journey: 🔹Our largest models…
I got tenure! It was fitting that I got to celebrate with the lab right after the news. Working together for the last 6.5 years has been a blast.
If you're coming to @eccvconf consider our freshly accepted tutorial on: A Bayesian Odyssey in Uncertainty: from Theoretical Foundations 📝 to Real-World Applications 🚀 w/ the amazing @GianniFranchi10 @_olivierlaurent @a1mmer @Pavel_Izmailov More info coming soon #eccv2024
Grok is going multimodal! It’s incredible to see how fast a small, focused team can move. Kudos to the amazing team @xai that made this possible x.ai/blog/grok-1.5v
Imagine a future where toddlers need two NeurIPS papers to get into top daycares—one as the lead author!
Glad to have made a small contribution to it!
Our new GPT-4 Turbo is now available to paid ChatGPT users. We’ve improved capabilities in writing, math, logical reasoning, and coding. Source: github.com/openai/simple-…
I'm so glad that I wasn't working on ML papers during high school summers, even if that had been possible. Go outside. Learn a musical instrument. Read. Write. Travel. Daydream. Organically develop your interests without pressure. Grow your foundations.
Our research on easy-to-hard generalization will be supported by the OpenAI Superalignment Fast Grant. Congratulations to the team and stay tuned!😎
🌟Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision 🌟 arxiv.org/abs/2403.09472 How can we keep improving AI systems when their capabilities surpass those of human supervisors? (1/n)
@ESYudkowsky while computers may excel at soft skills like creativity and emotional understanding, they will never match human ability at dispassionate, mechanical reasoning
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
I’ll give a talk about UViM (and similar methods X-Decoder, UnifiedIO, etc), RL-tuning in vision and maybe PaLI. Very relevant to recent LLM+vision I’ll do my best to make the general idea as accessible as possible rather than saying “here’s our method, it’s sota, look, tables”
Announcing 📢 a special ZurichCV Meetup #2! Lucas Beyer @giffmana from @GoogleDeepMind will be speaking about structured computer vision in the age of LLMs on April 16th at 17:00. Come check it out! 🚀 zurich-nlp.ch/event/zurichcv…
⌘R+ Welcoming Command R+, our latest model focused on scalability, RAG, and Tool Use. Like last time, we're releasing the weights for research use, we hope they're useful to everyone! txt.cohere.com/command-r-plus…
What algorithms can Transformers learn? They can easily learn to sort lists (generalizing to longer lengths), but not to compute parity -- why? 🚨📰 In our new paper, we show that "thinking like Transformers" can tell us a lot about which tasks they generalize on!
Tool use is now available in beta to all customers in the Anthropic Messages API, enabling Claude to interact with external tools using structured outputs.
🚨 Are leading safety-aligned LLMs adversarially robust? 🚨 ❗In our new work, we jailbreak basically all of them with ≈100% success rate (according to GPT-4 as a semantic judge): - Claude 1.2 / 2.0 / 2.1 / 3 Haiku / 3 Sonnet / 3 Opus, - GPT-3.5 / GPT-4, - R2D2-7B from…
If this were a science paper, you would expect a country that picks its science workforce at random as a “weak baseline” and a leading nation like the US to actively experiment towards state-of-the-art, or at least beat the baseline. Not providing a guaranteed path for…
H1B lottery ❌ It was less than a 1 in 3 chance, but sucks anyway!