Valentin Thomas @_valthomas
Researcher, PhD from Mila_Quebec. Former intern at @deepmind, @Google Brain and @element_ai. Interested in all things around RL and control. valthom.github.io Joined July 2016-
Tweets97
-
Followers281
-
Following989
-
Likes1K
Maybe to one's surprise, taking KL estimates as `kl_loss` to minimize does *not* enforce the KL. This implementation, however, is quite common in open source RL repos and recent research papers. In short: grad of an unbiased KL estimate is not an unbiased estimate of KL grad.
Can neural networks learn to map from observational datasets directly onto causal effects? YES! Introducing CausalPFN, a foundation model trained on simulated data that learns to do in-context heterogeneous causal effect estimation, based on prior-fitted networks (PFNs). Joint…
Optimization hyperparameters (LR, schedule, weight decay) do not affect loss-to-loss scaling of LLMs (which could be seen as a proxy for generalization). ☄️ Unclear: how about different optimizers (Shampoo, ScheduleFree...)? Plots from this paper: arxiv.org/pdf/2502.12120
*Every single* cure for a disease ultimately flowed from basic exploratory research. Stopping basic research is like stopping the mountain rains and expecting rivers of cures to still flow. Examples: 1) studying saliva of Gila monster -> GLP1's 2) studying funghi -> first…
*Every single* cure for a disease ultimately flowed from basic exploratory research. Stopping basic research is like stopping the mountain rains and expecting rivers of cures to still flow. Examples: 1) studying saliva of Gila monster -> GLP1's 2) studying funghi -> first…
Today we’re thrilled to announce that real-time and historical AI-based weather forecasts from @Google’s WeatherNext suite of models are now available on Earth Engine and BigQuery. Anyone can access and use these data for research, analysis and operational decision making, which…
With R1, a lot of people have been asking “how come we didn't discover this 2 years ago?” Well... 2 years ago, I spent 6 months working exactly on this (PG / PPO for math+gsm8k), but my results were nowhere as good. Here’s my take on what blocked me and what’s changed: 🧵
Bonjour-Hi! 1) We moved to Montreál! It is good to be back and lovely so far. 2) I joined the Department of Computer and Software Engineering of the Polytechnique Montréal @polymtl as an associate professor and Mila @Mila_Quebec as the core academic member. 🇨🇦 More news to come!
🚀Our tabular foundation model (TabDPT) is out and performs out-of-the-box classification/regression tasks with a single forward pass! 📊Test out the code: github.com/layer6ai-labs/… 📜Paper: arxiv.org/pdf/2410.18164… To learn more, follow the thread! 🧵(1/5)
Introducing ReSearch: An iterative self-reflection algorithm that enhances LLM's self-restraint abilities: • Encouraging abstention when uncertain • Producing accurate, informative content when confident Result: Significant accuracy boost for Llama2 7B Chat and Mistral 7B! 🚀
OK, time for some tweets about distances between Markov chains! Actually this is about a preprint we've just posted on arxiv with Sergio Calo, Anders Jonsson, Ludovic Schwartz & Javier Segovia-Aguas. FFO optimal transport & bisimulation. Let's dig in! arxiv.org/abs/2406.04056 1/n
100%! Investigative papers are my favorite to write yet they are so hard to publish if you don't have SotA or an all-explaining theoretical result. However the issues they raise are sometimes what makes science progress further later on.
@le_roux_nicolas, @BachFrancis: today it is the 10-year anniversary of the stochastic average gradient (SAG) paper getting rejected from ICML.
Aaand it's a 100 ⭐️ I've been working on #PaperMemory with 1 goal: automate the recording of papers I read on @arxiv_org or @openreviewnet and the discovery code w/ @paperswithcode. It's a simple browser extension that changed my #research workflow github.com/vict0rsch/Pape…
My PhD thesis is now online (thesis.library.caltech.edu/14178/1/joseph…).
We're pleased to inform you that your ICML'21 submission Beyond Variance Reduction: Understanding the True Impact of Baselines [...] has been accepted! Huge congrats to Wesley & Valentin for their persistence and hard-work! w/ @wes_chung*, @lanternol* & @le_roux_nicolas. 🧵👇🏻
We're pleased to inform you that your ICML'21 submission Beyond Variance Reduction: Understanding the True Impact of Baselines [...] has been accepted! Huge congrats to Wesley & Valentin for their persistence and hard-work! w/ @wes_chung*, @lanternol* & @le_roux_nicolas. 🧵👇🏻
Ça s'est passé samedi à Paris. 15 minutes de coups et d'insultes racistes. La folle scène de violences policières que nous révélons est tout simplement inouie et édifiante. Il faut la regarder jusqu'au bout pour mesurer toute l'ampleur du problème.

Mila - Institut québ... @Mila_Quebec
34K Followers 549 Following Le plus grand centre de recherche universitaire en apprentissage profond. The largest academic research center in deep learning. 🦋@mila-quebec.bsky.social
Pau Rodríguez @prlz77
2K Followers 1K Following Research Scientist @Apple MLR on #machine_learning understanding and robustness. @ELLISforEurope member. Previously at ServiceNow and Element AI in Montréal.
Lucas Caccia @LucasPCaccia
1K Followers 681 Following Sr Researcher @ MSR Montréal. PhD from MILA / McGill
Dinghuai Zhang 张鼎... @zdhnarsil
4K Followers 2K Following Researcher at @MSFTResearch. Prev: PhD at @Mila_Quebec, intern at @Apple MLR and FAIR Labs @MetaAI, math undergraduate at @PKU1898.
Michal Valko @misovalko
8K Followers 8K Following Building something new · Chief Models Officer @ Stealth Startup & Inria & MVA - Ex: Llama @AIatMeta Gemini and BYOL @GoogleDeepMind
Gauthier Gidel @gauthier_gidel
1K Followers 192 Following I am an assistant professor at Université de Montréal (UdeM) at DIRO, a core faculty member of Mila, and a Canada CIFAR AI chair holder
Mohammad Pezeshki @mpezeshki91
1K Followers 250 Following
Ethan Caballero @ethanCaballero
11K Followers 2K Following ML @Mila_Quebec ; previously @GoogleDeepMind
John D. Martin @jdmartin86
599 Followers 740 Following Fellow @ Openmind Research Institute. Adjunct Professor @UAlberta. Thinking about AI and RL.
Marlos C. Machado @MarlosCMachado
7K Followers 737 Following Assistant Professor @UAlberta. @AmiiThinks Fellow, Canada CIFAR AI chair.
Andreas Kirsch 🇺�... @BlackHC
14K Followers 6K Following My opinions only here. 👨🔬 RS @DeepMind, @Midjourney 1y 🧑🎓 DPhil @AIMS_oxford @UniofOxford 4.5y 🧙♂️ RE DeepMind 1y 📺 SWE @Google 3y 🎓 TUM 👤 @nwspk
Sébastien Lachapelle @seblachap
804 Followers 484 Following Research Scientist at SAIL Montreal (Samsung) interested in causality and identifiable representation learning. PhD from @Mila_Quebec, @UMontrealDIRO
Benno Krojer @benno_krojer
2K Followers 2K Following AI phding @Mila_Quebec @mcgillu (past: @AIatMeta). Interests: interpretability, language grounding (V+L), evals, reasoning. Vanier Scholar. 🥏⚽🥨
Vitória Pacela @vpacela
668 Followers 1K Following PhD student @Mila_Quebec, @UMontreal. Currently interning at @CSHL. Previously: @AIatMeta, @helsinkiuni. She/elle/ela.
Jad Kabbara @jad_kabbara
1K Followers 749 Following NLP Postdoc @MIT Center for Constructive Communication (CCC). PhD from McGill University @rllabmcgill & @Mila_Quebec. @AUB_Lebanon alum.
Vincent François-Lav... @VinFL
2K Followers 796 Following Assistant Professor in machine learning @VUAmsterdam. Abstract representations+Learning+Reasoning. Deep RL book: https://t.co/09Gk8lkE7o.
Massimo Caccia @MassCaccia
2K Followers 618 Following Research Scientist @ServiceNowRSRCH. Gradient-descent enthusiast building LLM agents. Formerly @Mila_Quebec, @GoogleDeepmind, @AWScloud, @SpotifyResearch.
Rofu @Rofu9683
28 Followers 2K Following
Taco Cohen @TacoCohen
27K Followers 3K Following Post-trainologer at FAIR. Into codegen, RL, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.
Alek Dimitriev @tensor_rotator
455 Followers 1K Following Inference @AnthropicAI, prev Gemini @Google, prev prev PhD @UTAustin
FK @byambamk
165 Followers 1K Following Full Stack Developer. Microsoft Azure, Amazon Web Services. Biometric Solutions.
Tristan Lecourtois @tristanlecourt
7 Followers 152 Following M2 MVA @ENS_Paris_Saclay | Prev. @NASAJPL | Mines Saint-Étienne
Samuel Lavoie @lavoiems
770 Followers 561 Following On the job market ~ PhD candidate @Mila_quebec, @UMontreal. Ex: FAIR @AIatMeta. Amortizing compute and learning representations.
Li_F2_H2 @Li_F2_H2
27 Followers 1K Following
Alessandro Belli @BelliAlessandro
14 Followers 637 Following
Valentina Tardelli @ValentinaT32922
87 Followers 6K Following
Tamuz Gindes @TeaG111
5 Followers 232 Following
Yair Wolff @WolffYair
344 Followers 6K Following
The 69 Controversies ... @69AIControversy
238 Followers 7K Following The 69 Controversies of AI Adoption | Spreading the Word on AI Adoption | From the author of The Last AI @The_Last_AI @s_m_sohn |5/25/25| https://t.co/eMyARc66RG
Yura Gorishniy @YuraFiveTwo
396 Followers 119 Following I do research on deep learning for tabular data
Harris Chan @SirrahChan
3K Followers 2K Following Research Scientist at @GoogleDeepMind, ML PhD @UofT/@VectorInst. EngSci Grad. Former Canadian Rubik's Cube Champion.
Shrawal @shrawalneema
262 Followers 465 Following
Vahid Balazadeh @vahidbalazadeh
88 Followers 218 Following PhD Student at @UofT and @VectorInst. Prev. Research Intern @Autodesk
Sawakau @Sawakau97939
2 Followers 21 Following
Erdal SATIK @erdalsatik
460 Followers 991 Following Software Development Manager | Engineering Management @ Ex. Trendyol, Turkcell | AI & ML Enthusiast | Driving Innovation in Tech | Agile Leader
Mehrdad Noori @NooriMe95
337 Followers 5K Following PhD at LIVIA/ILLS, ÉTS Montreal | ML Researcher at Zebra
Utopic e/λ @UtopicDev
271 Followers 5K Following AI Designer and Builder. Technology to save the world. There Is No Planet B...
Pierre Chambon @PierreChambon6
820 Followers 2K Following NLP/Code Generation PhD at FAIR (Meta AI) and INRIA - previously researcher at Stanford University - MS Stanford 22’ - Centrale Paris P2020
Giulio Corallo @giuliocorallo98
10 Followers 23 Following
Emile van Krieken @EmilevanKrieken
2K Followers 816 Following Postdoc @ VU Amsterdam, prev University of Edinburgh | Neurosymbolic Machine Learning Mostly moved to 🦋, will only post news here
Alec Segal @alikrs
37 Followers 2K Following
Yigal Weinberger @yigalpage
2K Followers 7K Following Posts about LLM's, Data Science, Finance and Personal Growth. While I'm not here, the CTO Of Analysta - AI for risk estimation for the commodities market
I07XNbUI4 @DeepFeed2
88 Followers 6K Following
Nadhir Hassen @vincehass
2 Followers 39 Following ML Graduate Research, interested in deep RL and Bayesian Optimization. Application in industrial optimization an drug discovery. PhD Candidate in ML.
Salisu Borodo @Thalithu
497 Followers 7K Following
Chand @Chand7449215950
56 Followers 3K Following
Smoatough @SmoatoughJbk
109 Followers 3K Following
Pierfrancesco Beneven... @PierBeneventano
537 Followers 861 Following Postdoc at @MIT | ex PhD student @Princeton | Exploring how to train AIs and their interaction with the world, while brewing my espresso.
angel_a_i @angel_a_i
668 Followers 7K Following People got to experience the AngelAi magic and how easy it is to use.
Joel Jojo @JoelSwanjo
1 Followers 250 Following
Subramanyam Sahoo @iamwsubramanyam
193 Followers 4K Following Independent AI Safety researcher, M. Tech x Summa Cum Laude @NITHamirpurHP. BASIS Fellow @UCBerkeley, RA @HarvardAISafety. Get Published or Die Trying.
aleho @Maikusobu
396 Followers 3K Following
Tad Borthwick @TadBorth
0 Followers 627 Following
Ameen Patel @Ameen_ml
1K Followers 1K Following Inference @PrimeIntellect, prev @togethercompute, @AmazonScience, @uwaterloo
leloy! @leloykun
7K Followers 4K Following Math @ AdMU • NanoGPT speedrunner • Muon fan 🤍 • prev ML @ XPD • 2x IOI & 2x ICPC • https://t.co/nfO038itfn
Shibo Hao @Ber18791531
2K Followers 1K Following Ph.D. student @UCSanDiego Previous Research Scientist Intern @AIatMeta. B.S. in Computer Science @PKU1898 Coconut, Reasoning-via-planning, ToolkenGPT
Thijs Bergkamp @ThijsBergkamp
81 Followers 7K Following
Ioannis Mitliagkas (�... @bouzoukipunks
4K Followers 876 Following Associate prof. at the University of Montréal and Mila. Research scientist Google DeepMind. Previously Stanford; UT Austin.
Pablo Samuel Castro @pcastr
13K Followers 830 Following Señor swesearcher @ Google DeepMind. Adjunct prof @ U de Montreal & Mila. Musician. From 🇪🇨 living in 🇨🇦.
Marc G. Bellemare @marcgbellemare
16K Followers 349 Following CSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).
Yann LeCun @ylecun
955K Followers 765 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Google DeepMind @GoogleDeepMind
1.2M Followers 279 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
Koustuv Sinha @koustuvsinha
3K Followers 810 Following Research Scientist @MetaAI; PhD from @mcgillu + @Mila_Quebec; I organize ML Reproducibility Challenge (@repro_challenge). I do research in Multimodal ML
Mila - Institut québ... @Mila_Quebec
34K Followers 549 Following Le plus grand centre de recherche universitaire en apprentissage profond. The largest academic research center in deep learning. 🦋@mila-quebec.bsky.social
Lucas Caccia @LucasPCaccia
1K Followers 681 Following Sr Researcher @ MSR Montréal. PhD from MILA / McGill
Clément Canonne (on ... @ccanonne_
37K Followers 65 Following Senior Lecturer @Sydney_Uni. Formerly Postdocs @IBMResearch, @Stanford; PhD @Columbia. Converts ☕ into puns: sometimes theorems. He/him. @ccanonne.bsky.social
🇺🇦 Dzmitry Bahd... @DBahdanau
10K Followers 37 Following Team member at something young. Adjunct Prof @ McGill. Member of Mila, Quebec AI Institute. Stream of consciousness is my own.
Taylor W. Killian @tw_killian
3K Followers 891 Following Senior Research Scientist @MBZUAI @a16z , interested in Decision Making & Generalization // @BYU '13; @Harvard '17; @UofT '24
Dinghuai Zhang 张鼎... @zdhnarsil
4K Followers 2K Following Researcher at @MSFTResearch. Prev: PhD at @Mila_Quebec, intern at @Apple MLR and FAIR Labs @MetaAI, math undergraduate at @PKU1898.
Khimya @khimya
4K Followers 975 Following Research Scientist @GoogleDeepmind Affiliate Faculty @Mila_Quebec Past: PhD @mcgillu @MSFTResearch @Intel @UF @IITKanpur Bosch @VIT_univ she/her Views are mine!
Hattie Zhou @oh_that_hat
10K Followers 852 Following I want to understand things deeply and explain them well. Building friendly AI @AnthropicAI Give me anonymous feedback: https://t.co/7aBNrpbad8
Michal Valko @misovalko
8K Followers 8K Following Building something new · Chief Models Officer @ Stealth Startup & Inria & MVA - Ex: Llama @AIatMeta Gemini and BYOL @GoogleDeepMind
Ferenc Huszár @fhuszar
42K Followers 1K Following Secular Bayesian. Professor of Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey Alum of @Twitter, Magic Pony and @Balderton
Siddarth Venkatraman @siddarthv66
477 Followers 436 Following PhD at Mila | RL and other stuff I find interesting
Catherine Arnett @linguist_cat
1K Followers 581 Following NLP Researcher @AiEleuther. PhD @UCSanDiego Linguistics. Previously @pleiasfr @EdinburghUni. Interested in multilingual NLP, tokenizers, open science. She/her.
Yu Zhang 🐈🐙 @yzhang_cs
610 Followers 660 Following @Kimi_Moonshot; PhD Student @ Soochow University; working on efficient methods for LLMs; disciple of parallel programming; INTP
Ali Taha @AliesTaha
650 Followers 169 Following gpu perf @modular | ex @Tesla comp eng @uwaterloo [email protected]
Federico Vaggi @F_Vaggi
2K Followers 2K Following Whereof one cannot speak, thereof one must be silent. Ex-Amazon, now GoogleX. My (bad) tweets are my own and don't represent anyone. Same handle on elefant.
Alek Dimitriev @tensor_rotator
455 Followers 1K Following Inference @AnthropicAI, prev Gemini @Google, prev prev PhD @UTAustin
Horace He @cHHillee
42K Followers 536 Following @thinkymachines Formerly @PyTorch "My learning style is Horace twitter threads" - @typedfemale
Seunghyun Seo @SeunghyunSEO7
3K Followers 810 Following deep learning enjoyer. from speech to llm, now exploring image space @midjourney
Yandex Research @YandexResearch
1K Followers 58 Following
Ahmad Beirami @abeirami
10K Followers 4K Following sth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
Jinjie Ni @NiJinjie
2K Followers 523 Following AI researcher building foundation models. I'm on the job market.
Michael Galkin @michael_galkin
7K Followers 339 Following Senior Research Scientist @GoogleAI. Prev: @Intel, Postdoc @Mila_Quebec & McGill. Graph Learning & LLMs. Grandmaster of 80's music (according to Spotify)
clem 🤗 @ClementDelangue
157K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Thang Luong @lmthang
27K Followers 95 Following Lead Superhuman Reasoning team @GoogleDeepMind. AI IMO Gold. Co-led #DeepThink, #AlphaGeometry, #Bard (now Gemini) Multimodality, #MeenaBot. LuongAttention.
Chi Jin @chijinML
5K Followers 469 Following Assistant Prof @Princeton. Previously: ML theory, RL & optimization. Now: AI for math, games & decision making.
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Chen Sun 🤖🧠🇨... @ChenSun92
2K Followers 398 Following Research Scientist @ Google DeepMind Building memory & open-ended AI ex-neuroscientist ex-IMO team Canada Views are mine alone not GDM's.
Julian Schrittwieser @Mononofu
16K Followers 100 Following Member of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
Michael Druggan @Michael_Druggan
13K Followers 409 Following Bodybuilder, powerlifter, Math Olympiad winner. CMU class of 2015. @swole_druggan on Instagram
John Burn-Murdoch @jburnmurdoch
481K Followers 6K Following Columnist and chief data reporter @FinancialTimes | Stories, stats & scatterplots | Senior fellow @LSEdataScience | [email protected]
Zhiqing Sun @EdwardSun0909
19K Followers 1K Following Agents @Meta MSL TBD Lab. previously posttraining research @OpenAI train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
Mario Sieg @_mario_neo_
3K Followers 122 Following ML | Game Engines | HPC | Art Research Engineer @PrimeIntellect
Vincent Abbott @vtabbott_
7K Followers 334 Following Maker of *those* diagrams for deep learning algorithms | @mit @mitlids incoming PhD
Kimi.ai @Kimi_Moonshot
53K Followers 100 Following Built by Moonshot AI to empower everyone to be superhuman.
Crystal @crystalsssup
13K Followers 655 Following Staff @Kimi_Moonshot prev. co-maker of ModelizeAI & gemsouls "Personality goes a long way" @UCSanDiego
Zvi Mowshowitz @TheZvi
32K Followers 278 Following Blogger primarily on AI and AI x-risk but also other things at Don't Worry About the Vase (SS/WP/LW), founding Balsa Research to fix policy.
Andrea Michi @andreamichi
2K Followers 1K Following Co-founder @ https://t.co/FiVtWkCxXC / Building intelligence to detect and remediate software vulnerabilities / Prev post-training / RL for Gemini @GoogleDeepMind
Yura Gorishniy @YuraFiveTwo
396 Followers 119 Following I do research on deep learning for tabular data
Harris Chan @SirrahChan
3K Followers 2K Following Research Scientist at @GoogleDeepMind, ML PhD @UofT/@VectorInst. EngSci Grad. Former Canadian Rubik's Cube Champion.
Konstantin Mishchenko @konstmish
7K Followers 658 Following Research Scientist @AIatMeta Previously Researcher @ Samsung AI Outstanding Paper Award @icmlconf 2023 Action Editor @TmlrOrg I tweet about ML papers and math
Daniel Wurgaft @danielwurgaft
145 Followers 188 Following PhD @Stanford working w @noahdgoodman Studying in-context learning and reasoning in humans and machines Prev. @UofT CS & Psych
Nan Jiang @nanjiang_cs
10K Followers 73 Following machine learning researcher, with focus on reinforcement learning. assoc prof @ uiuc cs. Course on RL theory (w/ videos): https://t.co/vqVKwY4RJE
Vahid Balazadeh @vahidbalazadeh
88 Followers 218 Following PhD Student at @UofT and @VectorInst. Prev. Research Intern @Autodesk
Lisan al Gaib @scaling01
22K Followers 704 Following lead them to paradise | intelligence is inherently about scaling | be kind to us AGI
Sasha Gusev @SashaGusevPosts
20K Followers 3K Following Statistical geneticist | Associate Prof at @DanaFarber / @harvardmed / @DFCIPopSci | Blogging at https://t.co/4D7UObBNdd
Damien Ferbach @damien_ferbach
579 Followers 195 Following PhD at @Mila_Quebec/ Previously student in Maths and theoretical Physics at @ENS_ULM
Mike Duncan @mikeduncan
120K Followers 107 Following History podcaster/author. Revolutions + The History of Rome.
Prime Intellect @PrimeIntellect
48K Followers 28 Following find compute. train models. contribute to open superintelligence. https://t.co/ZRZOsRRbwr
Shunyu Yao @ShunyuYao12
20K Followers 1K Following @OpenAI Language agents (ReAct, Reflexion, Tree of Thoughts, SWE-agent, CoALA) for digital automation (WebShop, SWE-bench, tau-bench)