Nikhil Anand @nikhil_anand91
Physicist-turned-machine-learner, currently research scientist @kempnerinst nikhilanand91.github.io Cambridge, MA Joined December 2019
15 Tweets 59 Followers 274 Following 121 Likes
I respectfully disagree with Ed. Was Kepler's planetary analysis "real" mathematics or just astronomy? Are IMO problems "real" mathematics or just puzzles for high school students? Is photography "real" art or just tool use? The label "real" is a personal, aesthetic judgment,…
Interested in the latest work from the #KempnerInstitute? Check out papers and preprints from June's Research Roundup. kempnerinstitute.harvard.edu/kempner-commun… Abstracts and links below. 🧵 (1/21) #AI #neuroscience #NeuroAI
@isabelpapad @nsaphra @SimmonsEdler @RyanPaulBadman1 @RaymondRChua @johnjvastola @KanakaRajanPhD @elmelis @_valerie_costa_ @Napoolar @EkdeepL @BTolooshams @dunbar_ba @MorrisYau @gershbrain 'Decomposing Elements of Problem Solving: What "Math" Does RL Teach?' Tian Qin, @corefpark, Mujin Kwun, @aaronwalsman, @EranMalach, @nikhil_anand91, @Hidenori8Tanaka, @elmelis doi.org/10.48550/arXiv… (15/21)
New in the #DeeperLearningBlog: Kempner researchers Nikhil Anand (@nikhil_anand91) and Chloe Su (@Huangyu58589918) discuss new work on how numerical precision can impact the accuracy and stability of #LLMs. kempnerinstitute.harvard.edu/research/deepe… #AI (1/2)
Excited to share this work on understanding low-precision instabilities in model training! See our thread below for more details. Paper: arxiv.org/abs/2506.20752 Blogpost: tinyurl.com/lowprecinstabi…
🚨 New preprint! TL;DR: Backtracking is not the "holy grail" for smarter LLMs. It’s praised for helping models “fix mistakes” and improve reasoning—but is it really the best use of test-time compute? 🤔
How does RL improve performance on math reasoning? Studying RL from pretrained models is hard, as behavior depends on choice of base model. 🚨 In our new work, we train models *from scratch* to study the effect of the data mix on the behavior of RL. arxiv.org/abs/2504.07912
At NeurIPS? Come discuss loss-to-loss prediction and scaling laws with us!
How do different data distributions interact with scaling laws? And how does training data affect test loss? We find simple shifted power law fits can relate performance across (sometimes very disparate) datasets and losses. See David's thread for more details!
MoEs increase parameter count but not FLOPs. Do they offer "free lunch", improving performance without paying in compute? Our answer: for memorization, MoEs give performance gains "for free", but have limited benefit for reasoning! Arxiv: arxiv.org/pdf/2410.19034 🦜🦜🦜
Really cool work led by Devin Kwok (McGill/Mila) on making sense of example difficulty. Addresses some key questions, e.g., how consistent is measured difficulty across inits and for different architectures? Can we fingerprint models using a few key sensitive/hard examples?
Happy to share our EMNLP paper w/ @jtan189 where we apply Variance of Gradients (VoG) – originally developed by @_cagarwal, @mrdanieldsouza, and @sarahookr – for selecting important data in language-based tasks. At EMNLP? Let's connect to discuss data quality and/or LLMs! #EMNLP

Huang Qichang @huangq1_
92 Followers 2K Following
Urmish Thakker @UrmishThakker
609 Followers 2K Following LLM @SambanovaAI | | Ex-@arm research| @mlperf1| @BigscienceW| @TXInstruments,@AMD| @WisconsinCS| @bitspilaniindia
Dan Elton @moreisdifferent
7K Followers 3K Following Science & technology enthusiast. On my Substack I write about metascience, AI, & other topics. Leave me anonymous feedback here: https://t.co/LQ5eZWwDst
Reza Shamji @Reza_Shamji
49 Followers 157 Following Kempner Institute of Natural and Artificial Intelligence Research Engineer Intern
Peiyang Song @p_song1
363 Followers 250 Following CS Major w/ Robotics Minor @Caltech. #AI Researcher @UCBerkeley & @Stanford. Applying for 26Fall PhD positions in Computer Science.
Patrick Drake @time8machine
17K Followers 6K Following Neurodivergent physics student with a keen interest in multisensory integration and emergent perception. Exploring research on a proposed ‘sixth sense’. Δ
Jun Il Kwun @junil_kwun
184 Followers 3K Following
wanlin zhu @neuromanifold
30 Followers 4K Following
Aayush Karan @aakaran31
483 Followers 752 Following PhD student @Harvard and @GoogleDeepMind | Algorithmic insights for generative machine learning | @PDSoros 2024 | Prev @citsecurities, @Apple
Michael P. Brosnan @BrosnanP98
199 Followers 5K Following I have intent of helping but I usually help through credit cards,because I know the society’s need so helping hand as well
Ymarrau @Ymarrau2555
78 Followers 3K Following
MJK @MJK12341234
7 Followers 39 Following
Kempner Institute at ... @KempnerInst
3K Followers 364 Following The Kempner Institute for the Study of Natural and Artificial Intelligence at @Harvard University. RTs ≠ Endorsements
Depen Morwani @depen_morwani
247 Followers 146 Following PhD student at Harvard ML Foundations, Research Associate at Google AI, completed MS from IIT Madras
Alex Meterez @alexmeterez
248 Followers 890 Following cs phd student @harvard deep learning theory, optimization, scale pilled
Rosie Zhao @rosieyzh
546 Followers 583 Following PhD student with @hseas ML Foundations Group. Previously @mcgillu.
Ofir Press @OfirPress
15K Followers 7K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Dylan Foster 🐢 @canondetortugas
3K Followers 1K Following Foundations of RL/AI @MSFTResearch. Previously @MIT @Cornell_CS https://t.co/vQIdUzsw8B RL Theory Lecture Notes: https://t.co/bhgL3aKIk0
Tursawez @TursawezVYs
45 Followers 5K Following
Thomas Fel @Napoolar
2K Followers 768 Following Explainability, Computer Vision, Neuro-AI @Harvard. Research Fellow @KempnerInst. Prev. @tserre lab, @Google, @GoPro. Crêpe lover.
Max Shad @maxshadx
490 Followers 515 Following Senior Director of AI/ML Research Engineering, Kempner Institute @KempnerInst @Harvard , views are my own
Chloe H. Su @Huangyu58589918
561 Followers 1K Following CS PhD @Harvard @KempnerInst Automated Reasoning @AmazonScience Prev @mldcmu @ntusg
Cengiz Pehlevan @CPehlevan
3K Followers 1K Following Theoretical neuroscience, theory of neural computation, physics of learning and intelligence. Assistant Professor of Applied Mathematics @Harvard SEAS
Naeem Khoshnevis @NaeemKhoshnevis
31 Followers 145 Following ML Research Engineer @KempnerInst @Harvard | Opinions my own
Yasin Mazloumi @y_mazloumi
128 Followers 266 Following Senior ML Research Engineer at @kempnerInst at @Harvard University | PhD from @UCR_CSE
Ella Batty @EllaBatty
2K Followers 771 Following Senior Machine Learning Research Scientist, Kempner Institute, Harvard. Board Member, Neuromatch. she/her. Views are my own.
Eva Louise Marie Gabr... @e681554349
11 Followers 7K Following
Yixin Lin @yixin_lin_
2K Followers 7K Following something new. prev: embodied AI @GoogleDeepMind, FAIR/@AIatMeta, Google Brain.
David Brandfonbrener @brandfonbrener
1K Followers 628 Following research scientist @AIatMeta. Previously: phd from @nyu_courant, research fellow @KempnerInst @Harvard
Bingbin Liu @BingbinL
942 Followers 261 Following Research Fellow at the Kempner Institute at Harvard University.
Sham Kakade @ShamKakade6
16K Followers 497 Following Harvard Professor. Full stack ML and AI. Co-director of the Kempner Institute for the Study of Artificial and Natural Intelligence.
Yang Wu @15tatt
442 Followers 4K Following Math/Music Composition Undergrad at Soochow Univ. in Taiwan. Theoretical Neuro RA at @AcadSinica Memory, Mental Simulation, DeepRL & Philosophy of Neuroscience
Susan @susan13larry
273 Followers 3K Following
Tothos @Tothos342325
14 Followers 1K Following The garden is full of spring scenery, with a few red flowers falling all over the ground
Carolin Holtermann @CarolinHolterm
54 Followers 72 Following PhD Student at the Data Science Chair in Hamburg
Yupei Du @YupeiDu
68 Followers 592 Following Postdoc at Saarland University, working with @alkoller on #NLProc. LLM reasoning
Mukund Srinath @ EMNL... @MukundSrinath3
191 Followers 354 Following Machine Learning Scientist @ Expedia Group | NLP, IR and Trustworthy AI
Chirag Agarwal @_cagarwal
2K Followers 546 Following Assistant Professor @UVA; PI of Aikyam Lab; Prev - @Harvard, @Adobe @BoschGlobal @thisisUIC ; Increasing the sample size of my thoughts
Josh Tan @jtan189
69 Followers 406 Following
Sara Hooker @sarahookr
50K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Daniel D'souza @mrdanieldsouza
859 Followers 967 Following Research Engineer @Cohere_Labs💙 | @UMichECE Alum 〽️ | 🇮🇳✖️🇺🇸 💫"The Universe Works in Mysterious Ways"💫
Michael Chang @mmmbchang
5K Followers 2K Following Amplify human creativity with Sora @openai Prev: Gemini and Project Astra @GoogleDeepMind, @LangChainAI, @MetaAI, @SchmidhuberAI PhD @berkeley_ai. BS @MIT
Scott Gray @scottgray76
9K Followers 794 Following GPU Geek at @OpenAI. I have a long standing interest in neuroscience and its application to machine learning. He/Him.
Stephanie Chan @scychan_brains
5K Followers 3K Following Staff Research Scientist at Google DeepMind. Artificial & biological brains 🤖 🧠 Views are my own.
Simran Arora @simran_s_arora
5K Followers 207 Following building ai systems, cs phd @stanford @hazyresearch, incoming asst. prof. @caltech
William Brandon @exists_forall
746 Followers 1K Following he/him • Trying to become compute-bound • PhD student at MIT CSAIL • Prev: CS & Math at UC Berkeley; ML Compilers at NVIDIA • Opinions my own
Lin Yang @lyang36
3K Followers 1K Following Associate Professor of ECE&CS@UCLA. ML, RL, big data, algorithms, astronomy.
Stuart Sul @stuart_sul
1K Followers 119 Following ml research @cursor_ai, cs @Stanford, mlsys @HazyResearch
Kevin Lu @_kevinlu
10K Followers 227 Following @thinkymachines. formerly: - @openai: RL, synthetic data, efficient models - @berkeley_ai: decision transformer, universal computation
Robert Nishihara @robertnishihara
9K Followers 783 Following Co-founder @anyscalecompute. Co-creator of @raydistributed. Previously PhD ML at Berkeley.
Ion Stoica @istoica05
5K Followers 20 Following Professor at UC Berkeley, co-founder of Databricks, Anyscale, LMArena, Conviva.
will depue @willdepue
51K Followers 2K Following (taking time off) RL posttraining @openai, past: sora 1 & 2, applied research
Alireza Fathi @alirezafathi
3K Followers 213 Following Senior Staff Research Scientist / Manager @ Google DeepMind
kipply @kipperrii
9K Followers 969 Following "uncanny ability to be mentioned in every slack thread about code that's mysteriously breaking" - claude | alt @kipperriiii
Ken Liu @kyliu99
35K Followers 111 Following SFF author, speaker, lawyer, programmer; Hugo, Nebula, World Fantasy; rep by Russ Galen; The Dandelion Dynasty & “The Paper Menagerie"
Hongyu Ren @ren_hongyu
23K Followers 693 Following research @meta superintelligence. CS PhD @stanford. prev @openai, led the development of o3-mini and o1-mini.
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Andy Keller @t_andy_keller
4K Followers 1K Following Postdoctoral Fellow at The Kempner Institute at Harvard University -- Somewhere between Brains & Bits. PhD at UvA, Intern @ Apple MLR, Prev @ Intel AI & Nervana
Fred Zhang @FredZhang0
1K Followers 514 Following research scientist @googledeepmind, prev phd @berkeley_eecs, DM open
Jonathan Lee @jon_lee0
748 Followers 107 Following research @GoogleDeepMind. co-developed gemini deep think. co-led model training for IMO 🥇 | prev: RL PhD at @StanfordAILab
Wen Sun @WenSun1
729 Followers 76 Following Assistant professor at @cornell_tech and research scientist at @Databricks; working on Reinforcement Learning.
Jack Rae @jack_w_rae
23K Followers 453 Following Distinguished Scientist @ Meta LLMs (e.g. Gopher, Chinchilla, Gemini) Compression & RL ☯️ Past: Google, OpenAI, Quora
Davis Blalock @davisblalock
15K Followers 168 Following Research scientist @GoogleDeepMind. Past: @Databricks, first hire @MosaicML, @MIT PhD. I post about AI technical progress + sometimes the business side.
Xinyu Zhou @zxytim
2K Followers 1K Following
Reza Shamji @Reza_Shamji
49 Followers 157 Following Kempner Institute of Natural and Artificial Intelligence Research Engineer Intern
Yuchen He @YuchenHe07
2K Followers 647 Following learning @xai | prev @openai@meta@apple@uiuc@utaustin
Qian Huang @qhwang3
14K Followers 330 Following prev @xai | CS PhD student @StanfordAILab (on leave)
Eric Zelikman @ericzelikman
21K Followers 2K Following building for humans // was lgtm-ing @xAI, phd-ing @stanford
Gabriel Poesia @GabrielPoesia
1K Followers 288 Following CS PhD @Stanford Incoming Post-doc Fellow (Fall 25) @KempnerInst Incoming Assistant Professor (Fall 26) @UMichCSE AI, formal systems, open-ended learning
Yang Song @DrYangSong
14K Followers 939 Following Leading Strategic Explorations @OpenAI. Score-Based / Diffusion Models. Consistency Models. Optimization & Architecture.
Dan Fu @realDanFu
7K Followers 221 Following Incoming assistant professor at UCSD CSE in MLSys. Currently recruiting students! Also running the kernels team @togethercompute.
Albert Gu @_albertgu
18K Followers 88 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.
Jason Weston @jaseweston
13K Followers 725 Following @Meta+NYU. NLP from scratch(Pretrain+FT LLM) 2008,MemNet (pre-Transformer) 2015, DrQA(pre-RAG) 2017, BlenderBot(dialog pre-ChatGPT) 2018+, Self-Rewarding+more!
Joaquin Quiñonero Ca... @jquinonero
6K Followers 246 Following Head of Recruiting @OpenAI. Former Head of Preparedness @OpenAI. AI research and engineering at LinkedIn, Meta and Microsoft before that.
Marinka Zitnik @marinkazitnik
8K Followers 226 Following Associate Professor at Harvard | @Harvard @KempnerInst @broadinstitute @harvard_data | @ProjectTDC @AI_for_Science @ScientistTools
Aaron Walsman @aaronwalsman
219 Followers 284 Following
Hidenori Tanaka @Hidenori8Tanaka
6K Followers 1K Following Group Leader, Physics of Intelligence Program at Harvard University Physics of Artificial Intelligence Group, NTT Research, Inc.
Prithviraj (Raj) Amma... @rajammanabrolu
8K Followers 614 Following Reinforcement Learning and Language. Assistant Prof @UCSanDiego. Research Scientist @Nvidia.
Zoubin Ghahramani @ZoubinGhahrama1
32K Followers 673 Following VP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.
Surbhi Goel @SurbhiGoel_
2K Followers 549 Following Assistant Prof @PennCIS | Postdoc @MSFTResearch | PhD @UTCompSci | Co-founder @let4all
Percy Liang @percyliang
85K Followers 420 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist