Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team @tatsu_hashimoto @marcelroed @neilbband @rckpudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:
On the coming Tuesday (Aug 26th), we will have
@YuchenZhu_ZYC talking about “Beyond Euclidean data: Lie group and multimodal diffusion models” 🚀, from 5pm to 6pm (UK time).
Join us via Zoom: us05web.zoom.us/j/7780256206?p…
See more information below 👇
By popular demand, #NeurIPS2025 Workshop on "Dynamics at the Frontiers of Optimization, Sampling, and Games" (DynaFront)
has its submission deadline extended to August 29 (AoE).
Please submit high-quality work at openreview.net/group?id=NeurI…
Georgia Tech AI4Science Center has soft-launched, and I'm excited to be an Associate Director.
ai4science.ai.gatech.edu
Collaboration and participation of all kinds are welcome. Please get in touch!
Thanks to @gtsciences for the support.
Retweets appreciated! @GeorgiaTech #AI4Science
I will present
* accelerated manifold optimization,
in LA (ICCOPT) Wed 7/23
* fine-tuning of diffusion models and stochastic optimal control for sampling,
in Montreal (SIAM) Tue 7/29
* fast sampling under nonconvex constraints,
in Chicago (MCM) Thu 7/31
Love to chat and learn!
If still around #ICML2025, plz consider checking out my collaborator @qu_1006's Oral in the MemFM Workshop, 11am Sat West Meeting Room 223-224, on
A Closer Look at Model Collapse (in diffusion model): From a Generalization-to-Memorization Perspective
What if AI isn’t about building solo geniuses, but designing social systems?
Michael Jordan advocates blending ML, economics, and uncertainty management to prioritize social welfare over mere prediction.
A must-read rethink.
arxiv.org/abs/2507.06268…
I wish I could go to #ICML2025, but plz consider dropping by the fun posters of my student Yuchen Zhu (@YuchenZhu_ZYC)!
* Diffuse Everything
* Learning to Stop
New lecture recordings on RL+LLM! 📺
This spring, I gave a lecture series titled **Reinforcement Learning of Large Language Models**. I have decided to re-record these lectures and share them on YouTube. (1/7)
Big thanks to the COLT 2025 organizers for an awesome event in Lyon! Here are the slides from my keynote this morning in case you’re curious about the references I mentioned: di.ens.fr/~fbach/fbach_o…
18K Followers, 4K Following: Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
5K Followers, 933 Following: Research fellow at Flatiron Institute, working on understanding optimization in deep learning. Previously: PhD in machine learning at Carnegie Mellon.
857 Followers, 5K Following: Research Fellow @MSFTResearch India | Ex RA @gatech_scs, (B. + M.) Tech. CSE IITD | PL, Verification and Theorem Proving | Sports + Music + Food (in that order)
11 Followers, 199 Following: Interested in ML + Emergent Intelligence | Incoming Physics PhD student at @PennPhys, current MPhil candidate @ChemCambridge, @harvardphysics AB ‘24
108 Followers, 1K Following: 🇬🇧British Army Soldier 👮Royal Military Police. I'm not going to let myself pull me down anymore. Be yourself; everyone else is already taken ❣️
14 Followers, 34 Following: Graduate Researcher at @kaist_ai | Working on probabilistic ML based on stochastic optimal control (Schrödinger bridges, state space models)
78 Followers, 2K Following: Web developer/financial experts. Crypto trading is a life changing investment. Kindly follow or DM me to mentor you on how to invest wisely. And thank me later.
2K Followers, 818 Following: Postdoc @ VU Amsterdam, prev University of Edinburgh | Neurosymbolic Machine Learning
Mostly moved to 🦋, will only post news here
38K Followers, 485 Following: Digital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.
36K Followers, 2K Following: Information Geometry, Information Theory, and Geometric Science of Information (GSI) for machine learning and AI, visual computing, HPC, pyBregMan lib @SonyCSL
37K Followers, 564 Following: Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0);
Working on ML, DL, RL, LLMs, and their theory.
344 Followers, 285 Following: Visiting Professor @Politecnico di Milano.
🎮 Working on Game optimization, alg. game theory, multi-agent design, AI/ML
bsky: https://t.co/A1NCLv4wsX
186 Followers, 233 Following: Incoming Assistant Professor, CCDS @ Nanyang Technological University. Currently a Research fellow @ Oxford Statistics Department.
3K Followers, 434 Following: Assistant Professor @JohnsHopkinsAMS, Optimization, PhD @Cornell_ORIE
Mostly here to share pretty maths/3D prints, sometimes sharing my research
964 Followers, 298 Following: Research scientist @nvidia | postdoc @caltech | PhD @univienna | former research intern @MetaAI and @nvidia | views are my own
773 Followers, 631 Following: Associate Professor at Georgia Tech Computer Science. Machine Learning researcher. Former professional juggler 🤹🏻‍♀️, a career I aim to return to.
2K Followers, 581 Following: Associate professor in CS @ National Taiwan University. PhD in CS from EPFL. Learning, optimization, statistics, and some quantum information.
728 Followers, 553 Following: PhD Student in Machine Learning at Carnegie Mellon University @mldcmu @SCSatCMU. Intern @GoogleAI. Previously @Quora, @Illinois_Alma. Math of deep learning, LMs
902 Followers, 894 Following: Assistant Professor @mldcmu. Building generative models for science, engineering, and AI. Previously @Harvard, @MIT, @GoogleAI, @NYU_Courant.
6K Followers, 2K Following: Ass. prof. of Machine Learning. PI of Generative Memory Lab (@DondersInst). Statistical physics, generative diffusion, memory, and generalization.