Julia Kempe @KempeLab

Silver Professor at NYU Courant and CDS, Research Scientist at FAIR Research in Machine Learning, past in Quantum Computing & Finance. Posts my own. Joined April 2024

Tweets

137
Followers

2K
Following

144
Likes

255

Rohan Paul @rohanpaul_ai

5 days ago

New @AIatMeta paper. The paper teaches a model to think using continuous tokens during reasoning, then answer using normal tokens. The big deal is that it gives the diversity benefits of continuous reasoning without changing serving or prompts. This keeps single try accuracy…

2 7 28 4K 18

Download Image

Natasha Butt @NatashaEve4

5 days ago

🔥New preprint: Soft Tokens, Hard Truths Introduces the first scalable continuous-token RL method for LLMs - no reference CoTs needed; scales to hundreds of thought tokens. Best to train soft, infer hard! Pass@1 parity ⚖️, Pass@32 gains 📈& better robustness 🛡️ vs. hard CoT 1/🧵

3 46 254 25K 198

Download Image

Julia Kempe @KempeLab

5 days ago

Grateful for this great summary of our recent work!

Aran Komatsuzaki @arankomatsuzaki

6 days ago

Grateful for this great summary of our recent work!

6 40 337 35K 280

Download Image

0 2 13 2K 3

Yunzhen Feng @feeelix_feng

6 days ago

🔥 NEW PAPER: What makes reasoning traces effective in LLMs? Spoiler: It's NOT length or self-checking. We found a simple graph metric that predicts accuracy better than anything else—and proved it causally. 🧵[1/n]

3 27 176 9K 115

Download Image

Rohan Paul @rohanpaul_ai

2 weeks ago

Beautiful @AIatMeta paper. Shows that when models are rewarded only for getting the final answer right, they do become more accurate, but they also lose variety in the answers they generate. That lack of variety hurts real-world use, because sampling multiple answers at test…

14 60 362 28K 303

Download Image

Yuda Song @yus167

3 weeks ago

LLMs lose diversity after RL post-training, and this hurts test-time scaling & creativity. Why does this collapse happen, and how can we fix it? Our new work introduces: 🔍 RL as Sampling (analysis) 🗺️ Outcome-based Exploration (intervention) [1/n]

8 88 467 36K 398

Download Image

Julia Kempe @KempeLab

3 weeks ago

Looking forward to collaborate with some of my favorite colleagues on "The Physics of Learning...". Thanks to @SimonsFdn for selecting our proposal!

NYU Center for Data Science @NYUDataScience

3 weeks ago

Looking forward to collaborate with some of my favorite colleagues on "The Physics of Learning...". Thanks to @SimonsFdn for selecting our proposal!

10 6 37 28K 18

0 2 11 4K 1

Julia Kempe @KempeLab

3 weeks ago

Great work by great Meta FAIR Paris intern and colleagues !

Sachin Goyal @goyalsachin007

3 weeks ago

Great work by great Meta FAIR Paris intern and colleagues !

5 64 326 31K 233

Download Image

0 1 11 2K 6

Simons Foundation @SimonsFdn

a month ago

Our new Simons Collaboration on the Physics of Learning and Neural Computation will employ and develop powerful tools from #physics, #math, computer science and theoretical #neuroscience to understand how large neural networks learn, compute, scale, reason and imagine:…

5 31 230 167K 83

Dr. Karen Ullrich @karen_ullrich

3 months ago

How would you make an LLM "forget" the concept of dog — or any other arbitrary concept? 🐶❓ We introduce SAMD & SAMI — a novel, concept-agnostic approach to identify and manipulate attention modules in transformers.

3 13 79 8K 64

Download Image

Charles Arnal @arnal_charles

3 months ago

❓How to balance negative and positive rewards in off-policy RL❓ In Asymmetric REINFORCE for off-Policy RL, we show that giving less weight to negative rewards is enough to stabilize off-policy RL training for LLMs! 💪 (1/8) Paper: arxiv.org/abs/2506.20520

2 28 156 16K 130

Download Image

NYU Center for Data Science @NYUDataScience

5 months ago

Congrats to 37 CDS researchers — faculty, postdocs, and PhD students — who had papers accepted to ICLR 2025, including Spotlighted work by @KempeLab, @feeelix_feng, @andrewgwils, @KuangYilun, and @JianyuZhang8. Full list: nyudatascience.medium.com/cds-researcher…