Taehyeon Kim @kimtaehyeon610

Research Scientist (Team Lead) - @LG_AI_Research. Prev: @GoogleAI (NYC 🇺🇸), @Qualcomm AI, @dynamo_ai (YCW22). Agent/LLM inference/alignment. 🎧 taehyeon.oopy.io Seoul, Korea Joined November 2021

Tweets

180
Followers

571
Following

250
Likes

2K

Sharon Y. Li @SharonYixuanLi

5 days ago

Multi-Agent Debate (MAD) has been hyped as a collaborative reasoning paradigm — but let me drop the bomb: majority voting, without any debate, often performs on par with MAD. This is what we formally prove in our #NeurIPS2025 Spotlight paper: “Debate or Vote: Which Yields…

10 66 449 32K 330

Download Image

Tanishq Mathew Abraham, Ph.D. @iScienceLuvr

4 days ago

practical, modern GRPO tweaks as described in Meta's Code World Models paper

13 86 878 238K 903

Download Image

Prakash Kagitha @prakashkagitha

3 weeks ago

There are 70+ "reasoning" papers accepted at COLM 2025 (Oct 7-10, Montreal). Most papers elicit long reasoning for different tasks or understand the reasoning abilities/limitations of LLMs. I wrote a blog post covering ~30 of those papers 👇

6 37 288 24K 311

Download Image

Jeff Dean @JeffDean

a month ago

AI efficiency is important. Today, Google is sharing a technical paper detailing our comprehensive methodology for measuring the environmental impact of Gemini inference. We estimate that the median Gemini Apps text prompt uses 0.24 watt-hours of energy (equivalent to watching an…

152 830 4K 727K 2K

Download Image

Vaish Shrivastava @VaishShrivas

2 months ago

Test-time scaling w/ GRPO boosts accuracy, but also adds “filler tokens” increasing length w/o real progress. We present Group Filtered Policy Optimization (GFPO):🧵 1️⃣ Sample more per prompt 2️⃣ Rank by token efficiency (reward ÷ length) 3️⃣ Train on top-k 4️⃣ 🚀 Cut 80% of…

4 49 333 59K 281

Download Image

Jack Morris @jxmnop

2 months ago

OpenAI hasn’t open-sourced a base model since GPT-2 in 2019. they recently released GPT-OSS, which is reasoning-only... or is it? turns out that underneath the surface, there is still a strong base model. so we extracted it. introducing gpt-oss-20b-base 🧵

158 467 6K 923K 4K

Download Image

Ming Yin @MingYin_0312

2 months ago

I implemented GRPO and DPO from scratch in vanilla Pytorch to unravel every piece of training details. Hope it could be helpful for those who care about the implementation details of the algorithms. 👉 github.com/mingyin0312/RL… #AI #RL #LLM

16 210 2K 104K 2K

Sam Altman @sama

2 months ago

gpt-oss is out! we made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF!!) (and a smaller one that runs on a phone). super proud of the team; big triumph of technology.

2K 4K 46K 4.2M 8K

Sangmin Bae @raymin0223

2 months ago

✨Huge thanks for interest in Mixture-of-Recursions! Codes are officially out! It's been a long journey exploring Early-exiting with Recursive Architecture. I'll soon post my 👨‍🎓PhD thesis on Adaptive Computation too! Code: github.com/raymin0223/mix… Paper: arxiv.org/abs/2507.10524

6 64 281 17K 169

Download Image

Yujin Kim @yujin301300

2 months ago

Introducing our new work: 🚀Mixture-of-Recursions! 🪄We propose a novel framework that dynamically allocates recursion depth per token. 🪄MoR is an efficient architecture with fewer params, reduced KV cache memory, and 2× greater throughput— maintaining comparable performance!

10 60 329 22K 218

Download Image

Alex Prompter @alex_prompter

3 months ago

R.I.P McKinsey. You don’t need a $300k consultant anymore. You can now run full competitive market analysis using Grok 4. Here are the exact 3 mega-prompts I use to replicate McKinsey-style insights for free:

852 5K 43K 13.7M 62K

Download Image

Dongmin Park @dongmin_park11

4 months ago

🚨New Paper Alert As a game company, @Krafton_AI is actively exploring how to apply LLM agents to video games. We present Orak—a foundational video gaming benchmark for LLM agents! Includes Pokémon, StarCraft II, Slay the Spire, Darkest Dungeon, Ace Attorney, and more in🧵

2 22 74 10K 21

Download Image

Johannes Oswald @oswaldjoh

4 months ago

Super happy and proud to share our novel scalable RNN model - the MesaNet! This work builds upon beautiful ideas of 𝗹𝗼𝗰𝗮𝗹𝗹𝘆 𝗼𝗽𝘁𝗶𝗺𝗮𝗹 𝘁𝗲𝘀𝘁-𝘁𝗶𝗺𝗲 𝘁𝗿𝗮𝗶𝗻𝗶𝗻𝗴 (TTT), and combines ideas of in-context learning, test-time training and mesa-optimization.

4 64 404 87K 334

Download Image

Carlos E. Perez @IntuitMachine

4 months ago

Shocker! Claude 4 system prompt was leaked, and it's a goldmine! The Claude system prompt incorporates several identifiable agentic AI patterns as described in "A Pattern Language For Agentic AI." Here's an analysis of the key patterns used: Run-Loop Prompting: Claude…

63 496 5K 1.2M 13K

Download Image

Rohan Paul @rohanpaul_ai

5 months ago

Small language models struggle with complex reasoning tasks where large models excel. This paper introduces the SMART framework, where a small model performs reasoning but selectively requests corrections from a large model only for steps identified as uncertain via a scoring…

4 31 179 11K 121

Download Image

Genspark @genspark_ai

6 months ago

Meet Genspark Super Agent - a fast & reliable general AI agent! Check it out: genspark.ai

61 144 749 317K 573

Download Video

Pieter Abbeel @pabbeel

7 months ago

Basics of Deep RL tutorial I am still very happy with, as good a day as any to re-post :) youtube.com/playlist?list=…

15 115 944 117K 741

Yi Ma @YiMaTweets

9 months ago

Academia should focus on discovering simplifying and unifying principles and mechanisms behind intelligence; and industry is obviously better equipped to manifest and scale up. That is the same as physics/mechanics to building big airplanes... But I do not believe the current…

Lucas Beyer (bl16) @giffmana

9 months ago

8 18 257 80K 75

5 16 156 31K 65

Stephanie Chan @scychan_brains

9 months ago

Devastatingly, we have lost a bright light in our field. Felix Hill was not only a deeply insightful thinker -- he was also a generous, thoughtful mentor to many researchers. He majorly changed my life, and I can't express how much I owe to him. Even now, Felix still has so much…

6 94 607 89K 560

John Nguyen @JohnNguyen

10 months ago

🥪New Paper! 🥪Introducing Byte Latent Transformer (BLT) - A tokenizer free model scales better than BPE based models with better inference efficiency and robustness. 🧵