Who is going to be at #COLM2025?
I want to draw your attention to a COLM paper by my student @sheridan_feucht that has totally changed the way I think and teach about LLM representations. The work is worth knowing.
And you can meet Sheridan at COLM, Oct 7! https://t.co/L8TtQHFiqC
Language models that think, chat better.
We used longCoT (w/ reward model) for RLHF instead of math, and it just works. Llama-3.1-8B-Instruct + 14K ex beats GPT-4o (!) on chat & creative writing, & even Claude-3.7-Sonnet (thinking) on AlpacaEval2 and WildBench!
Read on. 🧵
1/8
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models.
ai.meta.com/research/publi…
"AI slop" seems to be everywhere, but what exactly makes text feel like slop?
In our new work (w/ @TuhinChakr, @dgolano, @byron_c_wallace) we provide a systematic attempt at measuring AI slop in text!
arxiv.org/abs/2509.19163
🧵 (1/7) https://t.co/WlVRnq07cd
So You Want to Be an Academic? A couple of years into your PhD, but wondering: "Am I doing this right?" Most of the advice is aimed at graduating students. But there's far less for junior folks who are still finding their academic path.
My candid takes: anandbhattad.github.io/blogs/jr_grads…
-2016 (classic era): focus on data efficiency
2017-2025 (pretraining era): focus on compute efficiency
2026-: focus on data efficiency (again)
The standard Transformer paradigm is optimized for compute efficiency. As we look at data efficiency, we'll see very different design…
We found "misaligned persona" features in Llama and Qwen that mediate emergent misalignment. Fine-tuning on bad medical advice strengthens these pre-existing features, causing broader undesirable behavior. lesswrong.com/posts/NCWiR8K8…
Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”
We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…
How do we navigate a growing collection of post-trained LLMs?
In Delta Activations: A Representation for Finetuned LLMs, we propose a compact embedding that encodes the post-training signal.
Try the interactive model navigator 👉 oscarxzq.github.io/delta_activati…
Language models often produce repetitive responses, and this issue is further amplified by post-training. In this work, we introduce DARLING, a method that explicitly optimizes for both response diversity and quality within online reinforcement learning!
👀Have you asked an LLM to provide a more detailed answer after inspecting its initial output? Users often provide such implicit feedback during interaction.
✨We study implicit user feedback found in LMSYS and WildChat. #EMNLP2025
It’s rare for competitors to collaborate. Yet that’s exactly what OpenAI and @AnthropicAI just did—by testing each other’s models with our respective internal safety and alignment evaluations. Today, we’re publishing the results.
Frontier AI companies will inevitably compete on…
We also have very similar and maybe simpler observations in our recent paper
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
arxiv.org/abs/2507.09709
In fact we can build very effective guardrails using the subspace observation…
Introducing Generative Interfaces - a new paradigm beyond chatbots.
We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks.
Adaptive and Interactive: creates the form that best adapts to your goals and needs!
NEW: A major AI copyright legal showdown just took a huge twist today. Facing a class action on behalf of book authors that could've seen it pay over a TRILLION in damages for alleged piracy, Anthropic has agreed to settle instead: wired.com/story/anthropi…
New Anthropic research: filtering out dangerous information at pretraining.
We’re experimenting with ways to remove information about chemical, biological, radiological and nuclear (CBRN) weapons from our models’ training data without affecting performance on harmless tasks.
8K Followers · 893 Following · Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
971 Followers · 593 Following · PhD student at @LTIatCMU / @SCSatCMU she/her, prev. @UVA and intern @ai2_allennlp
@/clara on https://t.co/GHxXbrRHSB and @/clarana on https://t.co/47UIhMGaRd
2K Followers · 2K Following · Assistant Professor @sutdsg, working on online trust & safety, computational social science, and social NLP. Currently leading the Social AI Studio.
141 Followers · 148 Following · UW NLP | MATS Scholar | Comp. Psyc/Social Sci. |
ai values ↔ human values • value alignment for the good of humanity |
Working on eval • data collection in wild
105 Followers · 130 Following · CSE PhD student @hkust in her second year advised by @junxian_he. Machine learning, NLP.
bluesky here: https://t.co/ECxlKtKTxz
77K Followers · 13K Following · Newsletter exploring AI&ML - AI 101, Agentic Workflow, Business insights. From ML history to AI trends. Led by @kseniase_ Know what you are talking about👇🏼
6K Followers · 271 Following · Computer Science Professor at Northeastern, Ex-Googler. Believes AI should be transparent. @[email protected] @davidbau.bsky.social https://t.co/wmP5LV0pJ4
53 Followers · 0 Following · Women in AI Research (WiAIR) is dedicated to celebrating the remarkable contributions of female AI researchers from around the globe.
24K Followers · 688 Following · Professor and Head of Machine Learning Department at @CarnegieMellon. Board member @OpenAI and @Qualcomm. Chief Technical Advisor @GraySwanAI.
992 Followers · 133 Following · Group account for Prof. Yulia Tsvetkov's lab at @uwnlp. We work on low-resource, multilingual, social-oriented NLP. Details on our website:
3K Followers · 2K Following · Each episode of The Thesis Review is a conversation centered around a researcher's PhD thesis, and how their research has evolved since. Hosted by @wellecks.
21K Followers · 465 Following · physics of language models @ Meta (FAIR, not GenAI, not TBD)
🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS
🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
16K Followers · 364 Following · Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
8K Followers · 710 Following · Assistant Professor MIT @medialab @MITEECS @nlp_mit || PhD from CMU @mldcmu @LTIatCMU || Foundations of multisensory AI to enhance the human experience.
296 Followers · 88 Following · Natural language processing researcher. Assistant Professor at Stony Brook University. Previous: Research Assistant Professor at TTIC, PhD from Harvard.
19K Followers · 1K Following · Agents @Meta MSL TBD Lab. previously posttraining research @OpenAI train LLMs to do things: deep research, chatgpt agent, etc. CS PhD @LTIatCMU
38K Followers · 485 Following · Digital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.