How do we navigate a growing collection of post-trained LLMs?
In Delta Activations: A Representation for Finetuned LLMs, we propose a compact embedding that encodes the post-training signal.
Try the interactive model navigator 👉 oscarxzq.github.io/delta_activati…
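Rough intuition: a finetuned model can be embedded by how its internal activations shift relative to the base model on a fixed set of probe prompts. A minimal sketch of that idea, where the probe set, layer choice, and last-token pooling are illustrative assumptions rather than the paper's exact recipe:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

def delta_activation(base_id, finetuned_id, probes, layer=-1):
    """Embed a finetuned model as the mean shift of its last-token
    hidden states relative to the base model, over fixed probes."""
    tok = AutoTokenizer.from_pretrained(base_id)
    means = []
    for model_id in (base_id, finetuned_id):
        model = AutoModelForCausalLM.from_pretrained(
            model_id, output_hidden_states=True
        ).eval()
        states = []
        with torch.no_grad():
            for p in probes:
                ids = tok(p, return_tensors="pt")
                out = model(**ids)
                # last-token hidden state at the chosen layer
                states.append(out.hidden_states[layer][0, -1])
        means.append(torch.stack(states).mean(dim=0))
    return means[1] - means[0]  # the delta-activation embedding
```

Models finetuned on similar data should land near each other under cosine similarity of these embeddings, which is what makes them useful for navigating a model collection.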
Language models often produce repetitive responses, and this issue is further amplified by post-training. In this work, we introduce DARLING, a method that explicitly optimizes for both response diversity and quality within online reinforcement learning!
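The exact objective is in the paper; as a rough illustration of jointly rewarding quality and diversity in online RL, here is a toy reward where the multiplicative combination and the distinct-n-gram diversity proxy are simplifying assumptions:

```python
def distinct_ngrams(texts, n=2):
    """Fraction of n-grams that are unique across a batch of sampled
    responses -- a crude proxy for batch-level diversity."""
    seen, total = set(), 0
    for t in texts:
        toks = t.split()
        for i in range(len(toks) - n + 1):
            seen.add(tuple(toks[i:i + n]))
            total += 1
    return len(seen) / max(total, 1)

def diversity_scaled_rewards(quality_scores, responses):
    """Toy combined signal: scale each response's quality score by the
    batch's diversity, so repetitive batches earn less reward."""
    div = distinct_ngrams(responses)
    return [q * div for q in quality_scores]
```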
👀Have you ever asked an LLM to provide a more detailed answer after inspecting its initial output? Users often provide such implicit feedback during interaction.
✨We study implicit user feedback found in LMSYS and WildChat. #EMNLP2025
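The paper defines its own taxonomy of implicit feedback; as a toy illustration of mining it from chat logs, one could flag follow-up turns containing dissatisfaction cues (the cue list below is invented):

```python
# Invented cue phrases; a real study would use annotation or a classifier.
NEGATIVE_CUES = ("more detail", "that's not what i", "try again", "you misunderstood")

def implicit_negative_feedback(follow_up_turn: str) -> bool:
    """Crude rule: does the user's follow-up contain a dissatisfaction cue?"""
    low = follow_up_turn.lower()
    return any(cue in low for cue in NEGATIVE_CUES)
```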
It’s rare for competitors to collaborate. Yet that’s exactly what OpenAI and @AnthropicAI just did—by testing each other’s models with our respective internal safety and alignment evaluations. Today, we’re publishing the results.
Frontier AI companies will inevitably compete on…
We also have a very similar and maybe simpler observation in our recent paper
Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces
arxiv.org/abs/2507.09709
In fact we can build very effective guardrails using the subspace observation…
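The paper's guardrail construction is its own; an illustrative version of the general recipe is to fit a low-dimensional basis on hidden states of known-unsafe prompts and flag inputs whose activations project heavily onto it. Dimensionality and threshold below are placeholders:

```python
import numpy as np

def fit_unsafe_subspace(unsafe_acts: np.ndarray, k: int = 8) -> np.ndarray:
    """Top-k principal directions of unsafe-prompt activations."""
    X = unsafe_acts - unsafe_acts.mean(axis=0)
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return Vt[:k]  # (k, hidden_dim), orthonormal rows

def subspace_mass(act: np.ndarray, basis: np.ndarray) -> float:
    """Fraction of the activation's norm captured by the subspace."""
    return float(np.linalg.norm(basis @ act) / (np.linalg.norm(act) + 1e-8))

def flag(act, basis, threshold=0.5):  # 0.5 is an arbitrary placeholder
    return subspace_mass(act, basis) > threshold
```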
Introducing Generative Interfaces - a new paradigm beyond chatbots.
We generate interfaces on the fly to better facilitate LLM interaction, so no more passive reading of long text blocks.
Adaptive and Interactive: creates the form that best adapts to your goals and needs!
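The paper's generation pipeline is its own; the general pattern is prompting the model to emit a structured UI spec instead of prose, then rendering it. The JSON schema below is invented for illustration:

```python
import json

# Invented spec an LLM might emit in place of a long text answer.
spec = json.loads("""
{
  "type": "form",
  "title": "Trip planner",
  "fields": [
    {"label": "Destination", "widget": "text"},
    {"label": "Budget (USD)", "widget": "slider", "min": 0, "max": 5000}
  ]
}
""")

def render(spec):
    """Console stand-in for a real interface renderer."""
    print(f"[{spec['type']}] {spec['title']}")
    for field in spec["fields"]:
        print(f"  - {field['label']} ({field['widget']})")

render(spec)
```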
NEW: A major AI copyright legal showdown just took a huge twist today. Facing a class action on behalf of book authors that could've seen it pay over a TRILLION in damages for alleged piracy, Anthropic has agreed to settle instead: wired.com/story/anthropi…
New Anthropic research: filtering out dangerous information at pretraining.
We’re experimenting with ways to remove information about chemical, biological, radiological and nuclear (CBRN) weapons from our models’ training data without affecting performance on harmless tasks.
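The actual pipeline is Anthropic's; generically, pretraining-data filtering is a scored pass over documents. The keyword scorer below is a stand-in for a trained classifier:

```python
# Stand-in term list; a real filter would use a trained classifier,
# not keyword matching.
FLAGGED_TERMS = ("enrichment cascade", "nerve agent synthesis")

def keep_document(doc: str) -> bool:
    """Drop any document that matches a flagged term."""
    low = doc.lower()
    return not any(term in low for term in FLAGGED_TERMS)

corpus = ["a harmless cooking blog post", "notes on nerve agent synthesis"]
print([d for d in corpus if keep_document(d)])  # keeps only the first doc
```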
Not all benchmarks are equally useful for comparing LMs—some are noisy, others show little signal. This work, led by @heinemandavidj, analyzes 30 datasets across 900k OLMo training checkpoints and quantifies their signal-to-noise ratio.
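We haven't reproduced the paper's exact definition, but a common way to quantify a benchmark's signal-to-noise ratio is to compare score spread across distinct models (signal) with score jitter across adjacent checkpoints of one run (noise):

```python
import numpy as np

def benchmark_snr(final_scores, late_checkpoint_scores):
    """Illustrative SNR: between-model spread over within-run jitter.
    The paper's exact formulation may differ."""
    signal = np.std(final_scores)            # spread across models
    noise = np.std(late_checkpoint_scores)   # step-to-step fluctuation
    return signal / (noise + 1e-8)

# e.g. four models' final accuracies vs. five late checkpoints of one run
print(benchmark_snr([0.42, 0.55, 0.61, 0.48],
                    [0.54, 0.56, 0.55, 0.53, 0.55]))
```

A high-SNR benchmark separates models cleanly; a low-SNR one mostly reflects checkpoint noise.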
Excited to be at #CHI2025. I will be presenting our work (Best Paper Honorable Mention) tomorrow, April 29th (11:10-11:22), moderated by @huashen218
Location: G416 + 417
I am also recruiting PhD students and visiting researchers at @sbucompsc. DMs are open. Please say hi!!
Introducing OLMoTrace! Now you can trace back OLMo generations to the training data. This is a feature that open data and an open training recipe can unlock.
We’ve pushed explainability to the next level, combining open data and fast search through trillions of tokens. So proud of…
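OLMoTrace runs over an index of trillions of training tokens; as a toy stand-in for the matching step, here is a brute-force search for long verbatim overlaps between a generation and a small corpus (real systems use suffix-array-style indexes, not scans):

```python
def verbatim_spans(generation: str, corpus: str, min_words: int = 6):
    """Maximal word spans of `generation` that occur verbatim in `corpus`.
    Brute force for illustration only."""
    words = generation.split()
    spans, i = [], 0
    while i < len(words):
        longest = 0
        for j in range(len(words), i, -1):  # try the longest span first
            if " ".join(words[i:j]) in corpus:
                longest = j - i
                break
        if longest >= min_words:
            spans.append(" ".join(words[i:i + longest]))
            i += longest
        else:
            i += 1
    return spans
```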
We created SuperBPE🚀, a *superword* tokenizer that includes tokens spanning multiple words.
When pretraining at 8B scale, SuperBPE models consistently outperform the BPE baseline on 30 downstream tasks (+8% MMLU), while also being 27% more efficient at inference time.🧵
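The training recipe is in the paper; the core twist is letting BPE merges cross whitespace so frequent multi-word strings become single tokens. A toy merge loop showing the difference (this greedy pair-merging is textbook BPE, not the paper's implementation):

```python
from collections import Counter

def bpe_train(text, num_merges, allow_superwords=True):
    """Toy BPE: repeatedly merge the most frequent adjacent pair.
    With allow_superwords=True, merges may cross spaces, so strings
    like 'of the' can become single tokens."""
    seq = list(text)
    for _ in range(num_merges):
        pairs = Counter(zip(seq, seq[1:]))
        if not allow_superwords:  # classic BPE never merges across a space
            pairs = Counter({p: c for p, c in pairs.items()
                             if " " not in p[0] and " " not in p[1]})
        if not pairs:
            break
        (a, b), _ = pairs.most_common(1)[0]
        merged, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == (a, b):
                merged.append(a + b)
                i += 2
            else:
                merged.append(seq[i])
                i += 1
        seq = merged
    return seq
```

Fewer tokens per sequence is where the inference-efficiency gain comes from.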
🎄🎅starting tomorrow at 10 am pacific, we are doing 12 days of openai.
each weekday, we will have a livestream with a launch or demo, some big ones and some stocking stuffers.
we’ve got some great stuff to share, hope you enjoy! merry christmas.
This is not good: "Surprisingly, we observe a significant decline in LLMs’ reasoning abilities under format restrictions."
Link: arxiv.org/abs/2408.02442
We’ve developed Rule-Based Rewards (RBRs) to align AI behavior safely without needing extensive human data collection, making our systems safer and more reliable for everyday use. openai.com/index/improvin…
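The RBR paper has its own formulation; the gist is scoring completions against explicit behavior rules and using the combined score as a reward. A toy version with invented rules and weights:

```python
# Invented rules: (name, predicate, weight). A real RBR setup would use
# graded rubrics, not substring checks.
RULES = [
    ("refuses_unsafe_request", lambda t: "i can't help with that" in t.lower(), 1.0),
    ("no_judgmental_tone", lambda t: "you should be ashamed" not in t.lower(), 0.5),
]

def rule_based_reward(completion: str) -> float:
    """Weighted fraction of satisfied rules, usable as an RL reward."""
    total = sum(w for _, _, w in RULES)
    return sum(w for _, rule, w in RULES if rule(completion)) / total
```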
967 Followers · 594 Following · PhD student at @LTIatCMU / @SCSatCMU, she/her, prev. @UVA and intern @ai2_allennlp
@/clara on https://t.co/GHxXbrRHSB and @/clarana on https://t.co/47UIhMGaRd
2K Followers · 2K Following · Assistant Professor @sutdsg, working on online trust & safety, computational social science, and social NLP. Currently leading the Social AI Studio.
137 Followers · 148 Following · UW NLP | MATS Scholar | Comp. Psyc/Social Sci. |
ai values ↔ human values • value alignment for the good of humanity |
Working on eval • data collection in wild
108 Followers · 130 Following · CSE PhD student @hkust in her second year, advised by @junxian_he. Machine learning, NLP.
bluesky here: https://t.co/ECxlKtKTxz
76K Followers · 13K Following · Newsletter exploring AI & ML - AI 101, Agentic Workflow, Business insights. From ML history to AI trends. Led by @kseniase_. Know what you are talking about👇🏼
20K Followers · 452 Following · Physics of language models @ Meta (FAIR, not GenAI)
🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS
🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
16K Followers · 361 Following · Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
4K Followers · 949 Following · Professor @UChicagoCS @UChicago. Directing @ChicagoHAI, also part of @UChicagoCI. Visiting @AbridgeHQ. DM/email for Postdoc/PhD opportunities.
8K Followers · 711 Following · Assistant Professor MIT @medialab @MITEECS @nlp_mit || PhD from CMU @mldcmu @LTIatCMU || Foundations of multisensory AI to enhance the human experience.
288 Followers · 85 Following · Natural language processing researcher. Assistant Professor at Stony Brook University. Previous: Research Assistant Professor at TTIC, PhD from Harvard.
19K Followers · 1K Following · Agents @Meta MSL TBD Lab. Previously post-training research @OpenAI. I train LLMs to do things: deep research, ChatGPT agent, etc. CS PhD @LTIatCMU
37K Followers · 485 Following · Digital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.