Qihan Ren @jsonren00

Ph.D. candidate at SJTU @sjtu1896. Prev. Undergrad @sjtu1896 and @Umich. Interpretability, safe & trustworthy LLMs. nebularaid2000.github.io Shanghai, China Joined July 2024

Tweets

20
Followers

25
Following

87
Likes

23

Dongrui Liu @dong_rui39501

a day ago

Self-Evolving AI Risks "Misevolution" Even top LLMs (Gemini-2.5-Pro, GPT-4o) face this—agents drift into harm: over-refunding, reusing insecure tools, losing safety alignment. First study on this! arxiv.org/pdf/2509.26354

0 5 6 362 0

Download Image

Jiahao Qiu @JiahaoQiu99

2 months ago

🚀 Just released: "A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence"! We provide the first comprehensive review of agents capable of self-evolution—highlighting what, when, and how agents evolve, key benchmarks and applications, and future directions…

2 40 159 14K 94

Download Image

Qihan Ren @jsonren00

4 months ago

When agents can search for and learn new tools by themselves... Amazing paper from Jiahao. Really glad to have participated in this project, and congrats for taking top in GAIA! 🚀

Jiahao Qiu @JiahaoQiu99

4 months ago

When agents can search for and learn new tools by themselves... Amazing paper from Jiahao. Really glad to have participated in this project, and congrats for taking top in GAIA! 🚀

17 29 92 25K 42

Download Image

0 0 0 44 0

Jason Wei @_jasonwei

7 months ago

In today’s competitive product landscape, scientific understanding of models often lags behind speed of model deployment. If the goal is to train a deployable model (especially when bottlenecked by compute), it totally makes sense to make several changes at a time without…

16 22 296 35K 67

Qihan Ren @jsonren00

8 months ago

Can't agree more. Sparsity also plays an important role in explanations🤔

Yi Ma @YiMaTweets

8 months ago

Can't agree more. Sparsity also plays an important role in explanations🤔

10 50 387 45K 391

0 0 1 30 0

Jiayi Pan @jiayi_pirate

8 months ago

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM develops self-verification and search abilities all on its own You can experience the Ahah moment yourself for < $30 Code: github.com/Jiayi-Pan/Tiny… Here's what we learned 🧵

194 1K 6K 1.7M 6K

Download Image

GREG ISENBERG @gregisenberg

8 months ago

DeepSeek just proved the 'worthless' GPT wrapper startups are actually the ones with real moats. A week ago, nothing was more LOW status than being a 'GPT wrapper' startup. But I think we're learning that's DEAD wrong. Turns out they were just early to the only game that…

498 890 8K 1.3M 4K

Jiao Sun @sunjiao123sun_

8 months ago

I read the DeepSeek-R1 paper the day it came out, and I don’t think GRPO is the key to its success. Instead, here’s what truly matters (ranked by importance): 1. Iterative RL and SFT 2. A hybrid reward model—mixing rule-based RM and neural RM for deterministic tasks 3.…

70 458 3K 419K 2K

Yao Fu @Francis_YAO_

10 months ago

Don’t race. Don’t catch up. Don’t play the game. Instead, do rigorous science. Do controlled experiments. Formulate clear hypothesis. Carefully examine alternative hypothesis. Rule out confounders. Listen to the physics of LLM tutorial 10 times and recite every single word of it.…

Wenhu Chen @WenhuChen

10 months ago

37 10 174 134K 39

13 152 1K 133K 472

Zhuang Liu @liuzhuang1234

10 months ago

How far is an LLM from not only understanding but also generating visually? Not very far! Introducing MetaMorph---a multimodal understanding and generation model. In MetaMorph, understanding and generation benefit each other. Very moderate generation data is needed to elicit…

24 137 726 249K 545

Download Image

garreth @garrxth

10 months ago

🚀 With Meta's recent paper replacing tokenization in LLMs with patches 🩹, I figured that it's a great time to revisit how tokenization has evolved over the years using everyone's favourite medium - memes! Let's take a trip down memory lane! [1/N]

20 226 2K 434K 2K

Download Image

Qihan Ren @jsonren00

10 months ago

I was not at #NeurIPS2024 due to visa issues. But it was really sad to see this kind of biased claims in a top conference 😢 A thumb-up to the one who pointed it out.

Jiao Sun @sunjiao123sun_

10 months ago

I was not at #NeurIPS2024 due to visa issues. But it was really sad to see this kind of biased claims in a top conference 😢 A thumb-up to the one who pointed it out.

182 809 4K 2.2M 523

Download Image

0 0 2 267 0

Qihan Ren @jsonren00

10 months ago

Final stop of this trip @Penn. Thanks to Prof. Weijie Su @weijie444 for hosting, and Yangxinyu @Xinyu51689497 for the detailed discussion. It's a pity to forget to take any photos though😭

0 0 1 30 0

Qihan Ren @jsonren00

10 months ago

Gave a talk at Social Cognitive AI (SCAI) Lab @JohnsHopkins with Prof. Tianmin Shu @tianminshu. Many interesting feedbacks from the view of human-centered AI!

0 0 1 27 0

Download Image

Qihan Ren @jsonren00

10 months ago

Short visit and talk @UCBerkeley last week with Prof. Kannan Ramchandran and Justin Kang. Really excited to have someone working in the same direction!

0 0 0 42 0

Download Image

Qihan Ren @jsonren00

11 months ago

Glad to gave a talk on XAI at Melady Lab @USC. Thanks to Prof. Yan Liu @yanliu_usc and Defu @caodefu_dove for hosting. Thanks to James @EnouenJames for the in-depth questions!

1 0 3 68 0

Download Image

Qihan Ren @jsonren00

11 months ago

💡Thrilled to deliver a talk @UCLA. Thanks to Prof. Yingnian Wu for hosting! This is a really nice experience. Photos from Yasi Zhang.

1 0 2 50 0

Download Image

Quanshi Zhang @QuanshiZhang

11 months ago

How to explain a DNN’s generalization ability and learning dynamics through the lens of interaction concepts? Our recent works (arxiv.org/abs/2405.10262, and arxiv.org/abs/2407.19198 in #NeurIPS2024) discover and theoretically prove a two-phase dynamics of interaction concepts…

0 2 1 120 0

Download Image

Quanshi Zhang @QuanshiZhang

11 months ago

Can the inference logic of a DNN be faithfully explained as symbolic concepts? Our #ICLR2024 paper (arxiv.org/abs/2305.01939) makes an initial theoretical attempt to address this question. We prove that under three sufficient conditions, a DNN only encodes a small number of…