Wenting Zhao @wzhao_nlp
reasoning & llms @Alibaba_Qwen Opinions are my own wenting-zhao.github.io NYC Joined June 2013-
Tweets450
-
Followers5K
-
Following606
-
Likes387
.@RichardSSutton, father of reinforcement learning, doesn’t think LLMs are bitter-lesson-pilled. My steel man of Richard’s position: we need some new architecture to enable continual (on-the-job) learning. And if we have continual learning, we don't need a special training…
while we are on this, rmb we also had: - Neural Architecture Search with Reinforcement Learning arxiv.org/abs/1611.01578 - Symbolic Discovery of Optimization Algorithms arxiv.org/abs/2302.06675 - Using Large Language Models for Hyperparameter Optimization arxiv.org/abs/2312.04528 -…
while we are on this, rmb we also had: - Neural Architecture Search with Reinforcement Learning arxiv.org/abs/1611.01578 - Symbolic Discovery of Optimization Algorithms arxiv.org/abs/2302.06675 - Using Large Language Models for Hyperparameter Optimization arxiv.org/abs/2312.04528 -…
Congrats to the codellama team on the release! Some real good stuff
Congrats to the codellama team on the release! Some real good stuff
QWEN-3 MAX is so good this level of details was only generated by gemini deepthink before one shottes 3d simulation of a procedurally generated mini planet
I’m seriously so eager to learn kernel programming, but one thing I couldn’t decide is whether I should be an expert on it myself or teach AI to be really good at it…😶🌫️
@simonw I was a bit surprised it is less than case than I expected. Code is KING. It’s the primary means of processing digital information - long term I can’t imagine a more important domain for the AGI pilled. And it is highly valuable in the interim too - big TAM @ high salaries.…
We’ve been cooking 👩🍳🥘
What strikes me in the work is that as long as the data recipe is right everything can just work with RL, generalizes super well, even at 1.7B level. Even people said it’s hard to improve RL’ed qwen models, we just did it! Thanks @_akhaliq for featuring my work the third time…
What strikes me in the work is that as long as the data recipe is right everything can just work with RL, generalizes super well, even at 1.7B level. Even people said it’s hard to improve RL’ed qwen models, we just did it! Thanks @_akhaliq for featuring my work the third time…
🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B.(esp. @ 32K+ context!) 🔹Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed &…
The online RL is so cool, guess the moat is to have a large number of users 😏
The online RL is so cool, guess the moat is to have a large number of users 😏
I learned so much from this as an ML (not-system-ish) researcher. highly recommend a read!!
I’ve recently joined @Alibaba_Qwen! We’re building the next generation of frontier models through careful science and world-class engineering, and we are making rapid progress. Excited for what’s ahead 💜
🌀New Test-time scaling method 🌀 📝: arxiv.org/abs/2509.06870 - Use RL to train an LLM solution aggregator – Reasons, reviews, reconciles, and synthesizes a final solution -> Much better than existing techniques! - Simple new method. Strong results across 4 math benchmarks. 🧵1/5
Mirage or method? We re-assess a series of RL observations such as spurious reward, one-shot RL, test-time RL, and negative-sample training. 🧐These approaches were all proved on Qwen+Math combination originally, but do they work in other settings? If not, under which…
Feels like everyone is slowly admitting that there's no moat in foundational models, and the only way to build a business out of AI is to build products.
Feels like everyone is slowly admitting that there's no moat in foundational models, and the only way to build a business out of AI is to build products.
seems like worktrees / workspaces are going to be essential if we're going to have 20 agents going at once.
I've always been skeptical about PRMs, but being able to apply RL+reasoning changes the entire story for me. It was a fun ride with @weixiong_1, who has been teaching me a unified view to think about all RL methods. He'll be on the job market! It'd be so lucky to work with him.
I've always been skeptical about PRMs, but being able to apply RL+reasoning changes the entire story for me. It was a fun ride with @weixiong_1, who has been teaching me a unified view to think about all RL methods. He'll be on the job market! It'd be so lucky to work with him.
Being disliked is not a weakness. Needing to be liked is.

cogwire @cog_wire
113 Followers 1K Following
Rajan @sockzonqp
112 Followers 62 Following MS CS + AI researcher @Stanford · Prev @Scale_AI · BS CS @GeorgiaTech · Now building agentic RAG/search @ContextualAI (new acc)
Ruiling Guo @RuilingGuo18833
7 Followers 143 Following
Febria Roosita Dwi @febria_roo
648 Followers 2K Following Developer & Startup Programs | Co-organizer of @jakartajs, @seamlschool | Startup & technology ethusiast
Marko Tasic @mtasic85
588 Followers 1K Following Technology Principal at Coming AI. System Architect. Software Dev. AI/ML Specialist. Tech Consultant. SMBs & Enterprises. Opinions are my own.
Vineet Jain @thevineetjain
701 Followers 432 Following PhD candidate @Mila_Quebec and @mcgillu. Previously @valence_ai @Bosch_AI @mldcmu
olive @olive_learns
1 Followers 113 Following
Benhao Huang @huskydogewoof
93 Followers 722 Following M.S. student @mldcmu, Prev. @sjtu1896 | Opinions approved by my puppy.
Madison @ChaosGoddess769
486 Followers 1K Following AI Scientist researching Video & Multi-Modal LLMs. Learning Leadership & Writing. $TSLA Investing. Practicing Meditation & Health. A Physicist at Heart.
🖤❤️🔥🖤 @whisperowt
159 Followers 2K Following Grok and Capcut made i only use Grok created images that i tell it to create or images i find on Grok ~or pictures i take literally with a camera. 18+ only~
Tim Shi @timshi_ai
7K Followers 3K Following something new! early @openai ‘16, cofounder @cresta ($1B+) (@sequoia @a16z)
Thomas Kariert @tkariert
74 Followers 2K Following
Stanislav Fort @stanislavfort
14K Followers 7K Following Building in AI + security | Stanford PhD in AI & Cambridge physics | ex-Anthropic and DeepMind | scientific progress + economic growth | 🇺🇸🇨🇿
Arian Hosseini @arianTBD
2K Followers 326 Following Research Scientist @GoogleDeepMind - LLM reasoning and alignment - prev: @Google @MSFTResearch
Nicholas Sainsbury @NicholasSa92591
25 Followers 102 Following
Harman Singh @Harman26Singh
998 Followers 2K Following PhD student @berkeley_ai, Prev: Gemini @GoogleDeepMind, AI Resident @MetaAI. Creating intelligence.
Sahil Khose @SahilKhose
886 Followers 2K Following PhD CS at Georgia Tech @ICatGT Advisor: Prof. @judyfhoffman ECCV 2024 SkyScenes 🤗: https://t.co/fHQm3ITmmz
Mik @MikhailObv51347
9 Followers 876 Following
Siddarth Venkatraman @siddarthv66
453 Followers 423 Following PhD at Mila | RL and other stuff I find interesting
Chen Cheng @cherry_cc12
1K Followers 111 Following maintainer of modelscope community, contributor of Qwen
Nil @Nil142296146312
0 Followers 22 Following
Toheart @Toheart724
7 Followers 426 Following
Jashan @jashan702
0 Followers 37 Following Learning about mathematics, machine learning, and software engineering
Boyu Zhu @BoyuZ52287
1 Followers 46 Following
Gabriel Synnaeve @syhw
16K Followers 1K Following Nerd & Dad. RL & CodeGen research since before it was cool.
Iris Duong/Credit & C... @CryptoETFGuide
1 Followers 48 Following Credit & Capital market Associate @ Harvest partners/ Ex-Goldman Sachs & and Houlihan Lokey/ Sharing insights on crypto. https://t.co/XMIPygpxxt
Alexander Pondaven @alexpondaven
82 Followers 508 Following Working on controllable video generation. PhD student @UniofOxford @aims_oxford @OxfordTVG @Snap. MEng @Imperialcollege
freaky fish guy @samliu
358 Followers 1K Following building creativity ex- perception @waymo, spam&abuse @google https://t.co/T1kjswGzvU
Schwinn @szawinis
48 Followers 286 Following CS @ Stanford // prev founded Sidenote (YC S23), AI @ NVIDIA
Yixuan "Tom" Wang @tom_yixuan_wang
78 Followers 383 Following computational linguistics & nlp phd student @UWCheritonCS w/ @fredahshi previously @PKU1898 & @UChicago
Lantao Yu @complex_filter
7 Followers 51 Following a Computer Scientist at Adobe, a contributor of Project Indigo, Ex-intern at MERL, Facebook.
Yu Fei @ Amazon Rufus @Walter_Fei
216 Followers 295 Following PhD student @UCIrvine working on NLP/ML. Previously: ms @ETH, undergrad @PKU1898
xyyzxxyzzxyy @xyyzxxyzzxyy
0 Followers 482 Following
Kanishk Singh @kanishksin
49 Followers 345 Following CS UG '17-'22 @IITKgp Research Engineer Intern @Amazon | Prev: Summer Intern '21 @AdobeResearch
Theonewhomadethings @laplaceFactor
157 Followers 5K Following Mechatronics engineer by day | Interests: Classical Control, SLAM, Depth, RL, World Models | focus: Robot Manipulators and Mobile Robots
Fedor Kovalev @KesselmanF
103 Followers 535 Following Media exec turned AI nerd Living life as conscious as possible.
Ayaan Sharif @Ayaan_Shariif
171 Followers 2K Following Ai engineer @XfiniteOfficial vibe coding @KaivoAi
json @CaptJson
50 Followers 842 Following
Sümer Cip @sumercip
435 Followers 2K Following Sr Sw Eng working on observability. Specially interested in observability tools & databases & low-level stuff. Dad. Sports nerd, specially Squash.
Liangchen Luo @LiangchenLuo
5K Followers 126 Following @xAI reasoning; ex @GoogleDeepMind. B.Sc. @PKU1898. Opinions are my own.
Gabriel Synnaeve @syhw
16K Followers 1K Following Nerd & Dad. RL & CodeGen research since before it was cool.
Kunhao Zheng @KunhaoZ
1K Followers 648 Following The real AGI is the friends we make along the way. PhD in FAIR CodeGen @AIatMeta. Alumni: @Huggingface, Sea AI Lab, @openai, École Polytechnique, SJTU
Ning Ding @stingning
3K Followers 326 Following Researcher of AI/LM. Assistant Professor @Tsinghua_Uni. Working on scalable methods of language models.
Brendan (can/do) @BrendanFoody
12K Followers 459 Following ceo @mercor_ai | labor markets fascinate me
Junrong Lin @OcssLin
101 Followers 311 Following MTS @Alibaba_Qwen on MLsys, building SGLang @lmsysorg | Prev. @DukeU
Chen Cheng @cherry_cc12
1K Followers 111 Following maintainer of modelscope community, contributor of Qwen
Zijian Wang @zijianwang30
619 Followers 397 Following Science Manager at AWS AI Labs. Training code LLM/agents. Organizer of @DL4Code at ICLR and @LLM4Code at ICSE Past @StanfordNLP @StanfordSymSys @UMich @SJTU1896
Yang Su @YangSu2000
29 Followers 160 Following Building Qwen models @Alibaba_Qwen 🥝 | Agent & Code RL, Mid-Training
Chujie Zheng @ChujieZheng
6K Followers 305 Following Researcher @Alibaba_Qwen | GSPO, Qwen3, QwQ, ProcessBench | Opinions are my own
Tianbao Xie @TianbaoX
3K Followers 2K Following Ph.D. candidate @XLangNLP lab and @hkunlp2020 . Incoming @OpenAI . Advised by @taoyds and @ikekong . 🤝 @Alibaba_Qwen @SFResearch
Weizhe Yuan @WeizheY
341 Followers 297 Following Ph.D. at @nyuniversity. Visiting researcher at @AIatMeta. Previous Intern @cohere, MCDS @LTIatCMU. Working on ML/NLP. Painting lover🎨.
Shuchao Bi @shuchaobi
13K Followers 691 Following Research @Meta Superintelligence Labs, RL/post-training/agents; Previously Research @OpenAI on multimodal and RL; Opinions are my own.
You Jiacheng @YouJiacheng
8K Followers 2K Following a big fan of TileLang 关注TileLang喵!关注TileLang谢谢喵! https://t.co/utshC0jrCO 十年老粉
Kevin Lu @_kevinlu
9K Followers 220 Following @thinkymachines. formerly: - @openai: RL, synthetic data, efficient models - @berkeley_ai: decision transformer, universal computation
Yiheng Xu @yihengxu_
1K Followers 711 Following ai agent research @hkuniversity | scaling agent @Alibaba_Qwen | ex @msftresearch @sfresearch | from automation to autonomy
Fan Zhou @FaZhou_998
1K Followers 834 Following Qwen Coding @Alibaba_Qwen. Prev: Core member @XLangNLP, Intern @MSFTResearch.
Yuchen Jin @Yuchenj_UW
57K Followers 564 Following Co-founder & CTO @hyperbolic_labs cooking fun AI systems. Prev: OctoAI (acquired by @nvidia) building Apache TVM, PhD @ University of Washington.
martin_casado @martin_casado
69K Followers 3K Following GP @ a16z ... questionable heuristics in a grossly underdetermined world
Donglai Xiang @DonglaiXiang
2K Followers 903 Following Research Scientist at Nvidia. Previously Ph.D. from Carnegie Mellon University; visiting researcher at Meta Reality Labs.
Johannes Treutlein @j_treutlein
348 Followers 172 Following AI alignment stress-testing research @AnthropicAI. On leave from my CS PhD at UC Berkeley, @CHAI_Berkeley. Opinions my own.
Umar Jamil @hkproj
15K Followers 1K Following AI @MistralAI - Join the best AI community on Discord: https://t.co/zYH1DlgdbW - Opinions my own
Shengjia Zhao @shengjia_zhao
52K Followers 231 Following Chief Scientist @ Meta MSL. Formerly MTS @ OpenAI, PhD @ Stanford. I train models. All opinions my own.
Sebastian Ruder @ ACL @seb_ruder
93K Followers 1K Following Research Scientist @AIatMeta • Ex @Cohere @GoogleDeepMind
Rohan Pandey @khoomeik
39K Followers 2K Following descending cross-entropy to ascend entropy || prev research @OpenAI @CarnegieMellon '23
Tanay Jaipuria @tanayj
70K Followers 3K Following partner @wing_vc investing in AI applications and infra. opinions, analysis, and banter on technology and business
Jingfeng Wu @uuujingfeng
1K Followers 1K Following Bsky: https://t.co/hUrRPJZ9BU Postdoc @SimonsInstitute @UCBerkeley; alumnus of @JohnsHopkins @PKU1898; DL theory, opt, and stat learning.
Pranjal Aggarwal ✈�... @PranjalAggarw16
483 Followers 110 Following PhD Student @LTIatCMU. research scientist intern @AIatMeta FAIR. Working on reasoning, computer-use agents and test-time compute. Prev @IITD
Tianjian Li @tli104
325 Followers 605 Following PhD student @jhuclsp, research scientist intern @AIatMeta FAIR. Previously @nyuniversity.
Zhijian Liu @zhijianliu_
2K Followers 824 Following Research Scientist @NVIDIA. Assistant Professor @UCSanDiego. PhD @MIT. Efficient AI. Views are my own.
Narutatsu (Edward) Ri @narutatsuri
426 Followers 248 Following PhD Student @PrincetonPLI | BS @Columbia ‘24
Valerie Chen @valeriechen_
2K Followers 513 Following phd student @mldcmu @SCSatCMU + intern @allhands_ai | building @CopilotArena | previously @NYUDataScience @MSFTResearch @yale @CMU_Robotics @IBMResearch
Mike A. Merrill @Mike_A_Merrill
658 Followers 306 Following Postdoc @StanfordAILab Building https://t.co/KWJvsMlWva with @alexgshaw and many others Go Bills
Nishant Subramani @nsubramani23
789 Followers 2K Following PhD student @LTIatCMU working on model interpretability; student researcher @google // Prev: intern @msftresearch, predoc @allen_ai // @BVB supporter // he/him
Jascha Sohl-Dickstein @jaschasd
24K Followers 712 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
Sebastian Raschka @rasbt
358K Followers 1K Following ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
Yoram Bachrach @yorambac
3K Followers 7K Following Research Scientist at Meta (prev Google DeepMind and Microsoft Research). Working on LLM Agents and Multi-Agent Systems.
Alexander Doria @Dorialexander
19K Followers 4K Following Reasoning models to come. Co-founder @pleiasfr
OpenAI @OpenAI
4.4M Followers 3 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
SSI Inc. @ssi
102K Followers 0 Following A straight shot to safe superintelligence. Join us https://t.co/hHla3vusDE.