Lai Dang Quoc Vinh @LDQuocVinh
Daejeon, Republic of Korea Joined February 2021
Tweets 159
Followers 23
Following 402
Likes 1K
Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”. We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…
Cool research from Microsoft! They release rStar2-Agent, a 14B math reasoning model trained with agentic RL. It reaches frontier-level math reasoning in just 510 RL training steps. Here are my notes:
Towards a Unified View of LLM Post-Training This work proposes Hybrid Post-Training, which switches between RL and SFT using simple performance feedback to balance exploration and exploitation. More below:
RL’s Razor: On-policy RL forgets less than SFT. Even at matched accuracy, RL shows less catastrophic forgetting. Key factor: RL’s on-policy updates bias toward KL-minimal solutions. Theory + LLM & toy experiments confirm RL stays closer to base model.
@jasonth0 I love doing this actually :). I think it's a pretty powerful eval too. Have all models generate something, then put it all together and give it back to all of them and ask them to rank all outputs. I thought models might have a bias to prefer their own outputs, but this doesn't…
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization "We present DuPO, a dual learning-based preference optimization framework that generates annotation-free feedback via a generalized duality" "DuPO decomposes a primal task’s input into known and…
Couldn't resist. Here's a pure PyTorch from-scratch re-implementation of Gemma 3 270M in a Jupyter Notebook (uses about 1.49 GB RAM): github.com/rasbt/LLMs-fro… https://t.co/9vF9U29pWh
Introducing Jan-v1: 4B model for web search, an open-source alternative to Perplexity Pro. In our evals, Jan v1 delivers 91% SimpleQA accuracy, slightly outperforming Perplexity Pro while running fully locally. Use cases: - Web search - Deep Research Built on the new version…
Presenting the GLM-4.5 technical report!👇 arxiv.org/abs/2508.06471 This work demonstrates how we developed models that excel at reasoning, coding, and agentic tasks through a unique, multi-stage training paradigm. Key innovations include expert model iteration with…
Top AI Papers of The Week (August 4-10): - CoAct-1 - ReaGAN - Agentic Web - Seed Diffusion - Efficient Agents - A Taxonomy of Hallucinations - Unified Retrieval Agent for AI Search Read on for more:
I'm thrilled to announce the definitive course on Claude Code, created with @AnthropicAI and taught by Elie Schoppik @eschoppik. If you want to use highly agentic coding - where AI works autonomously for many minutes or longer, not just completing code snippets - this is it.…
China just dropped an absolute bombshell AI for Math paper: not just Gold in IMO 2025, but >50% of all Putnam and 78% of all past IMO problems. It BEATS Google's AlphaGeometry2, achieves 100% on OpenAI's miniF2F. Uses Lean for proofs and novel approaches for geometry. And it's…
9 new policy optimization techniques ▪️ GSPO ▪️ LAPO ▪️ HBPO ▪️ SOPHIA ▪️ RePO ▪️ CISPO ▪️ PAPO ▪️ OPO ▪️ EXPO Save the list, and check this out for the links and more info: huggingface.co/posts/Kseniase…
Gemini 2.5 Pro Capable of Winning Gold at IMO 2025 This work shows that you can really push LLMs to achieve remarkable performance on hard tasks. Self-verification and careful orchestration did the trick here! Good insights for devs on this one. Here are my notes:
Introducing GLM-4.5 and GLM-4.5 Air: new flagship models designed to unify frontier reasoning, coding, and agentic capabilities. GLM-4.5: 355B total / 32B active parameters GLM-4.5-Air: 106B total / 12B active parameters API Pricing (per 1M tokens): GLM-4.5: $0.6 Input / $2.2…
Finally, a good modern book on causality for ML: causalai-book.net by @eliasbareinboim. This looks like a worthy successor to the groundbreaking book by @yudapearl which I read in grad school. (h/t @JoshuaSafyan for the ref).
Beautiful @GoogleResearch paper. LLMs can learn in context from examples in the prompt, can pick up new patterns while answering, yet their stored weights never change. That behavior looks impossible if learning always means gradient descent. The mechanisms through which this…
New Anthropic research: Building and evaluating alignment auditing agents. We developed three AI agents to autonomously complete alignment auditing tasks. In testing, our agents successfully uncovered hidden goals, built safety evaluations, and surfaced concerning behaviors.
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

Live Update @LiveUpdate_EM
83 Followers 7K Following Official Elon Musk Communication Channel This is the official account to directly communicate with Elon Musk.
BuffettStyle🇺🇸 @Seauweek697575
50 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Helena R.S @Helenaisgood
892 Followers 7K Following Mom of a beautiful twin, lover girl and a sweet soul ....#itistimeforpeace ✡️✡️
Ormomoon @Ormomoon772
37 Followers 674 Following
Sheighez @SheighezV5lhTY
6 Followers 185 Following
Sairshes @SairshesrFI
170 Followers 3K Following
RoxanneHope @8506qdd9G8a0i
63 Followers 7K Following
Tepseedawn @TepseedawnDBZ8
3 Followers 50 Following
Will Knottenbelt @knottenbeltwill
32 Followers 216 Following ML Engineer @ Speechmatics. Data Science @ Cambridge.
Alex Asch @solalexasch
677K Followers 566K Following Topline Music Producer and Songwriter ARTIST MANAGER . CEO @bluelightmgroup . Lover, Believer, Traveler, Coffee, Tattoos.
Sarvar Hussain @NengrooSarvar
288 Followers 322 Following Research fellow @KAISTPR. Working on Energy Storage & Conversion, Smart grid. #NEREC Research Fellow 2022.
Bute AI Cryptocurrenc... @w2NUOSY95F9cR3
3 Followers 260 Following No need to stay up late to watch the market; Experience 24 hours to earn 1k-10k profit! https://t.co/8wqXuwWapw
GeorgiaMacadam @0m9k2iaQhs3Zkq8
75 Followers 7K Following
RebeccaClara @r9Gj8CexF8J9Zi
55 Followers 5K Following
Mildred @worthymildred29
306 Followers 3K Following
Allison @allison69lynch
576 Followers 3K Following
slfshope @lakshit72470467
39 Followers 443 Following A client is a person or business that buys something from a third party. The main goal of a business is to get a group of customers
SeungAh Son @SeungAh_Son
1 Followers 11 Following
Jihui Lee @JH2_LEE_
2 Followers 35 Following
Bill Fajardo @BillHiruma10
383 Followers 5K Following EV specialist looking to innovate day in & day out. #ElectricVehicles are the future
Zephyr @zephyr_z9
32K Followers 506 Following Tech, AI, Semiconductors, Stocks, Finance. DMs are open
vLLM @vllm_project
19K Followers 20 Following A high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
Nikita Bier @nikitabier
606K Followers 2K Following head of product @x, advisor @solana, venture partner @lightspeedvp, ex-founder @gasappteam (acq by discord), ex-founder @thetbhapp (acq by facebook)
Tanishq Mathew Abraha... @iScienceLuvr
82K Followers 1K Following CEO @SophontAI | Founder @MedARC_AI | PhD at 19 (2023) | ex Research Director Stability AI | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6Qb
IBM @IBM
714K Followers 4K Following We don’t just imagine the future; we create it. With AI, hybrid cloud & quantum, we’re building a smarter world. Let’s create smarter business.
Guan Wang @makingAGI
5K Followers 36 Following CEO of Sapient Intelligence. Exploring the path to AGI through brain-inspired AI. 🧠🤖 #AGI #NeuroAI
Jeremy Howard @jeremyphoward
261K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Menlo Research @menloresearch
3K Followers 343 Following Anti-Robot Robot Club. Community: https://t.co/tQV4qpFX3h
Alan Dao @alandao_ai
346 Followers 25 Following AI Researcher at Menlo Research. Author of Jan, Lucy, Jan-nano, Ichigo, AlphaMaze, and various other works at Menlo Research.
👋 Jan @jandotai
11K Followers 979 Following Jan is the open-source ChatGPT replacement. We're building Open Superintelligence together. Community: https://t.co/NIyIbR60qQ
François Fleuret @francoisfleuret
46K Followers 487 Following Research Scientist @meta (FAIR), Prof. @Unige_en, co-founder @nc_shape. I like reality.
Ammaar Reshi @ammaar
62K Followers 2K Following Lead Product + Design @GoogleAIStudio // Exploring AI and sharing everything I learn // My views • 🇵🇰 🇺🇸
Hunyuan @TencentHunyuan
27K Followers 6 Following Tencent's large model, encompasses text generation, image generation, video generation, and 3D generation.
Claude @claudeai
136K Followers 1 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8dz3D or download the app.
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
机器之心 JIQIZHIX... @jiqizhixin
10K Followers 716 Following China's leading media & information provider for #AI & #MachineLearning
verl project @verl_project
1K Followers 5 Following Open RL library for LLMs. https://t.co/Xpaq0thhgi Join us on https://t.co/uWI5Zbd6IH
Z.ai @Zai_org
17K Followers 154 Following The AI lab behind GLM models, dedicated to inspiring the development of AGI to benefit humanity. https://t.co/b6zGxJvzzS
Elias Bareinboim @eliasbareinboim
14K Followers 584 Following Professor of Causal Inference, Machine Learning, and Artificial Intelligence. Director, CausalAI Lab @ Columbia University.
Google Research @GoogleResearch
23K Followers 6 Following Impossible? Let’s see. From algorithms to neuroscience to AI, Google Research strives to progress science, advance society & improve billions of people’s lives.
Minh Le @minhxle1
127 Followers 222 Following Research Fellow @AnthropicAI | Prev: @Parafin, @Robinhood
Binyuan Hui @huybery
35K Followers 662 Following 🥝 Building Qwen @Alibaba_Qwen. Focus on CodeLLM (Pre-training and Post-training) / Reasoning / Agent. Ideas my own.
NVIDIA AI Developer @NVIDIAAIDev
83K Followers 324 Following All things AI for developers from @NVIDIA. Additional developer channels: @NVIDIADeveloper, @NVIDIAHPCDev, and @NVIDIAGameDev.
Alexander Wei @alexwei_
24K Followers 194 Following Reasoning @OpenAI. Co-built CICERO @MetaAI | @Berkeley_AI PhD '23 | @Harvard '20
Wojciech Zaremba @woj_zaremba
121K Followers 204 Following Co-Founder of OpenAI https://t.co/OCQ3mpf0IN
Teortaxes▶️ (Deep... @teortaxesTex
45K Followers 3K Following We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1
Hieu Pham @hyhieu226
34K Followers 25 Following @openai | ex: @xai, @augmentcode, @GoogleBrain, @LTIatCMU, @Stanford, ACM ICPC, IMO🥈 Opinions are my own.
Ai2 @allen_ai
74K Followers 410 Following Breakthrough AI to solve the world's biggest problems. › Join us: https://t.co/MjUpZpKPXJ › Newsletter: https://t.co/k9gGznstwj
Judea Pearl @yudapearl
80K Followers 279 Following Student of causal inference, human reasoning, and history of ideas, all viewed through the sharp lens of artificial intelligence.
Jon Richens @jonathanrichens
1K Followers 320 Following Research scientist in AI safety @GoogleDeepMind
Tesla AI @Tesla_AI
407K Followers 18 Following
MiniMax (official) @MiniMax__AI
18K Followers 11 Following Our mission is to build a world where intelligence thrives with everyone. MiniMax Agent: https://t.co/XzaTmAos0V
Kimi.ai @Kimi_Moonshot
53K Followers 100 Following Built by Moonshot AI to empower everyone to be superhuman.
George @georgejrjrjr
4K Followers 1K Following writing about ml, neuro-woo, and tea. formerly: co-founder @ Vibecamp on Signal @ghw.01
Jack Morris @jxmnop
46K Followers 993 Following research @cornell // language models, information theory, science of AI
Chelsea Finn @chelseabfinn
83K Followers 399 Following Asst Prof of CS & EE @Stanford Co-founder of Physical Intelligence @physical_int PhD from @Berkeley_EECS, EECS BS from @MIT
Jürgen Schmidhuber @SchmidhuberAI
165K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
Mark Chen @markchen90
65K Followers 341 Following Chief Research Officer at @OpenAI. Coach for the USA IOI Team.
Deedy @deedydas
209K Followers 5K Following Partner @MenloVentures. Formerly founding team @glean, @Google Search. @Cornell CS. Tweets about tech, immigration, India, fitness and search.
Dwarkesh Patel @dwarkesh_sp
130K Followers 919 Following Host of @dwarkeshpodcast https://t.co/3SXlu7fy6N https://t.co/4DPAxODFYi https://t.co/hQfIWdM1Un