Khoa Dang @khoaddd

Systems & ML enthusiast, ML@MBZUAI merlin.bearblog.dev Joined May 2021

Tweets

286
Followers

23
Following

279
Likes

5K

Gergely Orosz @GergelyOrosz

2 years ago

My first manager at Uber started a GitHub page back at the time with resources to become a more proficient developer - ones he personally found helpful (he did not have a CS degree). I realized he is *still* updating it, 7 years later! A neat list: github.com/charlax/profes…

60 1K 10K 1.1M 17K

Download Image

Gabriel Synnaeve @syhw

a day ago

(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…

56 267 2K 682K 924

Nathan Chen @nathancgy4

a day ago

(1/6) triton kernels are a great way to understand ML models. but tutorials are scattered the learning method for me was jst to read real, high performance code so i wrote a blog which walkthroughs the design and intuitions behind FLA's softmax attention kernel 🧵also a thread

10 89 945 71K 1K

Download Image

Ara @arafatkatze

3 days ago

Here's the simplest explanation of @cline's agentic algorithm. It's just a state machine that classifies every request with a tool call into 3 types: 1. Question tools (need clarification) 2. Action tools (gather context) 3. Completion tools (present results) That's it.

18 48 675 54K 920

Download Image

Chi Le @chi_maile

2 days ago

Introducing EveryLab.ai Not your normal intern.

53 29 97 4K 14

Download Video

Ashlee Vance @ashleevance

6 days ago

.@neuralink does some crazy engineering on its way to performing brain surgeries. We wanted to document one of the more striking examples - This Is How Neuralink Builds A Human Head Full video down below

17 92 829 108K 284

Download Video

Jia Guo @Jia__Guo

6 days ago

🚨Training–inference mismatch in MoE RL? It gets even worse than we thought… But no worries—just grab an "IcePoP"🧊 and chill😉! Our new solution keeps MoE RL cool😎 & boosted🚀. Check it out! 📜Blog: ringtech.notion.site/icepop

3 17 92 12K 64

Download Image

Jason Spielman @jayspiel_

a week ago

Designing @NotebookLM was one of the most meaningful opportunities of my career. I finally found time to document the process. Here’s a look behind the scenes: 📐 The mental model is anchored in the creation journey: Inputs → Chat → Outputs. This simple yet flexible flow gave…

82 357 3K 246K 3K

Download Image

Grad @Grad62304977

2 weeks ago

@vikhyatk For well executed reasoning RL I would say: arxiv.org/abs/2505.22312 arxiv.org/abs/2506.13284 arxiv.org/abs/2508.06471 arxiv.org/abs/2504.13914 arxiv.org/abs/2508.08221 arxiv.org/abs/2505.08311 arxiv.org/abs/2506.13585 github.com/Tencent-Hunyua… honorable-payment-890.notion.site/POLARIS-A-POst……

24 161 2K 339K 4K

JingyuanLiu @JingyuanLiu123

2 weeks ago

I was lucky to work in both China and the US LLM labs, and I've been thinking this for a while. The current values of pretraining are indeed different: US labs be like: - lots of GPUs and much larger flops run - Treating stabilities more seriously, and could not tolerate spikes…

Charuru Charuru @CharuruCha14310

2 weeks ago

2 0 38 514K 23

59 343 3K 516K 2K

Rob Wiblin @robertwiblin

a week ago

Neel Nanda is leading a Google DeepMind research team at 26. He and I discuss: • How that happened • “If your safety work doesn't advance capabilities, it's probably bad safety work” • Should people work at the safest or most reckless AI company? • An AI PhD – with these…

12 61 633 109K 796

Download Video

Rosinality @rosinality

2 weeks ago

DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL Training web agents with data constructed using knowledge graphs (arxiv.org/abs/2507.02592).

5 73 430 67K 392

Download Image

Ivan Velichko @iximiuz

2 weeks ago

Building a Docker-like Container From Scratch 🐳 Learn about the key Linux namespaces by assembling a tiny but realistic container using only stock Linux commands: unshare, mount, and pivot_root. No runtime magic and (almost) no cut corners. labs.iximiuz.com/tutorials/cont…

14 243 2K 76K 1K

Download Image

Thinking Machines @thinkymachines

2 weeks ago

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

241 1K 8K 3.2M 5K

Download Image

samsja @samsja19

4 weeks ago

Next generation of 10B+ valuation product startup will be built by scaling training on in house RL environment We live in an abundance of capabilities and yet we only have two major AI products, chatgpt and coding agent, and it deeply frustrates me The current supply chain of…

Prime Intellect @PrimeIntellect

4 weeks ago

118 403 3K 1.4M 2K

Download Video

9 26 361 46K 147

Blake Scholl 🛫 @bscholl

4 weeks ago

x.com/i/article/1962…

132 304 3K 730K 3K

Matt Beton @MattBeton

4 weeks ago

Announcing EXO Gym: Simulate distributed training environments using just your laptop. Previously, distributed training experiments required setting up complex multi-node clusters. With EXO Gym, multiple virtual nodes are spawned within one device. 🧵

9 15 145 46K 79

Download Video

Hahnbee Lee @hahnbeeIee

4 weeks ago

so true @dakshgup

394 174 5K 3.1M 2K

Download Image

Daniel Han @danielhanchen

4 weeks ago

GPT-OSS bug fixes + Flex Attention support is here! 1. Fixed float16 infinite losses (>65504 overflows) 2. SWA=128 Flex default uses 129 tokens (extra 1) 3. Fixed MXFP4 inference swiglu_limit=7.0 not set 4. Sink token moved to index 0 5. FA3 doesn't have attn sink dX Details:…

Unsloth AI @UnslothAI

4 weeks ago

21 145 876 138K 405

Download Image

16 101 704 60K 343

Download Image

Ai2 @allen_ai

a month ago

Introducing Asta—our bold initiative to accelerate science with trustworthy, capable agents, benchmarks, & developer resources that bring clarity to the landscape of scientific AI + agents. 🧵