Yang Chen @ychenNLP

Research Scientist @NVIDIA | PhD @GeorgiaTech| RL and LLM reasoning edchengg.github.io Joined September 2018

Tweets

80
Followers

1K
Following

560
Likes

2K

Sam Altman @sama

5 days ago

Progress at our datacenter in Abilene. Fun to visit yesterday!

1K 991 11K 909K 907

Download Video

Elon Musk @elonmusk

6 days ago

@techdevnotes Just as we will be the first to bring a Gigawatt of coherent training compute online, we will also be the first to 10GW, 100GW, 1TW, …

354 607 6K 3.1M 483

We're proud to announce a landmark partnership with @OpenAI to build new gigascale AI factories using millions of NVIDIA GPUs. 🤝 This partnership will supply 10 gigawatts of GPUs to fuel @OpenAI's data center growth.

278 737 5K 2.8M 496

Download Image

Satya Nadella @satyanadella

2 weeks ago

If intelligence is the log of compute… it starts with a lot of compute! And that’s why we’re scaling our GPU fleet faster than anyone else. Just last year, we added over 2 gigawatts of new capacity – roughly the output of 2 nuclear power plants. And today we’re going further,…

612 1K 8K 1.4M 2K

Download Video

Elon Musk @elonmusk

2 weeks ago

@ns123abc @SemiAnalysis_ @BrentM_SpaceX 1TW is a start

72 116 1K 55K 45

Elon Musk @elonmusk

a month ago

Having thought about it some more, I think the 50 million H100 equivalent number in 5 years is about right. Eventually, billions.

Andree Jacobson @nmswede

a month ago

Having thought about it some more, I think the 50 million H100 equivalent number in 5 years is about right. Eventually, billions.

124 115 2K 11.3M 182

2K 3K 22K 11.5M 1K

Elon Musk @elonmusk

2 months ago

Cable pr0n of @xai GB200 servers at Colossus 2

9K 11K 151K 28.7M 10K

Download Image

Sam Altman @sama

2 months ago

we have signed a deal for an additional 4.5 gigawatts of capacity with oracle as part of stargate. easy to throw around numbers, but this is a _gigantic_ infrastructure project. some progress photos from abilene:

2K 2K 21K 2.1M 2K

Download Image

Zhuolin Yang @lucas110550

3 months ago

Our released evaluation toolkit can reproduce our AceReason-Nemotron models numbers (see below): AceReason-Nemotron-1.0-7B: LiveCodeBench (Avg@8): * [05/23-05/24]: 72.0; [06/24-01/25]: 54.2 * release set v5: 51.2; release set v6: 44.4 AIME (Avg@64): * AIME'24: 68.6; AIME'25:…

Yang Chen @ychenNLP

3 months ago

1 4 53 5K 35

0 4 9 1K 5

Yang Chen @ychenNLP

3 months ago

The first thing we did was to make sure the eval setup is correct! We spend a lot of time to make sure our eval can - accurately reproduce the DeepSeek-R1 numbers on AIME, LiveCodeBench - it's IMPOSSIBLE to track the RL progress without a good eval set up (e.g., we see AIME up…

Francesco Bertolotti @f14bertolotti

3 months ago

2 25 154 13K 139

Download Image

1 4 53 5K 35

Yang Chen @ychenNLP

3 months ago

📌Paper: arxiv.org/abs/2506.13284 📌Model: huggingface.co/nvidia/AceReas… 📌SFT Data: huggingface.co/datasets/nvidi… 📌Math RL Data: huggingface.co/datasets/nvidi… A series of our work on reasoning models: 📌5/22/2025: AceReason-Nemotron: Scaling RL for math and code (7B and 14B)…

0 3 11 782 5

Download Image

Zihan (Johan) Liu @zihan_johan_liu

3 months ago

With stronger SFT backbone, AceReason-Nemotron-1.1-7B significantly outperforms its predecessor and sets a record-high performance among Qwen2.5-7B-based reasoning models. 📄Report: arxiv.org/pdf/2506.13284 🤗Model: huggingface.co/nvidia/AceReas… 📚SFT Data: huggingface.co/datasets/nvidi…

Wei Ping @_weiping

3 months ago

2 16 69 6K 31

Download Image

1 8 25 2K 8

Wei Ping @_weiping

3 months ago

Introducing AceReason-Nemotron 1.1 Our previous release, AceReason-Nemotron-1.0, introduced a stage-wise RL recipe that was applied sequentially to math-only and code-only prompts, demonstrating both high efficiency and strong effectiveness. Here, we systematically investigate…

2 16 69 6K 31

Download Image

Zhuolin Yang @lucas110550

4 months ago

@etash_guha @ryanmart3n I tried to reproduce DS-R1-distilled-7B and AceReason-7B's performance on your split (06/24-01/25), and they turn out to be 41.9 and 54.6 correspondingly, which is obviously higher than your reported number. Anything wrong here? @etash_guha @ryanmart3n

1 2 3 262 2

Yang Chen @ychenNLP

4 months ago

Does RL incentive reasoning capability over the starting SFT model? We show an interesting result with our recent published AceReason-Nemotron-7B model, which was trained with RL pass@K from 1 to 1024 consistently +10% on LiveCodeBench v6 perhaps scaling RL is the key