It was thrilling to watch AI compete against some of the best human competitive programmers at AtCoder World Finals Heuristics yesterday. Check out @andresnds's thread on how the AI solutions improved throughout the 10h contest. Congrats to @FakePsyho on 1st place!
Two important points from our new technical report:
1. Scaling continues to work and the bitter lesson still holds
2. Recent AI models are strong at reasoning tasks and are rapidly becoming stronger — 4o was released less than a year ago, o1 less than six months ago
i generally feel super grateful that i get to work with such exceptionally skilled and kind people on reasoning research. the sprint for IOI in particular was special though. IOI 2024 gold @ 10k submissions; 49th percentile of competitors under real contest conditions
Today, I’m excited to share with you all the fruit of our effort at @OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/
But the Elo can ultimately become bounded by the difficulty of the prompts (i.e. you can't achieve arbitrarily high win rates on the prompt: "what's up"). We find on harder prompt sets — and in particular coding — there is an even larger gap: GPT-4o achieves a +100 Elo over our prior…
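For context on what a +100 Elo gap means, the standard Elo expectation formula converts a rating difference into an expected win rate. A minimal sketch (the function name is my own; the formula itself is the standard Elo expectation):

```python
def elo_win_rate(delta: float) -> float:
    """Expected win rate for a player rated `delta` Elo points higher."""
    return 1.0 / (1.0 + 10 ** (-delta / 400.0))

print(round(elo_win_rate(100), 2))  # +100 Elo ≈ 0.64 expected win rate
```

So a +100 Elo edge corresponds to winning roughly 64% of head-to-head comparisons, which is why the gap is easier to see on harder prompt sets where win rates aren't saturated.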
new preprint
"ReLU to the Rescue: Improve your On-policy Actor-Critic with Positive Advantages"
shockingly simple changes to A3C can yield a cautious RL algorithm more effective than PPO
in some settings, just adding a ReLU is enough!
arxiv.org/abs/2306.01460
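The "just adding a ReLU" idea can be sketched as passing advantage estimates through a ReLU, so only positive-advantage transitions drive the actor update. A minimal, hypothetical illustration (function name and toy numbers are my own, not from the paper):

```python
def positive_advantage_pg_loss(log_probs, advantages):
    """REINFORCE-style surrogate loss where negative advantages are
    zeroed out with a ReLU, so only positive-advantage transitions
    contribute to the policy update (the 'cautious' behavior)."""
    clipped = [max(a, 0.0) for a in advantages]  # ReLU on the advantages
    terms = [lp * a for lp, a in zip(log_probs, clipped)]
    return -sum(terms) / len(terms)  # negated: this loss is minimized

# toy check: the transition with a negative advantage contributes nothing
log_probs = [-0.1, -0.5, -0.3]
advantages = [1.0, -2.0, 0.5]
print(positive_advantage_pg_loss(log_probs, advantages))
```

This is only a sketch of the clipping idea under the assumptions above; see the preprint for the actual algorithm and analysis.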
The Google DeepMind alignment team is looking for research scientists and research engineers to help us work towards safe AGI.
I think this is a very pressing problem, and it's a nice place to work. Please apply and help take our work to the next level.
boards.greenhouse.io/deepmind/jobs/…
With more powerful AI systems comes more responsibility to identify novel capabilities in models. 🔍
Our new research looks at evaluating future 𝘦𝘹𝘵𝘳𝘦𝘮𝘦 risks, which may cause harm through misuse or misalignment.
Here’s a snapshot of the work. 🧵 dpmd.ai/novel-ai-risks