Adrien Laurent @alaurentg

Grok 4 (Thinking) achieves new SOTA on ARC-AGI-2 with 15.9% This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA

246 738 5K 7.3M 721

Download Image

I'm excited to announce what we have been working on for months. Announcing OpenThinker3, the strongest 7B reasoning model with open data. Also more than 1000 experiments on what works and what doesn't for post-training data curation.

Ryan Marten @ryanmart3n

4 months ago

33 194 927 195K 728

Download Image

5 27 253 18K 101

Cursor @cursor_ai

6 months ago

Gemini 2.5 Pro is available to all Cursor users! You can enable the full 1M context window if you'd like. We're curious to hear how you think it compares to Sonnet.

365 478 9K 1.1M 1K

Alex Albert @alexalbert__

7 months ago

We've introduced a new text_editor tool in the Anthropic API. It's designed for apps where Claude works with text files. With the new tool, Claude can make targeted edits to specific portions of text. This reduces token consumption and latency, all while increasing accuracy.

85 108 2K 119K 701

Download Image

Nathan Lambert @natolambert

7 months ago

A very exciting day for open-source AI! We're releasing our biggest open source model yet -- OLMo 2 32B -- and it beats the latest GPT 3.5, GPT 4o mini, and leading open weight models like Qwen and Mistral. As usual, all data, weights, code, etc. are available. For a long time,…

51 150 953 98K 348

Download Image

Andrej Karpathy @karpathy

7 months ago

It's 2025 and most content is still written for humans instead of LLMs. 99.9% of attention is about to be LLM attention, not human attention. E.g. 99% of libraries still have docs that basically render to some pretty .html static pages assuming a human will click through them.…

657 1K 13K 1.8M 5K

Thomas Wolf @Thom_Wolf

7 months ago

I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a "compressed 21st century". The "compressed 21st century" comes from Dario's "Machine of Loving Grace" and if you haven’t read it, you probably…

281 508 3K 386K 2K

Alex Dimakis @AlexGDimakis

8 months ago

Discovered a very interesting thing about DeepSeek-R1 and all reasoning models: The wrong answers are much longer while the correct answers are much shorter. Even on the same question, when we re-run the model, it sometimes produces a short (usually correct) answer or a wrong…

145 210 2K 218K 796

Download Image

Alex Dimakis @AlexGDimakis

8 months ago

What if we had the data that DeepSeek-R1 was post-trained on? We announce Open Thoughts, an effort to create such open reasoning datasets. Using our data we trained Open Thinker 7B an open data model with performance very close to DeepSeekR1-7B distill.

Mahesh Sathiamoorthy @madiator

8 months ago

46 291 2K 213K 1K

Download Image

7 25 219 25K 76

Jeff Dean @JeffDean

9 months ago

Personalized educational uses like this are one of the ways that capability advances in AI models will provide broad benefits. The idea that you can get a personalized tutor for any piece of information that knows you and knows how you learn best is going to be powerful! 🎉📚

👩‍💻 Paige Bailey @DynamicWebPaige

9 months ago

21 72 711 144K 822

Download Video

16 76 473 80K 234

Steven Heidel @stevenheidel

9 months ago

these new captchas are getting way too difficult

29 96 2K 77K 79

Download Image

François Chollet @fchollet

10 months ago

@VictorTaelin Just use a LLM bro

5 1 130 16K 4

Jeff Dean @JeffDean

10 months ago

Introducing Gemini 2.0 Flash Thinking, an experimental model that explicitly shows its thoughts. Built on 2.0 Flash’s speed and performance, this model is trained to use thoughts to strengthen its reasoning. And we see promising results when we increase inference time…

127 482 4K 1.5M 896

CJ Zafir @cjzafir

10 months ago

I've built 19 projects with Cursor AI without line a single line of code myself. But, the Truth is Cursor is dumb you don't add detailed docs around your project. You need to build a strong <Context Boundary> around Cursor Here what you can do to improve your Cursor workflow🧵