- Tweets: 189
- Followers: 2K
- Following: 593
- Likes: 2K
Cursor can now control your browser. Agent can take screenshots, improve UI, and debug client issues. Try our early preview with Sonnet 4.5.
We evaluated Anthropic's Sonnet 4.5 with our minimal agent. New record on SWE-bench verified: 70.6%! Same price/token as Sonnet 4, but takes more steps, ending up being more expensive. Cost analysis details & link to full trajectories in 🧵
yolo run summer is over scaling laws fall has arrived
🔍 How do we teach an LLM to 𝘮𝘢𝘴𝘵𝘦𝘳 a body of knowledge? In new work with @AIatMeta, we propose Active Reading 📙: a way for models to teach themselves new things by self-studying their training data. Results: * 𝟔𝟔% on SimpleQA w/ an 8B model by studying the wikipedia…
MoE layers can be really slow. When training our coding models @cursor_ai, they ate up 27–53% of training time. So we completely rebuilt it at the kernel level and transitioned to MXFP8. The result: 3.5x faster MoE layer and 1.5x end-to-end training speedup. We believe our…
Presenting two posters at ICML over the next two days: - Both at 11am - 1:30pm - Both about how to improve pre-training with domains - Both at stall # E-2600 in East Exhibition Hall A-B (!) Tomorrow: WebOrganizer w/ @soldni & @kylelostat Thursday: MeCo by @gaotianyu1350
Tokenization is just a special case of "chunking" - building low-level data into high-level abstractions - which is in turn fundamental to intelligence. Our new architecture, which enables hierarchical *dynamic chunking*, is not only tokenizer-free, but simply scales better. https://t.co/SsZloRQR24
Anthropic staff realized they could ask Claude to buy things that weren’t just food & drink. After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) “specialty metal items” that it ended up selling at a loss.
New paper cutting through the thicket of KV cache eviction methods!
Can GPT, Claude, and Gemini play video games like Zelda, Civ, and Doom II? 𝗩𝗶𝗱𝗲𝗼𝗚𝗮𝗺𝗲𝗕𝗲𝗻𝗰𝗵 evaluates VLMs on Game Boy & MS-DOS games given only raw screen input, just like how a human would play. The best model (Gemini) completes just 0.48% of the benchmark! 🧵👇
Claude Sonnet 4 is much better at codebase understanding. Paired with recent improvements in Cursor, it's SOTA on large codebases
Massive gains with Sonnet 4 on SWE-agent: Single-attempt pass@1 rises to 69% on SWE-bench Verified! Sonnet 4 iterates longer (making it slightly more expensive) but almost never gets stuck. Localization ability appears unchanged, but quality of edits improves.
Great results from the Claude team- the 80% result is pass@1!! They ran the model in parallel multiple times and had an LM judge pick the best patch to submit.
Big arrow time! We can make huge progress on open-source SWE agents by scaling up the creation of virtual coding environments 🚀
Cursor is now free for students. Enjoy!
Introducing COMPACT: COMPositional Atomic-to-complex Visual Capability Tuning, a data-efficient approach to improve multimodal models on complex visual tasks without scaling data volume. 📦 arxiv.org/abs/2504.21850 1/10
@ weekend warriors - DM me a GitHub repo that you like / maintain, and I'll train you a 7B coding agent that's an expert for that repo. Main constraints - it's predominantly Python, and has a testing suite w/ good coverage. (example of good repo = sympy, pandas, sqlfluff)
Training with more data = better LLMs, right? 🚨 False! Scaling language models by adding more pre-training data can decrease your performance after post-training! Introducing "catastrophic overtraining." 🥁🧵+arXiv 👇 1/9
We created SuperBPE🚀, a *superword* tokenizer that includes tokens spanning multiple words. When pretraining at 8B scale, SuperBPE models consistently outperform the BPE baseline on 30 downstream tasks (+8% MMLU), while also being 27% more efficient at inference time.🧵
Want state-of-the-art data curation, data poisoning & more? Just do gradient descent! w/ @andrew_ilyas Ben Chen @axel_s_feldmann @wsmoses @aleks_madry: we show how to optimize final model loss wrt any continuous variable. Key idea: Metagradients (grads through model training)

Asaf Gilboa @GilboaamirAsaf
394 Followers 978 Following @Grappaxyz - Building a new trust layer for the internet
paul @paul_okewunmi
1K Followers 4K Following ML/AI Engineer | MLH Fellow'23 @ Meta | Drone Hobbyist
Ann Miura-Ko 🦖 @annimaniac
42K Followers 2K Following VC 🥷🏼 in pre-seed and seed. Yale, Stanford (PhD - Cybersecurity and Math Modeling), Mayfield Fellows, Foodie, Mom to 3 rascals and 🐶
Jeremie Tavares @jeremie_tavares
10 Followers 296 Following
Christiane @Odeacui4405
36 Followers 2K Following You don’t have to play the game the way they wrote it.
Data_team @Data_team89
2 Followers 27 Following Data Team @BlubridgeAI We work towards building SOTA Data Pipeline for optimising LLM Downstream tasks
Sang Michael Xie @sangmichaelxie
3K Followers 741 Following Research Scientist at Meta GenAI / LLaMA. AI + ML + NLP + data. Prev: CS PhD @StanfordAILab @StanfordNLP @Stanford, @GoogleAI Brain/DeepMind
parhamiam @Parhamiamz
302 Followers 916 Following Developer | React & TypeScript enthusiast ⚡ | Curious mind 🚀 | Exploring tech, open-source & personal growth 🌱
rayan @traderayann
4K Followers 6K Following
Post Silentium Vox @PostSilentiumVx
371 Followers 3K Following Ideas, thoughts, bits of wisdom, fun and sarcasm from a worst-selling author of no book at all. Yet. One day, maybe there’ll be enough for one.
builder business @builderbizness
0 Followers 82 Following
nick bradford @n_s_bradford
2K Followers 2K Following building @cursor_ai | prev co-founder @ellipsis_dev (YC W24)
Dhairya Desai @writessoftly
76 Followers 356 Following Software Engineer @aws. Peeling abstractions. Chronically reposting.
Jessy Lin @realJessyLin
3K Followers 887 Following PhD @Berkeley_AI, visiting researcher @AIatMeta. Interactive language agents 🤖 💬
the dark knight @Im_the_reverse1
1 Followers 55 Following
Vadim Liventsev @vadimdotme
80 Followers 1K Following lead hip hop engineer @ https://t.co/y80r2iuMsN
mhmd hani @mhmdoutofkarak
13 Followers 439 Following i'm not that accomplished yet, to have a bio i mean.
Vicente @Vicente2002_01
7 Followers 600 Following
HaMeedo ReFa'i @hameedorefai
8 Followers 187 Following
kevinz000 @kevinz00025511
2 Followers 61 Following
Urban Intent @urbanintent
203 Followers 2K Following Fix zoning code. Build mixed-use. End absolute car dependency. Thanks
ʟɪsᴀ @lisacheng
7K Followers 4K Following 10+ yrs in crypto. Blockchain Architect @ AI Co. Ethereum & Mastercoin alum. 2 exits. Burned, rebuilt, still here.
mycontext_ai @mycontext_ai
2 Followers 36 Following
Dima Sabanin @DmitrySabanin
527 Followers 665 Following CTO at Elara. Making programs that speak human for the benefit of humanity with some of my favorite people. Code archeologist. Complete nerd. A family man.
Cian Vance @vance_cian
1K Followers 2K Following Equipping sales teams with the tools to succeed. | 40+ sales playbooks created.
Shivam Singh @er_shivamsingh0
743 Followers 6K Following Engineer| koinophobic | 22 | AI | GPU POOR | Building Neo clouds https://t.co/qGAknj71kz
rajan agarwal @_rajanagarwal
5K Followers 1K Following RL @amazon agi lab, se @uwaterloo, scholar @neo, prev @trykino @hitachi
typedfemale @typedfemale
39K Followers 537 Following a really exciting new account "advanced pytorch user" - @cHHillee alt: @typedalt
paige finn doherty @paigefinnn
33K Followers 4K Following investing in technical storytellers @behind_genius wrote a children’s book about VC - seed to harvest prev: @workos @northropgrumman
TBPN @tbpn
106K Followers 959 Following Technology's daily show. Hosted by @johncoogan & @jordihays. Streaming live 11a-2p PT every weekday and available on Apple, Spotify, & YouTube.
nick bradford @n_s_bradford
2K Followers 2K Following building @cursor_ai | prev co-founder @ellipsis_dev (YC W24)
Charles 🎉 Frye @charles_irl
15K Followers 3K Following gpu enjoyer at @modal. he/him. ex @full_stack_dl, @weights_biases (acq. @CoreWeave), phd Berkeley @Redwood_Neuro. try https://t.co/SYWVMCazZ3
Jonathan Hayase @JonathanHayase
221 Followers 143 Following 5th year Machine Learning PhD student at UW CSE
Kimura Hinami @hinamin012
35K Followers 530 Following 11/7〜11/16個展「In Light,I Live」 神戸→横浜🇯🇵 PTから写真家へ 雑誌「GENIC」掲載 ,企業タイアップなど NikonZfスペコン,カタログ 仕事依頼はDMかメールからお願いします✉️ 出張いきたいです!
Lichang Chen @LichangChen2
774 Followers 664 Following Context Engineer & Agents | ex GenAI & Science Unit Intern @GoogleDeepmind | PhD @umdcs and BS @ZJU_China
Stuart Sul @stuart_sul
1K Followers 118 Following ml research @cursor_ai, cs @Stanford, mlsys @HazyResearch
Omar Shaikh @oshaikh13
1K Followers 849 Following member of sociotechnical staff @Stanford - previously @GeorgiaTech
Scott Swingle @bio_bootloader
10K Followers 3K Following Father of 3, building Mentat (the github native coding agent!) @AbanteAI, prev @DeepMind
Yifeng Ding @YifengDing_
838 Followers 2K Following CS PhD student @illinoisCDS. Research intern @GoogleResearch. Towards building code LLMs with better reasoning and planning. Prev: @AmazonScience
Jack Cai @jackcai1206
150 Followers 443 Following CS PhD student at Princeton PLI, Research Intern at Microsoft. Previously Masters at UW-Madison. Working towards goal generating long horizon agents.
Sholto Douglas @_sholtodouglas
28K Followers 1K Following Scaling RL @AnthropicAI, ex @DeepMind - working towards intelligence too cheap to meter
Gabriele Berton @gabriberton
7K Followers 1K Following Postdoc @Amazon working on VLM - ex @CarnegieMellon @PoliTOnews @IITalk
Helen Jin @helenj1n
84 Followers 326 Following PhD Student @CIS_Penn @Penn | Intern @awscloud | Trustworthy ML + NLP 🌟 | Previously @CC_Columbia @Columbia
Lindia Tjuatja @lltjuatja
1K Followers 621 Following a natural language processor and “sensible linguist”. PhD-ing @LTIatCMU, often visiting @NYUDataScience, prev BS-ing @UT_linguistics + @utexasece 🤠🤖📖 she/her
Jifan Zhang @jifan_zhang
380 Followers 459 Following Research Fellow @AnthropicAI | Previously Ph.D. @WisconsinCS @WIDiscovery, BS/MS @uwcse, @Meta @Google @Amazon
Kylie Robison @kyliebytes
47K Followers 2K Following take it easy dude, but take it • robison (rah-beh-son) not robinson • signal @ kylie.01 💌 [email protected]
Jacob Buckman @jacobmbuckman
5K Followers 374 Following Founder @manifest__ai. PhD candidate @MILAMontreal. Formerly @jhuclsp, @GoogleAI, @SCSatCMU.
Amirhossein Kazemneja... @a_kazemnejad
2K Followers 585 Following Working on RL training of LLMs @Mila_Quebec. Prev: @mcgillu
Hoyeon Chang @hoyeon_chang
924 Followers 2K Following PhD student at @kaist_ai Language & Knowledge Lab Passionate about understanding intelligent systems Also a jazz pianist
Dylan Sam @dylanjsam
885 Followers 481 Following phd student @mldcmu | past: student researcher @GoogleAI, intern @GraySwanAI @AmazonScience, undergrad @BrownUniversity
Vincent Abbott @vtabbott_
7K Followers 334 Following Maker of *those* diagrams for deep learning algorithms | @mit @mitlids incoming PhD
shira @shiraeis
14K Followers 2K Following ai startup. prev: ai @uchicago @mit @intel @cdcgov and a few other places. I personally think I’m quite funny.
Psyho @FakePsyho
26K Followers 370 Following Game Designer; Problem Solver; past: OpenAI (Dota), Pro Competitive Programmer, Poker
Savvy is 🎃 𓅰 @savvyinwndrland
164 Followers 1K Following mech interp enthusiast, cs+math @harvard '22 "Live simply and do serious things." ~DCH
Ahmad Beirami @abeirami
10K Followers 4K Following sth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
Yoonsang Lee @yoonsang_
240 Followers 626 Following CS PhD @princeton_nlp @princetonPLI; prev @SeoulNatlUni
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Calvin French-Owen @calvinfo
15K Followers 495 Following Making things, trail running. Prev: Codex @OpenAI, https://t.co/4qWGncHOAX, co-founder @Segment, @MIT
Nick Miller @nickwm
2K Followers 1K Following 25+ years building software products. Now building @cursor_ai. DMs open.