Princeton NLP Group @princeton_nlp
Princeton NLP Group led by @prfsanjeevarora @danqi_chen @karthik_r_n nlp.cs.princeton.edu Princeton, NJ Joined August 2020
258 Tweets 5K Followers 62 Following 284 Likes
AlgoTune is a benchmark that penalizes expensive models, since we give each model a budget of $1 to solve each task. Cool to see open weight models doing well! x.com/ori_press/stat…
What happens if you compare LMs on SWE-bench without the fancy scaffolds? Our new leaderboard “SWE-bench (bash only)” shows you which LMs are the best at getting the job done with just bash. More on why this is important 👇
Shoutout to all the @Princeton researchers participating in @icmlconf #ICML2025 Browse through some of the cutting edge research from AI Lab students, post-docs and faculty being presented this year: pli.princeton.edu/blog/2025/prin…
As we optimize model reasoning over verifiable objectives, how does this affect human understanding of said reasoning to achieve superior collaborative outcomes? In our new preprint, we investigate human-centric model reasoning for knowledge transfer 🧵:
Improved reasoning increases performance on benchmarks, but are models able to pass their knowledge onto humans? 🧐 We evaluate models’ communication abilities in teaching novel solutions to users! See our new paper!
Introducing SWE-bench Multilingual: a new eval in the SWE-bench family to test LLM coding abilities in *9* programming languages, fully integrated with SB so it can plug into existing workflows. Claude 3.7 gets 43% on SB Multilingual vs 63% on SB Verified, a 20 pt drop!🧵
Join us on May 21st- I'll talk about how we built SWE-bench & SWE-agent and what I'm excited about for the future of autonomous AI systems.
Our warmest congratulations to @danqi_chen, @stanfordnlp grad and now Associate Professor at @PrincetonCS and Associate Director of @PrincetonPLI on her stunning @iclr_conf keynote!
Claude can play Pokemon, but can it play DOOM? With a simple agent, we let VLMs play it, and found Sonnet 3.7 to get the furthest, finding the blue room! Our VideoGameBench (twenty games from the 90s) and agent are open source so you can try it yourself now --> 🧵
Can language models effectively impersonate you to family and friends? We find that they can: 44% of the time, close friends and family mis-identify Llama-3.1-8b as human… 🧵👇
Congrats on the Verified and Multimodal SWE-bench numbers. venturebeat.com/ai/zencoders-c…
We just updated the SWE-bench Multimodal leaderboard. Congrats to Globant, Zencoder, and the Agentless team from UIUC for their strong results.
🤔 Ever wondered how prevalent some type of web content is during LM pre-training? In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐 Key takeaway: domains help us curate better pre-training data! 🧵/N
This Tuesday (Feb 18), @_carlosejimenez will discuss SWE-bench and the future of codegen evals, as part of the Conference on Synthetic Software in NYC. @KLieret will also be there. RSVP: lu.ma/k2q27yi3
SWE-agent 1.0 is the open-source SOTA on SWE-bench Lite! Tons of new features: massively parallel runs; cloud-based deployment; extensive configurability with tool bundles; new command line interface & utilities.
🚀 Introducing Goedel-Prover: A 7B LLM achieving SOTA open-source performance in automated theorem proving! 🔥 ✅ Improving +7% over previous open source SOTA on miniF2F 🏆 Ranking 1st on the PutnamBench Leaderboard 🤖 Solving 1.9X total problems compared to prior works on Lean…
Congrats to o3-mini on setting a new high score on SciCode!! R1 clocks in at an impressive 4.6%, matching Claude 3.5. SciCode is our super-tough programming benchmark written by PhDs in various scientific domains.
SciCode is our super tough coding benchmark testing the abilities of LMs to program code based on research in physics/biology/material science/... o1 is the SoTA with 7%. To make it easier to use we're putting it into the Inspect AI format, as a few groups were asking for this.
Congrats to the DeepSeek team on the impressive SWE-bench results!

(((ل()(ل() 'yoav)))... @yoavgo
66K Followers 2K Following
Delip Rao e/σ @deliprao
62K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Bill Yuchen Lin @billyuchenlin
24K Followers 3K Following Grok Code @xAI. Ex: Affiliate Assistant Prof @UW, Research Scientist @allen_ai, Google AI, Meta FAIR.
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Sewon Min @sewon__min
14K Followers 819 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Sebastian Ruder @ ACL @seb_ruder
93K Followers 1K Following Research Scientist @AIatMeta • Ex @Cohere @GoogleDeepMind
Sebastian Gehrmann @sebgehr
6K Followers 2K Following Head of Responsible AI, CTO office, @Bloomberg. (he/him) Formerly LLMs @ Google Brain / Harvard. views my own
Jay Alammar @JayAlammar
46K Followers 1K Following Writer https://t.co/TquuQXlLOJ. O'Reilly Author https://t.co/Fl3uPAZHLg. LLM Builder @Cohere. Visualizing AI one concept at a time.
Ofir Press @OfirPress
15K Followers 7K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Mark Dredze @mdredze
6K Followers 782 Following John C Malone Professor at @JohnsHopkins @JHUCompSci @jhuclsp @jhumceh; Part time @techatbloomberg (tweets my own) @mdredze.bsky.social🦋
Vivek Gupta @keviv9
3K Followers 5K Following Assistant Professor @SCAI_ASU; PostDoc @cogcomp @Penn, ed-@UUtah,@iitkanpur. @Bloomberg @MSFTResearch Fellow; ex-@MetaAI @IBM @Verisk @samsungresearch @Synopsys
Shaily @shaily99
7K Followers 2K Following PhD @LTIatCMU. Prev: @allen_ai @GoogleAI @MSFTResearch. #NLProc. Often ranting about research.
rishi @RishiBommasani
6K Followers 2K Following Societal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
Greg Durrett @gregd_nlp
8K Followers 892 Following Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
Weijia Shi @WeijiaShi2
9K Followers 1K Following PhD student @uwnlp @allen_ai | Prev @MetaAI @CS_UCLA | 🏠 https://t.co/Q6Mzg8ow2j
Yonatan Belinkov @boknilev
5K Followers 1K Following Assistant professor of computer science @TechnionLive; visiting scholar @KempnerInst 2025-2026.
king @bmwforever23
14 Followers 799 Following
fufu Chen @fufuChen0728
2 Followers 19 Following
Anthony @Antho7311
263 Followers 7K Following
Yixin Ye @BLeavesYe
502 Followers 175 Following Undergrad @sjtu1896. Intern @ GAIR Lab (https://t.co/QWViO83puG) Visiting @stanfordnlp. NLP/LLMs/Reasoning. Looking for a Ph.D. in the 26 fall.
Kai Zhang @KaiZhang_CS
101 Followers 524 Following CS PhD @OSUNLP with @ysu_nlp. Prev @AIatMeta @MSFTResearch @GoogleDeepMind. my former account @DrogoKhal4 was wrongly suspended...
zikai Xiao @ZikaiXiao
6 Followers 70 Following
Cheng Wang @WangCheng_0116
26 Followers 477 Following CS Undergrad @NUSingapore Focusing on Trustworthy ML/LLM, Reasoning Models and Agents Seeking 26 Fall PhD positions on above topics
Ysimal han @ysimalhan
37 Followers 226 Following
Jalal Naghiyev @jalalnaghiyev06
16 Followers 843 Following
ashish chadha @ashishc63669010
0 Followers 388 Following Proud Bharatiya 🇮🇳 AI enthusiast IIT Guwahati
Yoshida Bizu @BizuYoshida
2 Followers 55 Following
Sashimix @Vrai41647204
3 Followers 226 Following PhD in AI | Freelance ML/Data Science engineer | Building and sharing hands-on AI tools, insights, and entrepreneurial experiments.
Zhiyu Yang @zhiyu_yang1683
1 Followers 47 Following
Fangcong Yin @fangcong_y10593
276 Followers 679 Following CS PhD Student @UTAustin studying NLP. Prev: @CornellCIS
Supreet Sahu @supreet_sahu
19 Followers 872 Following IIT Kharagpur @IITKgp '26 | 4th Year Undergrad @ ECE( Dual degree spl- Vision & Intelligent Systems) | AI/ML/DL/Computer Vision | Also on X : @SupreetSahu
Giosuè Baggio @giosuebaggio
2K Followers 2K Following Cognitive scientist @NTNU · Author of ‘Meaning in the Brain’ and ‘Neurolinguistics’ @mitpress · “(…) ita res accendent lumina rebus.” (Lucretius)
Gin.AI @ginbitcoin
533 Followers 2K Following Craftsman, Build&Sell 👩💻https://t.co/3BQxr054GI 🎵https://t.co/V64qINIM9D 🌍https://t.co/ldnu88Yksc Run, don't walk, if you don't jump, your perish will never end. Be kind; everyone is fighting a hard battle in life
Shiyue Zhang @byryuer
3K Followers 1K Following Research Engineer @TechAtBloomberg | ex PhD student at UNC-Chapel Hill (@unccs @uncnlp) | Bloomberg PhD Fellow | Past Intern at @MetaAI @MSFTResearch | #NLProc
Ahlam @AhElouOfficial
0 Followers 10 Following
Chaoyue He @CYH37
357 Followers 4K Following AI Research Scientist@NTUsg 🇸🇬|LLM|Sustainability|GenRecSys|AGI|Productivity|UBI & Fortune|Disease Cures|Xi'an, China🇨🇳|Bodybuilder💪|Caregiver🤲❤
Wei Xia @ericwxia
37 Followers 153 Following
Xiaoyue Xu @xiaoyue02_xu
23 Followers 213 Following CS undergrad @Tsinghua_Uni | Seeking 25 fall PhD position in nlp | 🔗 https://t.co/yQea7xlnJ9
PKU WHZhang @PKUBrian
0 Followers 46 Following PhD candidate at Peking University @PKU1898. Focusing on LLMs and Autonomous Agents.
smile @Smilex_P
230 Followers 6K Following
云创兽Ai @Frawal7909
0 Followers 112 Following 🌟 focusing on dividend stocks lover, independent girl! open to insights. DM me about economic cycles! 📊 #Nasdaq #Stocks
Goddy Snow @GoddySnow62732
4 Followers 142 Following
Liu He (Helium) @Heliummn
2 Followers 97 Following PhD Applicant (Fall '26) in CSS | Social NLP|SpeechLLM|XAI | MS @StudyatUSTC | B.E @HIT_1920, prev intern. @Baidu_Inc’s ERNIE Bot 🤖🗣️👥
Patang Maja @MajaPatang
77 Followers 1K Following
아아 @aa164919269577
0 Followers 83 Following
Jason @Jason27627351
8 Followers 322 Following
RuthBunyan @fg0eWs80A58wh2D
283 Followers 5K Following
Nafise Sadat Moosavi @NafiseSadat
473 Followers 392 Following Lecturer (~Assistant Prof.) in NLP @SheffieldNLP @shefcompsci, Muslim Iranian woman إنا على العهد
Atharva Mehta @atharva20038
0 Followers 26 Following
Junyi Zhang @Levi_JYZhang
6 Followers 61 Following UCLA Master, PLUS Lab, advised by Prof. Violet Peng
EMNLP 2025 @emnlpmeeting
15K Followers 51 Following EMNLP 2025 - The 2025 Conference on Empirical Methods in Natural Language Processing, 2025 Hashtag: #EMNLP2025 Dates: November 5-9 Submission Deadline: May 19th
Stanford NLP Group @stanfordnlp
172K Followers 295 Following Computational Linguists—Natural Language—Machine Learning @chrmanning @jurafsky @percyliang @ChrisGPotts @tatsu_hashimoto @MonicaSLam @Diyi_Yang @StanfordAILab
Ofir Press @OfirPress
15K Followers 7K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
ACL 2025 @aclmeeting
22K Followers 52 Following Association for Computational Linguistics | ACL 2025 conference | The 63rd Annual Meeting of the ACL Hashtags: #NLProc #ACL2025NLP
typedfemale @typedfemale
39K Followers 537 Following a really exciting new account "advanced pytorch user" - @cHHillee alt: @typedalt
Princeton Laboratory ... @PrincetonAInews
1K Followers 68 Following The Princeton Laboratory for Artificial Intelligence supports and expands the scope of AI research at Princeton.
Yoonsang Lee @yoonsang_
240 Followers 626 Following CS PhD @princeton_nlp @princetonPLI; prev @SeoulNatlUni
Yong Lin @Yong18850571
755 Followers 225 Following Postdoc Fellow @PrincetonPLI @Princeton. Co-leading the Goedel-Prover project. Apple AI/ML PhD Fellow 2023.
Adithya Bhaskar @AdithyaNLP
345 Followers 338 Following Third year CS PhD candidate at Princeton University (@princeton_nlp @PrincetonPLI), previously CS undergrad at IIT Bombay
Luxi (Lucy) He @LuxiHeLucy
1K Followers 440 Following Princeton CS PhD @PrincetonPLI. Previously @Harvard ‘23 CS & Math.
Kilian Lieret @KLieret
899 Followers 41 Following Research Software Engineer at Princeton University. AI agents & benchmarks for software engineering.
Howard Yen @HowardYen1
240 Followers 240 Following
Tri Dao @tri_dao
33K Followers 632 Following Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.
Zirui "Colin" Wang @zwcolin
1K Followers 575 Following CS PhD Student @Berkeley_AI and @BerkeleySky. Prev. MS @Princeton_NLP, BS @HDSIUCSD and @CogSciUCSD; '25 @SiebelScholars; I work on multimodal models; He/Him.
Princeton PLI @PrincetonPLI
2K Followers 32 Following Princeton University initiative enhancing fundamental understanding of AI, enabling its use in academic disciplines, and examining AI's societal implications.
Ellen Zhong @ZhongingAlong
8K Followers 883 Following Assistant Professor @PrincetonCS. #ai4science #proteins247 #cryoem #cryodrgn ❄️🐉 Prev: @MIT @DeepMind @DEShawResearch. Currently moonlighting @generate_biomed.
John Yang @jyangballin
4K Followers 798 Following 🌲 CS PhD @Stanford 🤖 SWE-bench + agent + smith 🎓 Prev. @princeton_nlp 🐯; @Berkeley_EECS 🐻
Vishvak Murahari @VishvakM
472 Followers 225 Following NLP + ML Ph.D. candidate @princeton_nlp Ex. @Google @allen_ai @Microsoft
Princeton University @Princeton
562K Followers 1K Following The official account of Princeton University. In the Nation’s Service and the Service of Humanity.
Princeton Engineering @EPrinceton
10K Followers 2K Following Princeton University School of Engineering and Applied Science. Engineering in the service of humanity.
Princeton Computer Sc... @PrincetonCS
6K Followers 195 Following The Department of Computer Science at Princeton University
Tianyu Gao @gaotianyu1350
5K Followers 914 Following CS PhD student @Princeton @Princeton_nlp @PrincetonPLI working on language models. Previously: @Tsinghua_Uni @TsinghuaNLP
Zexuan Zhong @ZexuanZhong
3K Followers 703 Following @xAI post-trained Grok 3&4; scaling up RL for Grok-next | prev @PrincetonCS
"Tony" Runzhe Yang @RunzheYang
231 Followers 235 Following Machine Learning & Computational Neuroscience Ph.D. @PrincetonCS @PrincetonNeuro
Yangsibo Huang @YangsiboHuang
4K Followers 701 Following research scientist @googledeepmind. gemini thinking & coding. phd @princeton. opinions are my own.
AmsterdamNLP @AmsterdamNLP
4K Followers 331 Following Tweeting about NLP research, events and opportunities in Amsterdam -- run by @wzuidema and others.
Institute for Advance... @the_IAS
27K Followers 383 Following Latest news, research, and campus updates from one of the world's leading centers for theoretical research and intellectual inquiry.
Sanjeev Arora @prfsanjeevarora
25K Followers 100 Following Director, @PrincetonPLI and Professor @PrincetonCS. Seeks math/conceptual understanding of deep learning and large AI models. Also on the "other" social network
UMD Department of Com... @umdcs
6K Followers 90 Following Official feed for the @UofMaryland's Department of Computer Science housed in the @iribecenter.
Howard Chen @__howardchen
1K Followers 1K Following PhDing @PrincetonPLI. Machine memory / control / agency.
Language Technologies... @LTIatCMU
12K Followers 238 Following The Language Technologies Institute in Carnegie Mellon University's @SCSatCMU
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw
CopeNLU @CopeNLU
4K Followers 313 Following University of Copenhagen Natural Language Understanding research group, led by @IAugenstein #NLProc #ML #dlearn Funded by @ERC_Research @DFF_raad @VILLUMFONDEN
Machine Learning at G... @mlatgt
7K Followers 431 Following The Machine Learning Center at Georgia Tech (ML@GT) is an interdisciplinary research center that trains the next generation of #machinelearning & #AI pioneers.
CambridgeNLP @cambridgenlp
9K Followers 200 Following The Natural Language Processing Group @Cambridge_Uni, Computer Science department #NLProc #ML. Account managed by @Eric_chamoun, @richarddm1, @pietro_lesci.
EdinburghNLP @EdinburghNLP
13K Followers 159 Following The Natural Language Processing Group at the University of Edinburgh.
WiML @WiMLworkshop
18K Followers 1K Following Women in Machine Learning organization. Maintains a list of women in ML. Profiles the research of women in ML. Annual workshop and other events.
CILVR @CILVRatNYU
2K Followers 13 Following CILVR at NYU https://t.co/PbvGtsBGvR CILVR Blog https://t.co/fyHd5zS3w2
USC NLP @nlp_usc
4K Followers 361 Following The NLP group at @USCViterbi. @DaniYogatama+@_jessethomason_+@jieyuzhao11+@robinomial+@swabhz+@xiangrenNLP at @CSatUSC + researchers @USC_ICT, @USC_ISI.
Griffiths Computation... @cocosci_lab
6K Followers 134 Following Tom Griffiths' Computational Cognitive Science Lab. Studying the computational problems human minds have to solve.