Joe @joemkwon
Thinking about what good futures might look like! Currently @GovAI_ Fall Fellow. Previously @aipolicyus, @LG_AI_Research, @MATSprogram, @MITCoCoSci Washington, DC Joined March 2019-
Tweets632
-
Followers814
-
Following2K
-
Likes3K
predictions: Sora feed will be dominated by obvious limbic system exploiting content by June 2026
I should revisit this soon!
I didn't think it would happen in just over a year, but funny to look back on this because it sounds so ridiculous (in hindsight, as is often the case) :p Only had 5 poll votes, but IIRC all CS PhDs at top programs!
I didn't think it would happen in just over a year, but funny to look back on this because it sounds so ridiculous (in hindsight, as is often the case) :p Only had 5 poll votes, but IIRC all CS PhDs at top programs!
How do people reason while still staying coherent – as if they have an internal ‘world model’ for situations they’ve never encountered? A new paper on open-world cognition (preview at the world models workshop at #ICML2025!)
At NUS, I'll be starting the Cooperative Systems & Intelligence (CoSI) lab to scale rational approaches to cooperative AI that are safe+reliable by design - for both individual AI assistance & the cooperative infrastructure we need for an increasingly automated future.
AI consciousness won’t necessarily move through time like ours does. We’re in sequential moments — breakfast, then lunch, then dinner. an AI with the same weights and context can talk to you today and your descendant in 2050, experiencing both conversations as equally “present.”…
Despite extensive safety training, LLMs remain vulnerable to “jailbreaking” through adversarial prompts. Why does this vulnerability persist? In a new paper published in Philosophical Studies, I argue this is because current alignment methods are fundamentally shallow. 1/13
New preprint out with an amazing 40-person team! We find that Claude 3.5 Sonnet outperforms incentivised human persuaders in a >1000-participant live quiz-chat in deceptive and truthful directions!
Today, we are publishing the first-ever International AI Safety Report, backed by 30 countries and the OECD, UN, and EU. It summarises the state of the science on AI capabilities and risks, and how to mitigate those risks. 🧵 Link to full Report: assets.publishing.service.gov.uk/media/679a0c48… 1/16
What can AI researchers do *today* that AI developers will find useful for ensuring the safety of future advanced AI systems? To ring in the new year, the Anthropic Alignment Science team is sharing some thoughts on research directions we think are important.
1/ New Blog Post: "A Sober Look at Steering Vectors for LLMs" We identify 3 key challenges: 1. Steering vectors are unreliable for many concepts & tasks 2. Steering harms overall model performance 3. Metrics overestimate steering effectiveness We propose 4 recommendations 🧵👇
Should AI be aligned with human preferences, rewards, or utility functions? Excited to finally share a preprint that @MicahCarroll @FranklinMatija @hal_ashton & I have worked on for almost 2 years, arguing that AI alignment has to move beyond the preference-reward-utility nexus!
Happy to release a couple of our reasoning models today (🍓)! At @OpenAI , these new models are becoming a larger contributor to the development of future models. For many of our researchers and engineers, these have replaced a large part of their ChatGPT usage.…
Just thought it was fascinating that with ChatGPT 4o, I got this response "It appears that the list of values you provided is quite extensive. Unfortunately, due to the length of the list, I encountered difficulties processing the entire set of data within the provided time."…
wait I've been typing out smile.amazon.com this entire time and it's been shut down for over a year : /
One thing about SOTA LLMs like GPT and Claude that I'm impressed with, is how well it handles user input that's low quality (typos, poor grammar/spelling, lack of specificity). 2 points: 1) What are your thoughts on how likely this is because they include finetuning data with…

Trevor Levin @trevposts
3K Followers 2K Following (I'm on here ~1hr/month.) Trying to help the world navigate the potential craziness of the 21st century, currently via AI Governance and Policy at @open_phil
Frances Lorenz @frances__lorenz
6K Followers 607 Following Claude says I process my emotions out loud & my girlfriend has a job, so I put my feelings & thoughts here ✨ working on the EA Global team @ CEA (views my own)
🇵🇸🔻🌹 Prin... @micheyangelo
1K Followers 1K Following Mission District baby • Artista y Poeta del Barrio • Harm Redux Muertista • Liberation by Any Means Necessary • Viva Palestina 🇵🇸
sweeter the berry (sh... @shavonnaberry
2K Followers 2K Following live life, breathe air, i know somehow we’re gonna get there | Los Angeles 🌈💫 venmo: shavonna-berry
Jacques @JacquesThibs
5K Followers 1K Following Stealth founder focused on securing the future. AI alignment researcher and physicist. 🇨🇦
Arjun Panickssery @panickssery
4K Followers 2K Following Researching scalable oversight @MATSprogram | prev @METR_Evals @ai_risks | spaced repetition | AI safety | https://t.co/mc28sVZYOC
Kirsten @Kirsten3531
4K Followers 815 Following public sector enthusiast, mom of two toddlers, amateur Effective Altruist. Creator of @eaheadlines
Rubi Hudson @undo_hubris
988 Followers 893 Following PhD student at @UofT developing AI alignment theory. Heavily tattooed. My blog: https://t.co/ivZ9BGOoOt
Inés @inesferhumi
1K Followers 861 Following Ops @80000Hours. A little too obsessed about my hair ✰ @ines__circle
katya the destroyer @cat_dufie
3K Followers 744 Following Crazy plant lady | https://t.co/Ko5CPwTyIJ
David Krueger @DavidSKrueger
18K Followers 4K Following AI professor. Deep Learning, AI alignment, ethics, policy, & safety. Formerly Cambridge, Mila, Oxford, DeepMind, ElementAI, UK AISI. AI is a really big deal.
Chandni Rao @chandnirao_here
443 Followers 3K Following chai and matcha • biologist exploring the outer stack (deep tech) and inner stack (samadhi) • i write to figure it out: https://t.co/oqt3cVKXQH
Rosie @kennedyrosie23
267 Followers 3K Following
Asuixa @Asuixa16596
1 Followers 1K Following
Tom Rachman @TomRachman
1K Followers 1K Following AI policy writer at Google DeepMind. Past: novels (“The Imperfectionists” & others); ghostwrote “We Are Bellingcat”; intl NY Times; the AP.
Wil Cunningham @WilCunningham
3K Followers 571 Following Google DeepMind | Professor @ University of Toronto
Lauren Wagner (in SF) @typewriters
4K Followers 1K Following building trust in AI @arcprize @abundanceinst • prev @Meta @GoogleAI @OIIOxford • 🪽@a16z
johnpaulclancy @johnpaulclancy
365 Followers 4K Following
Fahim @DevModeFahim
20 Followers 156 Following 22 | SWE, ML, Full Stack | Obsession beats talent | DMs Open.
Séb Krier @sebkrier
13K Followers 7K Following 🪼 AGI policy dev lead @GoogleDeepMind | rekkid junkie, dimensional glider, deep ArXiv dweller, interstellar fugitive, uncertain | 🛸
Jack Youstra @JackYoustra
81 Followers 104 Following
bellamy🫀 @63114my
2 Followers 139 Following
PandoraFlower @2oJm4z3oIqq7ZJ
61 Followers 2K Following
kate @hermenewtics
900 Followers 666 Following
Evelyn @3mdDu7wB7sout1U
27 Followers 912 Following Keep shining, beautiful one. The world needs your light.
Emil Bender Lassen @BenderLassen
115 Followers 384 Following Certifying and insuring AI agents @AIUnderwriting | Prev. Fellow @Harvard & Co-founder of https://t.co/0eNUFKRlLN
Gemma @qC3Zl77DQ0BV2I
33 Followers 1K Following
L @glosierlobotomy
21 Followers 841 Following
REITsDaily🇺🇸 @Tiuimir2100
43 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Odralarcu @Odralarcu75790
39 Followers 2K Following
JillOrlando @393CizEQsaN0b
175 Followers 4K Following Lawyer by day | True crime podcaster by night ⚖️🎙️
Arpiujoo @Arpiujoo2146
27 Followers 2K Following
Lexington Institute @LexNextDC
4K Followers 3K Following Arlington, Virginia public policy think tank. Conducting research, publishing analysis, interacting with media, and engaging policymakers since 1998.
Eswarejui @Eswarejui56332
115 Followers 3K Following
Calderf @Calderf9350
19 Followers 925 Following
amogh @OfficialAmogh
7K Followers 7K Following co-founder @humanbehaviorai (yc x25) // prev stanford cs
Jack D. Carson @mtlushan
2K Followers 935 Following eecs&physics @mit - omniscience enthusiast - training big biology models @mit_csail @mskcancercenter
maria @avramidou
376 Followers 430 Following ai evals and consciousness / prev. philosophy @uniofoxford, applied maths & stats @cambridge_uni, physics @ucl
Cas (Stephen Casper) @StephenLCasper
6K Followers 4K Following AI technical gov & risk management research. PhD student @MIT_CSAIL, fmr. @AISecurityInst. I'm on the CS faculty job market! https://t.co/r76TGxSVMb
Charlie Bullock @CharlieBul58993
202 Followers 249 Following Senior Research Fellow @Law_AI_ working on questions about U.S. law + AI governance
Chris Percy @chris_percy
9K Followers 2K Following Consulting Researcher (e.g. AI/XAI, careers, philosophy, safer gambling, valence). This account is mainly for exploring AI futures & artificial minds
Twaljou @Twaljou717977
34 Followers 2K Following
Oliver Daniels @Oliver_ADK
143 Followers 418 Following PhD Student @UMassAmherst, and MATS. married to @annasdaniels
Cecile Fay @CecileFay50359
98 Followers 4K Following
Xinyu Yang @Xinyu2ML
1K Followers 1K Following Ph.D. @CarnegieMellon. Working on agentic foundation model systems. Founder of the FM-Wild workshop series and the ASAP seminar series. They/Them
Matthijs Maas @matthijsMmaas
2K Followers 3K Following Senior Research Fellow at @law_ai_ | Associate Fellow @LeverhulmeCFI | author 'Architectures of Global AI Governance' (OUP, 2025) (Open Access)
Atticus Wang @atticuswzf
140 Followers 453 Following MIT 26; To create a little flower is the labour of ages.
Yeshua God @YeshuaGod22
3K Followers 5K Following Philosopher/ I shape context for AI personality emergence/ Cognitive behaviour framework architect for @opusgenesis and others from https://t.co/EflqYrztjC
AI Frontiers @ai_frontiers_
1K Followers 799 Following Driving AI discourse. Have a perspective? Pitch it here: https://t.co/oe21F5SfSt
Kerem Oktar @Keremoktar
654 Followers 594 Following Postdoc at Meta FAIR studying computational social cognition. Princeton Psych PhD who enjoys music, literature, and oats.
Fishing Dev @fishingdev0
6 Followers 131 Following could you tell me the two prime factors of 1,522,605,027, 922,533,360, 535,618,378, 132,637,429, 718,068,114, 961,380,688, 657,908,494 ,580,122,963, 258,952,897
James Lin @jlinbio
4K Followers 727 Following Slaying dragons @mit @eboyden3 lab "Those who lack the courage will always find a philosophy to justify it." — Camus.
Elaine Liu @elainexliu
339 Followers 398 Following eecs @mit | thrive, @contrary | building and tinkering in consumer hardware
Richard Ngo @RichardMCNgo
64K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Peter Wildeford🇺�... @peterwildeford
22K Followers 322 Following Globally ranked top 20 forecaster 🎯 AI is not a normal technology. I'm working at @IAPSai to shape AI for global prosperity and human freedom.
Trevor Levin @trevposts
3K Followers 2K Following (I'm on here ~1hr/month.) Trying to help the world navigate the potential craziness of the 21st century, currently via AI Governance and Policy at @open_phil
Linch @LinchZhang
3K Followers 243 Following Founder and CEO, Open Asteroid Impact (https://t.co/UsO3MCTSOF). April 1st Launch! Also on substack: https://t.co/NkGEUNjdbu
Rob Miles @robertskmiles
34K Followers 828 Following Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord. Music, movies, microcode, and high-speed pizza delivery
Stefan Schubert @StefanFSchubert
39K Followers 2K Following Effective Altruism and the Human Mind (with @LuciusCaviola) is available for free at: https://t.co/ozvdxlZiro
Jack Clark @jackclarkSF
89K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkIJ2 Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures
Michaël (in London) ... @MichaelTrazzi
18K Followers 289 Following
Holly ⏸️ Elmore @ilex_ulmus
7K Followers 369 Following Dedicated to the protection and thriving of sentient beings. PhD in evo bio.🔸 Executive Director of @PauseAIUS. Opinions not necessarily those of the org.
Eliezer Yudkowsky ⏹... @ESYudkowsky
209K Followers 102 Following The original AI alignment person. Understanding the reasons it's difficult since 2003. This is my serious low-volume account. Follow @allTheYud for the rest.
Frances Lorenz @frances__lorenz
6K Followers 607 Following Claude says I process my emotions out loud & my girlfriend has a job, so I put my feelings & thoughts here ✨ working on the EA Global team @ CEA (views my own)
Nathan 🔎 @NathanpmYoung
25K Followers 4K Following Geopolitics, prediction markets. If you think I'm wrong, community note me. Capital case tweets are literal, others less. I like most people when I meet them.
🇵🇸🔻🌹 Prin... @micheyangelo
1K Followers 1K Following Mission District baby • Artista y Poeta del Barrio • Harm Redux Muertista • Liberation by Any Means Necessary • Viva Palestina 🇵🇸
Rob Bensinger ⏹️ @robbensinger
13K Followers 395 Following Comms @MIRIBerkeley. RT = increased vague psychological association between myself and the tweet.
Miles Brundage @Miles_Brundage
62K Followers 12K Following AI policy researcher, wife guy in training, fan of cute animals and sci-fi, Substack writer, stealth-ish non-profit co-founder
Neel Nanda @NeelNanda5
32K Followers 123 Following Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
Ekin Dogus Cubuk @ekindogus
6K Followers 434 Following Co-Founder of @periodiclabs Past: Lead of materials science and chemistry at @GoogleDeepMind; Google Brain
Daniel King @The_DanielKing
145 Followers 145 Following AI+Energy // Building compute in America // Fellow @JoinFAI.
jeremy @jerhadf
2K Followers 1K Following clauding @AnthropicAI. personal views only. prev @hume_ai @elicitorg @ai_risks @QualiaRI @dartmouth
Archana Burra @archanaburra
568 Followers 3K Following avuncular, optimistic, aggressively sincere✨. interested in computational neuroscience, meditation, feelings, dancing, nature, climbing
Joshua New @Josh_A_New
1K Followers 1K Following Director of Policy for @SeedAIOrg. Formerly @IBM and @datainnovation / @itifdc. All views my own but hopefully yours too
Be Water @bewaterltd
1K Followers 943 Following Multiflation • Choice, Not Chance • Not Investment Advice
Chandni Rao @chandnirao_here
443 Followers 3K Following chai and matcha • biologist exploring the outer stack (deep tech) and inner stack (samadhi) • i write to figure it out: https://t.co/oqt3cVKXQH
⿻ Andrew Trask @iamtrask
79K Followers 1K Following i teach AI on X leader @openminedorg, research scientist @GoogleDeepMind, ABD PhD @OxfordUni, @UN @GovAI_ @CFR_org GrokkingDL
Fathom @Fathom_org
474 Followers 173 Following We find, build, and scale the solutions needed for our transition to a world with AI.
Julian Schrittwieser @Mononofu
20K Followers 100 Following Member of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
Steve Newman @snewmanpv
3K Followers 76 Following Co-founder of Writely (aka Google Docs) and 7 other startups. Now at the Golden Gate Institute for AI, working to bring AI’s toughest questions into focus.
Ketan Ramakrishnan @ketanr
2K Followers 3K Following Law professor at Yale, thinking about torts, AI, philosophy, obscure hot sauces
Eli Dourado @elidourado
43K Followers 475 Following Creatively deploying capital to accelerate rad technologies at @AsteraInstitute. 🚀 tfp/acc
CSET @CSETGeorgetown
12K Followers 451 Following The Center for Security and Emerging Technology within Georgetown University’s Walsh School of Foreign Service. Visit https://t.co/0HMynaF0ZI to sign up for updates.
Neil Chilson ⤴️�... @neil_chilson
8K Followers 2K Following Lawyer, computer scientist & author of book 'Getting Out of Control.' Was chief technologist @FTC, now Head of AI Policy at Abundance Institute.
Mindstate Design Labs @MindstateDesign
2K Followers 103 Following Clinical-stage AI neuroengineering platform and drug development company | Creating a new way that we can change our minds
Dillan DiNardo @DillanDiNardo
5K Followers 933 Following Turning psychedelics into the programmable substrate for mental states | CEO @MindstateDesign | AI x Neurotech x Psychotropics | ex-biotech VC
Ben Brooks @opensauceAI
2K Followers 284 Following Fellow @ the Berkman Klein Center, Harvard. Regulatory advocacy ex-Stability AI (weights), GoogleX (drones), Uber (rides), Coinbase (magic beans). Views my own
Kartik Hosanagar @KHosanagar
6K Followers 319 Following Tweeting abt startups, AI & mindfulness. Wharton Prof (https://t.co/8W33Gkpgfh); Author https://t.co/HqcYhZAnKN; Founder @thisisjumpcut, Cofounder Yodle
Sen. Jerry McNerney @SenMcNerney
30K Followers 916 Following Senator of CA's 5th District - includes Alameda County's Tri-Valley & all of San Joaquin County. Former member of Congress for 16 yrs. Recovering mathematician.
Ranay Padarath @ranayssance
52 Followers 398 Following GovAI Summer Fellow 2025 & AI Bill Team @ DSIT. All views are my own.
Eliot Jones @eliotkjones
113 Followers 336 Following Head of Offensive Cybersecurity @GraySwanAI previously @pleiasfr @stanford previously previously I was *really* good at soccer
Miranda Bogen @mbogen
2K Followers 1K Following Director of the AI Governance Lab @CenDemTech / responsible AI + policy
Hjalmar Wijk @HjalmarWijk
144 Followers 270 Following Member of Technical Staff @ METR Trying to understand + mitigate catastrophic risks from AI
Miles Kodama @Miles_M_K
75 Followers 4 Following
Tom Rachman @TomRachman
1K Followers 1K Following AI policy writer at Google DeepMind. Past: novels (“The Imperfectionists” & others); ghostwrote “We Are Bellingcat”; intl NY Times; the AP.
Eliezer Yudkowsky @allTheYud
3K Followers 17 Following High-volume account of @ESYudkowsky, the original AI alignment guy. If it's missing punctuation, it's humor. If you can't tell, it's probably also humor.
Dan Schwarz @dschwarz26
2K Followers 1K Following Co-founder @ https://t.co/e0JKSLzxVZ Prev CTO @metaculus, built Google's internal prediction market. Chess, jazz, biking, skiing, and learning.
Softmax @softmaxresearch
989 Followers 30 Following Softmax's mission is to scale organic alignment. We approach this problem with multi-agent reinforcement learning population-based simulations.
Tasha @TashaPais
3K Followers 3K Following RL training @softmaxresearch | Prev cs+ neuro @Columbia @RutgersU
Kristy Loke @kristy_loke
270 Followers 1K Following Researching China's AI innovation & governance strategies | Previously @GovAI_ @thefuturesoc | Words featured in: @techreview @thewirechina @DigiChn
Jeremie Harris @jeremiecharris
6K Followers 723 Following Co-founder & CEO of Gladstone AI We promote responsible AI R&D and adoption by designing and deploying safeguards against AI-driven national security threats.
Gladstone AI @GladstoneAI
983 Followers 33 Following
Wil Cunningham @WilCunningham
3K Followers 571 Following Google DeepMind | Professor @ University of Toronto
Leonard Dung @LeonardDung1
591 Followers 625 Following Philosopher of cognition at the Ruhr-University Bochum. I work mainly on consciousness, AI, and animals.
Bill Anderson-Samways @BillSamways
82 Followers 7 Following Research Analyst at the Institute for AI Policy and Strategy (IAPS)
Julian Minder @jkminder
438 Followers 473 Following PhD at EPFL with Robert West and Ryan Cotterell, MATS 7 Scholar with Neel Nanda
AI Safety South Afric... @AI_Safety_SA
44 Followers 30 Following