Editor, @ReadTransformer. Prev: AI safety and EA comms, journalist @TheEconomist, @Protocol, @finimize
transformernews.ai
@shakeelhashim.com on BlueSky
Joined October 2009
Even if Claude Sonnet 4.5's evaluation awareness is "safe," it points toward a troubling pattern. As models get smarter, it becomes harder to tell whether they're actually aligned, or just on their best behavior.
My latest for @ReadTransformer: transformernews.ai/p/claude-sonne…
22K Followers 321 Following
Globally ranked top 20 forecaster 🎯
AI is not a normal technology. I'm working at @IAPSai to shape AI for global prosperity and human freedom.
10K Followers 322 Following
Official Unofficial EA mascot. I'm here to make friends and maximise utility, and I'm all out of neglected altruistic opportunities
6K Followers 607 Following
Claude says I process my emotions out loud & my girlfriend has a job, so I put my feelings & thoughts here ✨ working on the EA Global team @ CEA (views my own)
7K Followers 369 Following
Dedicated to the protection and thriving of sentient beings. PhD in evo bio.🔸
Executive Director of @PauseAIUS. Opinions not necessarily those of the org.
3K Followers 696 Following
Effective altruism / AI safety / Peaceful music-maker.
AI Events Program Lead at the Centre for Effective Altruism (views my own)
34K Followers 828 Following
Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord.
Music, movies, microcode, and high-speed pizza delivery
27K Followers 26K Following
I just love going on #FilmX, it's the highlight of my day. I can think of nothing better than talking and reading about movies.
587 Followers 7K Following
Trust in the Lord with all your heart, and do not lean on your own understanding. In all your ways acknowledge him, and he will make straight your paths.✝️
15K Followers 5K Following
CEI is a non-profit public policy organization dedicated to advancing the principles of limited government, free enterprise, and individual liberty.
26K Followers 25K Following
Tech VC and entrepreneur. Curious. Investing and building in AI. Built companies in media and tech. Founder @frontiervc. Learned things @Harvard, @Stanford
37K Followers 37K Following
Humankind has not woven the web of life. We are but one thread within it. Whatever we do the web, we do to ourselves.
Chief Seattle (1782 ― June 7 1866) 🏳️‍🌈
1K Followers 1K Following
internet explorer @ Open Tabs 📩 ~ founding editor DF magazine ~ views my own ~ “she’s got a good head on her shoulders” - my friend’s dad ~ ex @wireduk
4K Followers 3K Following
Keynote speaker | Writer | Strategist | Currently charting the impacts of generative AI on human-made media. Subscribe: https://t.co/8TjODWML3F
26K Followers 211 Following
Working towards the safe development of AI for the benefit of all @UMontreal, @LawZero_ & @Mila_Quebec
A.M. Turing Award Recipient and most-cited AI researcher.
4.4M Followers 3 Following
OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
5K Followers 1K Following
contributing editor at Vanity Fair, mostly writing about tech. bad at social media, good at long phone calls
unsure how to use this website
127 Followers 439 Following
FL ➡️ DC | coalitions director for @americans4ri | previously @progresschamber & @deweysquare | 2x @floridastate alum | all views are my own & rt ≠ endorsement
5K Followers 4K Following
Writer, editor and journalist | 📩 UK 2.0 newsletter about tech policy and growth | 📨 Sorry We're Prosed newsletter about writing | Header by @dancoxdesign
27 Followers 220 Following
Rhode Island raised, Brussels-based freelance reporter. Words @politico @nytimes @NBCNews. Read my newsletter at https://t.co/1uIEdxdidf.
427 Followers 51 Following
trying to see through context windows
currently: agent security lead @ U.S. Center for AI Standards and Innovation
past: science of deep learning phd @ harvard
3K Followers 501 Following
Excels at reasoning & tool use🪄 Tensor-enjoyer 🧪 @METR_Evals. My COI policy is available under “Disclosures” at https://t.co/bihrMIUKJq
5K Followers 1K Following
wealth and power reporter @sfstandard. previously @thedailybeast @Independent. words in @thecut @guardian. email: [email protected]
246 Followers 922 Following
PhD-ing @uniofoxford researching LLM explainability and interpretability + doing some evals work along the way | Applied AI @The_IGC | Prev @Cambridge_Uni
146 Followers 353 Following
Deputy Director, Research Unit, UK AISI. Generalist who enjoys getting difficult things done and trying to make the world less bad. Views mine, rt!=endorse