Geoffrey Irving @geoffreyirving

Research Director at the UK AI Safety Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc. @[email protected] naml.us/blog London Joined September 2009

Tweets

3K
Followers

8K
Following

259
Likes

11K

Geoffrey Irving @geoffreyirving

4 weeks ago

What’s the best class of O(1)-parameterized distributions that decently model *trained* weight matrices in neural networks (either generally or for transformers specifically)?

1 0 9 2K 4

Geoffrey Irving @geoffreyirving

4 weeks ago

Lovely news to get on the morning of my first day at the UK AI Safety Institute. :)

Ian Hogarth @soundboy

4 weeks ago

Lovely news to get on the morning of my first day at the UK AI Safety Institute. :)

11 41 260 27K 42

Download Image

2 5 95 7K 7

Mech interp has been very successful in tiny models, but does it scale? …Kinda! Our new @GoogleDeepMind paper studies how Chinchilla70B can do multiple-choice Qs, focusing on picking the correct letter. Small model techniques mostly work but it's messy!🧵arxiv.org/abs/2307.09458

3 43 225 72K 99

Download Image

Geoffrey Irving @geoffreyirving

a year ago

I’m surprised people need to hear this, but: If you’re considering whether to join a company, you should not sign a statement preventing you from talking to people with concerns about that company.

2 0 31 8K 2

Geoffrey Irving @geoffreyirving

a year ago

Part of AI alignment is picking tasks on which, if you do really well, the outcome is good.

1 0 35 0 3

Geoffrey Irving @geoffreyirving

a year ago

What is the Mastodon instance that is simultaneously canonical for ML, EA, and Math Twitter?

0 0 4 0 3

Geoffrey Irving @geoffreyirving

a year ago

Something related to one of @littmath's tweets.

0 0 0 0 0

Richard Ngo @RichardMCNgo

35K Followers 1K Following What would we need to understand in order to design an amazing future? Figuring that out @openai

We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Google DeepMind @GoogleDeepMind

944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Miles Brundage @Miles_Brundage

43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.

Michael Nielsen @michael_nielsen

96K Followers 6K Following Searching for the numinous 🇦🇺 🇨🇦, home in 🇺🇸 Research @AsteraInstitute https://t.co/maezekzRUb

Philosopher & ethicist teaching models to be good @AnthropicAI.
Personal account. All opinions come from my training data.

Amanda Askell @AmandaAskell

26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.

Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord.

Music, movies, microcode, and high-speed pizza delivery

Rob Miles (✈️ Tok.. @robertskmiles

18K Followers 790 Following Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord. Music, movies, microcode, and high-speed pizza delivery

Rob Bensinger ⏹️ @robbensinger

8K Followers 302 Following Comms @MIRIBerkeley. RT = increased vague psychological association between myself and the tweet.

@AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures

Jack Clark @jackclarkSF

68K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures

Julian @mealreplacer

16K Followers 1K Following AI safety

Stefan Schubert @StefanFSchubert

28K Followers 2K Following Philosophy, psychology, and effective altruism.

Jan Leike @janleike

44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.

Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!

Neel Nanda @NeelNanda5

13K Followers 89 Following Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!

Nathan 🔍 @NathanpmYoung

15K Followers 3K Following Will bet $10 on any statement I make.

David Krueger @DavidSKrueger

13K Followers 4K Following Cambridge faculty - AI alignment, deep learning, and existential safety. Formerly Mila, FHI, DeepMind, ElementAI, AISI.

Senior writer at Vox's Future Perfect. kelsey.piper@vox.com

Kelsey Piper @KelseyTuoc

27K Followers 544 Following Senior writer at Vox's Future Perfect. [email protected]

Robert Long @rgblong

6K Followers 975 Following AI consciousness

Hanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁

prev: @open_phil @googlebrain @openai (@microcovid)

Catherine Olsson @catherineols

15K Followers 1K Following Hanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁 prev: @open_phil @googlebrain @openai (@microcovid)

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Pro forecaster w/ good track record. Seeking to understand + manage risks from advanced AI systems.

- Co-CEO @RethinkPriors
- Chief Advisory Executive @iapsAI

Peter Wildeford @peterwildeford

10K Followers 367 Following Pro forecaster w/ good track record. Seeking to understand + manage risks from advanced AI systems. - Co-CEO @RethinkPriors - Chief Advisory Executive @iapsAI

Wojciech Zaremba @woj_zaremba

79K Followers 192 Following Co-Founder of OpenAI

kenshin_the_great @KenshinThe1337

0 Followers 88 Following

Paylz @paylza

144 Followers 2K Following The best online market for digital downloads with best prices.

Joe Skinner @joecskinner

5 Followers 57 Following

younghoax @younghoax20

458 Followers 7K Following Doctor, stocks, crypto,AI

Make @LearnAnything_

Learn in public: https://t.co/GbFvuErkYn

macOS course: https://t.co/JdbJWru6zG

https://t.co/94R8ER7K2h
https://t.co/ROkqhyhpEK

Nikita @nikitavoloboev

4K Followers 7K Following Make @LearnAnything_ Learn in public: https://t.co/GbFvuErkYn macOS course: https://t.co/JdbJWru6zG https://t.co/94R8ER7K2h https://t.co/ROkqhyhpEK

Jake VFX @VFXNaturePhotog

7 Followers 49 Following VFX, Photography, Disabled 🏳️‍🌈

AlphOmega @AlphOmegaTk

40 Followers 636 Following Πolitics / Crypto / Programming / Gaming / Whatevs

Karolina Stanczak @karstanczak

515 Followers 446 Following NLP & ML PhD candidate @uni_copenhagen @CopeNLU

✾ @acity_cap

2 Followers 118 Following

Alexandru Tifrea @alexandrutifrea

169 Followers 322 Following

Rohan Gupta @ggrohdg

0 Followers 80 Following

Matt Clifford @matthewclifford

25K Followers 2K Following Co-founder @join_ef; Chair @ARIA_Research; co-led AI Safety Summit at Bletchley Park

Soroush Ebadian @SoroushEbadian

132 Followers 228 Following In-between Computer Science, Innovation, and Violin.

Cawreo @Cawreo

135 Followers 932 Following Founder/CEO @NexusNets | I code open protocol AI.

Louis Matha @loulouAI0662

4 Followers 45 Following

Ilia @IliaTeimouri

3 Followers 79 Following

Anonymous Founder @anonymfounder

386 Followers 7K Following My startup diary. From startups to marketing, finance to entrepreneurship........and cryptocurrency.

aj @awsedrftaj

10 Followers 108 Following

Co-Survivor • Business Development Manager • Battalion Chief of EMS (Retired) • Aspiring Screenwriter • Citizen of U.S., Canada, Ireland • ECGs • YouTube 👇🏻

Tom Bouthillet 🇺�.. @tbouthillet

8K Followers 4K Following Co-Survivor • Business Development Manager • Battalion Chief of EMS (Retired) • Aspiring Screenwriter • Citizen of U.S., Canada, Ireland • ECGs • YouTube 👇🏻

Amanda Cercas Curry @CurriedAmanda

576 Followers 739 Following Postdoc @MilaNLProc | Philosopher of Swift | Cohost of @letschatethics

Samuel Pyeng(GoDeihPi.. @SamuelPyang23

117 Followers 562 Following

Patrick Dillon @mpdillon

20K Followers 5K Following "Daad!" Big fan @jomalleydillon/@jod46. @ObamaWhiteHouse, @Georgetown, @GUPolitics, etc. Texan in DC (El Paso forever). Tweeting is a bad idea but mine alone.

Volodymyr Volkov @lepricon85

21 Followers 416 Following

Horizon Events @HorizonEvents9

11 Followers 270 Following Events consultancy dedicated to advancing R&D in AI safety

Journaliste @LesEchosWeekEnd / ex-correspondante à San Francisco (2016-2020) / amoutot@lesechos.fr

Anaïs Moutot @AnaisMoutot

5K Followers 5K Following Journaliste @LesEchosWeekEnd / ex-correspondante à San Francisco (2016-2020) / [email protected]

We provide meaningful content and connect a thoughtful community of decision-makers to empower smart cities at all stages of growth. Powered by @techconnect360.

Smart Cities Connect @smartcityc

17K Followers 18K Following We provide meaningful content and connect a thoughtful community of decision-makers to empower smart cities at all stages of growth. Powered by @techconnect360.

PhD ML & Comp.Neuro @UniofOxford prev:@UofBristol, prev. @nyuniversity, MSc Applied Maths @EdinburghUni ,BSc CompSci @KingsCollegeLon

Kevin Nejad @kevin_nejad

293 Followers 2K Following PhD ML & Comp.Neuro @UniofOxford prev:@UofBristol, prev. @nyuniversity, MSc Applied Maths @EdinburghUni ,BSc CompSci @KingsCollegeLon

✦✦✦ @not_infinite___

38 Followers 379 Following

Akash Bajwa @AkashBajwa96

2K Followers 1K Following Investing in B2B software & fintech @EarlybirdVC. Writing about SaaS & fintech @ https://t.co/K2Ge60PyQQ.

Vincent Zhang @centzh

17 Followers 690 Following policy enthusiast. recovering CS student. writing (mostly sharing stuff) on tech + society | All views are my own.

Vice President @FPALondon bylines @fattoquotidiano @allthecitizens @espressonline @primaonline @AMINaOdv. Views my own. Pro kemmer. #VatiLeaks

sabrinaprovenzani @sabriprovenzani

925 Followers 2K Following Vice President @FPALondon bylines @fattoquotidiano @allthecitizens @espressonline @primaonline @AMINaOdv. Views my own. Pro kemmer. #VatiLeaks

Droid 42 @droid_no42

0 Followers 114 Following ..-. --- .-. - -.- - - .-- ---

FAITH lN JESUS CHRIST @dabaalwayswinn1

226 Followers 3K Following Evangelist, Researcher and Consultant

Reza Sayar @iamRezaSayar

168 Followers 673 Following 👨🏻‍🎓Life-long Learner👨🏻‍🎓 Kindness❤️, Helpfulness🫂 , AI🧠 & Reggaetón💃🏻

Auventic, Inc. @auventic

19 Followers 75 Following Auventic stands at the forefront of Al safety, ensuring technology enhances, not overshadows, human potential.

Senior tech reporter @POLITICOEurope in London. vmanancourt@politico.eu. DM for phone number

Vincent Manancourt @vmanancourt

7K Followers 2K Following Senior tech reporter @POLITICOEurope in London. [email protected]. DM for phone number

Jack Sellers @JMSellers93

2K Followers 1K Following Special Adviser to the Prime Minister

Krueger AI Safety Lab @kasl_ai

257 Followers 51 Following We are a research group at the University of Cambridge focused on avoiding catastrophic risks from AI.

rupalim Sarma @rupalims

15 Followers 262 Following

Ai Sakura🇯🇵🇺.. @aisakuraonx

114 Followers 791 Following A blend of AI & Coaching 🧠🤖

M.L.S @_mlspace

24 Followers 72 Following

xena @Parth19091

120 Followers 2K Following having fun

math, physics, AI alignment

alignment is too hard, we should do governance instead

leave me anonymous feedback at https://t.co/A1Prj0teYX

Joern Stoehler ⏹️ @JStoehler

78 Followers 193 Following math, physics, AI alignment alignment is too hard, we should do governance instead leave me anonymous feedback at https://t.co/A1Prj0teYX

ByeRose @byerose365

0 Followers 520 Following

Building in AI & Consumer @earlywormapp Prev worked as SWE @ Meta building FB ads and LLM codegen tools. @HopkinsNanjing @UVA

Chris Wood @C_H_Wood

746 Followers 4K Following Building in AI & Consumer @earlywormapp Prev worked as SWE @ Meta building FB ads and LLM codegen tools. @HopkinsNanjing @UVA

Lexi Keegan @lexikeegan

62 Followers 493 Following softball, social research, and occasionally sun.

Joey Trend @joeytrend

18K Followers 7K Following Trend Setter - Entrepreneur - Optimizer

Chris Kihereko @CKihereko

98 Followers 457 Following

Jo Marriott @JoMarriott3

50 Followers 169 Following Senior Impact Manager @EPSRC

Ruth Kaufmann Wolfe @rkaufmannwolfe

26 Followers 259 Following

The original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.

Eliezer Yudkowsky ⏹.. @ESYudkowsky

175K Followers 89 Following The original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.

Google DeepMind @GoogleDeepMind

944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Michael Nielsen @michael_nielsen

96K Followers 6K Following Searching for the numinous 🇦🇺 🇨🇦, home in 🇺🇸 Research @AsteraInstitute https://t.co/maezekzRUb

Amanda Askell @AmandaAskell

26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.

We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.

Anthropic @AnthropicAI

262K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.

Rob Bensinger ⏹️ @robbensinger

8K Followers 302 Following Comms @MIRIBerkeley. RT = increased vague psychological association between myself and the tweet.

Jack Clark @jackclarkSF

68K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures

Jan Leike @janleike

44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.

Kelsey Piper @KelseyTuoc

27K Followers 544 Following Senior writer at Vox's Future Perfect. [email protected]

Catherine Olsson @catherineols

15K Followers 1K Following Hanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁 prev: @open_phil @googlebrain @openai (@microcovid)

Anders Sandberg @anderssandberg

25K Followers 71 Following Academic jack-of-all-trades.

Sam Bowman @sleepinyourhat

35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.

Robert Wiblin @robertwiblin

34K Followers 643 Following Exploring the inviolate sphere of ideas one interview at a time: https://t.co/2YMw00bkIQ

Habiba @FreshMangoLassi

4K Followers 523 Following Co-founder @SpiroTB - new TB screening and prevention charity focused on children https://t.co/sBf6ONGMSL

Alexander Berger @albrgr

11K Followers 2K Following Enjoys a good applied micro paper. CEO of @open_phil. Views my own, tweets self-destruct every once in a while.

Thinking about whether AI will destroy the world at https://t.co/pMilDvd4ya. DM or email for media requests. Feedback: https://t.co/zGAm1i7SKH

Katja Grace 🔍 @KatjaGrace

8K Followers 798 Following Thinking about whether AI will destroy the world at https://t.co/pMilDvd4ya. DM or email for media requests. Feedback: https://t.co/zGAm1i7SKH

VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead.

Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.

Oriol Vinyals @OriolVinyalsML

167K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.

Reverse engineering neural networks at @AnthropicAI. DMs open! Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

Chris Olah @ch402

91K Followers 173 Following Reverse engineering neural networks at @AnthropicAI. DMs open! Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

Imogen Schon @Imogen_Schon

142 Followers 733 Following 2015 Year Here Fellow

Adviser to the PM on AI, angel investor, NED @cabinetofficeuk & @oaknational Exited founder - Look After My Bills (@ycombinator W18)

Henry de Zoete @HZoete

3K Followers 3K Following Adviser to the PM on AI, angel investor, NED @cabinetofficeuk & @oaknational Exited founder - Look After My Bills (@ycombinator W18)

Professor @Wharton studying AI, innovation & startups. Democratizing education using tech
Book: https://t.co/CSmipbJ2jV
Substack: https://t.co/UIBhxu4bgq

Ethan Mollick @emollick

211K Followers 553 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgq

investor @pluralplatform; chair UK AI Safety Institute; co-author @stateofaireport; co-founder @songkick; chair @PhasecraftLtd

Ian Hogarth @soundboy

23K Followers 3K Following investor @pluralplatform; chair UK AI Safety Institute; co-author @stateofaireport; co-founder @songkick; chair @PhasecraftLtd

Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only

Verena Rieser @verena_rieser

4K Followers 1K Following Researcher @DeepMind working on safer Conversational AI | Honorary Professor @HeriotWattUni | Co-founder @helloalana | mother of dragons | own opinions only

Lean @leanprover

4K Followers 35 Following Lean is a dependently-typed programming language and theorem prover.

research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own

Dr. Nahema Marchal @nahema_marchal

2K Followers 1K Following research scientist @deepmind | 🔎 tech governance, online harms, socio-technical ai | previously @prodigi_erc @oiioxford | flibbertigibbet, views her own

Mathematician learning Lean and trying to teach it to others. Now gone to Mathstodon (March 2023). No longer reading or replying to mentions.

Kevin Buzzard @XenaProject

9K Followers 0 Following Mathematician learning Lean and trying to teach it to others. Now gone to Mathstodon (March 2023). No longer reading or replying to mentions.

Alicia Parrish @AliciaVParrish

556 Followers 675 Following Research scientist at Google. I like CogSci & NLP. PhD from @nyuling. She/her.

Prof. of math @UConn, number theorist, author, Hagoromo Ambassador.

On *Twitter* hiatus. You can find me in bluer celestial pastures instead.

Álvaro Lozano-Robled.. @MathAndCobb

4K Followers 729 Following Prof. of math @UConn, number theorist, author, Hagoromo Ambassador. On *Twitter* hiatus. You can find me in bluer celestial pastures instead.

Researcher in NLP/ML @deepmind, @ucl_nlp, @riedelcastro@sigmoid.social on Mastodon

Sebastian Riedel (@ri.. @riedelcastro

15K Followers 470 Following Researcher in NLP/ML @deepmind, @ucl_nlp, @[email protected] on Mastodon

Tamar Shinar @ttshinar

31 Followers 225 Following

Some theorems @CihanPostsThms

24K Followers 6 Following Posting some theorems, and occasionally other stuff. By @bahran_cihan

Richard Chappell @RYChappell

1K Followers 114 Following Academic Philosopher. Blogs at https://t.co/d4D6CfLwuB

Working toward a free and fair future powered by friendly AI.

Head of interpretability research at @AiEleuther, but tweets are my own views, not Eleuther’s.

Nora Belrose @norabelrose

8K Followers 124 Following Working toward a free and fair future powered by friendly AI. Head of interpretability research at @AiEleuther, but tweets are my own views, not Eleuther’s.

Matt Levine @matt_levine

306K Followers 1K Following lunch valuation analyst

depths of wikipedia! @depthsofwiki

889K Followers 4K Following Hello I am @anniierau Please take away my blue check! I did not ask for it!

Writing @platformer. Co-hosting Hard Fork @nytimes. Posting good tweets to Instagram stories @crumbler. casey@platformer.news | https://t.co/9KuJb8XCrr

Casey Newton @CaseyNewton

214K Followers 909 Following Writing @platformer. Co-hosting Hard Fork @nytimes. Posting good tweets to Instagram stories @crumbler. [email protected] | https://t.co/9KuJb8XCrr

Cari Tuna @CariTuna

3K Followers 83 Following

Vijay Bolina @vijaybolina

3K Followers 5K Following Hacking AGI. CISO @Google @DeepMind. Former @Mandiant @BoozAllen. Tweets my own.

Ethan Porter @EthanVPorter

2K Followers 1K Following Associate professor at @GWtweets, @SMPAGWU, @GWIDDP, formerly @demjournal, @uchicago.

ruchowdh.bsky.social @ruchowdh

44K Followers 4K Following find me at https://t.co/hrk5quIFJI

A bot that tweets whenever the Metaculus (@metaculus) community prediction for a question (currently Ukraine and Monkeypox) changes significantly.

Metaculus Prediction .. @MetaculusAlert

3K Followers 5 Following A bot that tweets whenever the Metaculus (@metaculus) community prediction for a question (currently Ukraine and Monkeypox) changes significantly.

ninell oldenburg @nellsn1

420 Followers 608 Following PhD student in philosophy of AI @ ucph; tweets on linguistics, artificial "intelligence", the awful German language, & maps

Tech and society outside the global north. Senior research scientist at DeepMind. Past: Meta, Princeton, TVR2C at WPRB, Peace Corps, USDOJ

Stevie Bergman @tvr2c

548 Followers 976 Following Tech and society outside the global north. Senior research scientist at DeepMind. Past: Meta, Princeton, TVR2C at WPRB, Peace Corps, USDOJ

alex lawsen @lxrjl

3K Followers 745 Following AI Grantmaking @ Open Philanthropy Previously 80,000 Hours, teaching, forecasting, poker. Views my 🐒's

Fredrik Johansson @hypergeometer

678 Followers 145 Following fredrikj @ mathstodon Computer algebra & Arbitrary-precision arithmetic. Researcher at @Inria.

Casey Muratori @cmuratori

41K Followers 120 Following I want all my garmonbozia. https://t.co/Bdh1Xj2PpV

Maja Trebacz @majatrebacz

279 Followers 221 Following Research Engineer @DeepMind

Quinta Jurecic @qjurecic

56K Followers 3K Following senior editor @lawfare, fellow @BrookingsInst, contributing writer @TheAtlantic. views my own, RTs = @infinite_scream.

Energy and commodities columnist at Bloomberg. Co-author of the 'The World for Sale' https://t.co/GAcVleqiqp Any views expressed are my own. jblas3@bloomberg.net

Javier Blas @JavierBlas

297K Followers 1K Following Energy and commodities columnist at Bloomberg. Co-author of the 'The World for Sale' https://t.co/GAcVleqiqp Any views expressed are my own. [email protected]

Sean Legassick @SeanLegassick

435 Followers 104 Following Technologist. Ex-Ethics@DeepMind.

Senior epidemiologist - COVID-19 vaccines @UKHSA. PhD in the immunology of respiratory viral infections @imperialcollege.

Freja Kirsebom @freja_kirsebom

7K Followers 689 Following Senior epidemiologist - COVID-19 vaccines @UKHSA. PhD in the immunology of respiratory viral infections @imperialcollege.

Blogger world modeling, now mostly AI and AI x-risk, at Don't Worry About the Vase (https://t.co/O9LbMQjKoo or WP/LW), founding Balsa Research to fix policy.

Zvi Mowshowitz @TheZvi

24K Followers 283 Following Blogger world modeling, now mostly AI and AI x-risk, at Don't Worry About the Vase (https://t.co/O9LbMQjKoo or WP/LW), founding Balsa Research to fix policy.

Amelia (Mia) Glaese @mia_glaese

76 Followers 38 Following

Learning things @DeepMind & @UniofOxford | Angel Investor @Atomico | Formerly @Uber @Dropbox @Google, Board @simplysecureorg | She/her
🇹🇼🇺🇸📍🇬🇧

Dorothy Chou @dorothychou

914 Followers 451 Following Learning things @DeepMind & @UniofOxford | Angel Investor @Atomico | Formerly @Uber @Dropbox @Google, Board @simplysecureorg | She/her 🇹🇼🇺🇸📍🇬🇧

Jacob Steinhardt @JacobSteinhardt

7K Followers 67 Following Assistant Professor of Statistics, UC Berkeley

Aussie battler trying to make it in the Big Smoke.

Building on the critical path to AGI since '16.

Currently: RLHF that doesn't suck @GoogleDeepMind

John Aslanides @john_aslanides

608 Followers 1K Following Aussie battler trying to make it in the Big Smoke. Building on the critical path to AGI since '16. Currently: RLHF that doesn't suck @GoogleDeepMind

Nat McAleese @nmca

3K Followers 306 Following Superalignment by models helping humans help models help humans at OpenAI. Previously @DeepMind. Views my own.

Howie Lempel @HowieLempel

1K Followers 497 Following

Benjamin Todd @ben_j_todd

12K Followers 143 Following Founder @80000Hours Writing about what to do about AI, doing good, and using research to have a nice life 🦑

Liv Boeree @Liv_Boeree

254K Followers 497 Following Looking for the win/wins in life. Not a fan of Moloch traps. Brand new podcast out now, link below👇

Director of https://t.co/gCEDoKdKBT at Uni of Cambridge | Researching Big Risks, and impacts of AI & emerging tech. Opinions own

Seán Ó hÉigeartaig.. @S_OhEigeartaigh

2K Followers 1K Following Director of https://t.co/gCEDoKdKBT at Uni of Cambridge | Researching Big Risks, and impacts of AI & emerging tech. Opinions own

PhD in Psych/Neuro from Princeton. 80,000 Hours Advisor. Mother of two multimodal multitasking neural networks. Views are my own.

Abby Novick Hoskin @CorpusCalosseum

662 Followers 825 Following PhD in Psych/Neuro from Princeton. 80,000 Hours Advisor. Mother of two multimodal multitasking neural networks. Views are my own.

Allan Dafoe @AllanDafoe

3K Followers 565 Following AGI governance: navigating the transition to beneficial AGI (Google DeepMind)

Jacob Menick @jacobmenick

4K Followers 266 Following Researcher @OpenAI. PhD candidate @UCL. previously @DeepMind 🇺🇸/🇬🇧

Jacob Pfau @jacob_pfau

5 days ago

Do models need to reason in words to benefit from chain-of-thought tokens? In our experiments, the answer is no! Models can perform on par with CoT using repeated '...' filler tokens. This raises alignment concerns: Using filler, LMs can do hidden reasoning not visible in CoT🧵

40 179 1K 249K 908

Download Image

Zvi Mowshowitz @TheZvi

5 days ago

I believe we have discovered the flaw in the 'post message that you will be deactivating your account removing all of your Tweets' plan.

roon @tszzl

5 days ago

gonna deactivate for a while. nothings wrong i just need to detox from the site a bit

69 19 805 88K 32

0 0 36 6K 0

Rohin Shah @rohinmshah

5 days ago

Rose: The idea is extremely simple and well-motivated, and the effect sizes are large. Thorn: p=0.05 :( (Tbc, I am very confident we would have reached statistical significance for Gated SAEs being more interpretable, if we had a large enough N.) x.com/sen_r/status/1…

Senthooran Rajamanoharan @sen_r

6 days ago

New @GoogleDeepMind MechInterp work! We introduce Gated SAEs, a Pareto improvement over existing sparse autoencoders. They find equally good reconstructions with around half as many firing features, while maintaining interpretability (CI 0-13% improvement). Joint w/ @ArthurConmy

5 24 158 21K 87

Download Image

0 0 11 2K 2

Ferenc Huszár @fhuszar

a week ago

@geoffreyirving @KLdivergence @sindero And I found the Jensen Huang reference from GTC keynote 2016. Look at the slide:

1 0 1 422 0

Download Image

Ferenc Huszár @fhuszar

a week ago

@geoffreyirving @KLdivergence @sindero I don’t have an issue with that. I have an issue with using “inference” as a verb, as in “inferencing”. That drives me nuts. x.com/fhuszar/status…

Ferenc Huszár @fhuszar

a week ago

@CellTypist @KLdivergence My issue is grammatical: using inference as a verb. I.e instead of to infer something they say to inference something. Or the word “inferencing” Others might take issue using the word “infer” instead of forecast or predict I don’t know.

0 0 1 1K 1

1 0 2 112 0

Ferenc Huszár @fhuszar

a week ago

@geoffreyirving @KLdivergence @sindero phi-3 paper is the last time I saw it:

2 0 1 417 0

Download Image

Kristian Lum @KLdivergence

a week ago

I will never get over how AI/ML people use the word “inference”

67 35 608 203K 113

Allan Dafoe @AllanDafoe

a week ago

We are looking for an AGI Safety Manager to support @GoogleDeepMind 's AGI Safety Council: please encourage excellent people to apply! This role will work closely with my team, Scalable Alignment and Safety, and Responsible Development and Innovation. boards.greenhouse.io/deepmind/jobs/…

9 18 78 9K 25

Amanda Askell @AmandaAskell

a week ago

Most of the time you don't really notice the world changing. Then one day you're sitting in the back of a driverless car, listening to music on your phone while asking an AI something, when suddenly you're struck by a memory of childhood and you realize you now live in Star Trek.

10 20 285 12K 24

Ian Goodfellow @goodfellow_ian

a week ago

@_NicT_ In chapter 1 of deeplearningbook.org we say that data representations are crucial for not just machine learning or even computer science but daily life (e.g. try dividing numbers by hand with Roman numerals).

1 0 37 12K 6

Nicholas Teague @_NicT_

a week ago

@goodfellow_ian I blame myself for failing to cite this passage in your book!

0 0 5 523 0

Greg Egan @gregeganSF

a week ago

No smooth curve that lies on a sphere can contain an inflection point (a point whose curvature is zero). But on a surface of constant negative curvature, like this tractroid, no such obstacle exists, and every non-meridian geodesic on the tractroid has 2 inflection pts (black).

8 18 120 12K 10

Download Image

Wei Dai @weidai11

2 weeks ago

@RokoMijic @ESYudkowsky @robinhanson I think you're over-extrapolating the success of RLHF (which I was worried people would do). Remember why people came up with "scalable alignment" ideas like IDA and Debate. Those solutions aren't coming online fast enough. @geoffreyirving was worried about this when I asked him.

1 0 7 335 1

Helen Toner @hlntnr

2 weeks ago

This is fantastic news for NIST 🎉 I'm biased because he's a friend, but Paul Christiano is both technically brilliant and holistically super thoughtful - not to mention a pioneer in exactly the kind of frontier testing work he'll be leading at AISI. commerce.gov/news/press-rel…

3 8 132 8K 12

Michael Nielsen @michael_nielsen

2 weeks ago

The Future of Humanity Institute has shutdown 🙁 (2005-2024): futureofhumanityinstitute.org

21 38 321 113K 121

Josh Dzieza @joshdzieza

2 weeks ago

I've been wanting to do this story for years and am thrilled it's finally out: inside the surprisingly small, highly specialized industry that repairs the internet cables on the bottom of the ocean theverge.com/c/24070570/int…

30 275 817 133K 249

Anders Sandberg @anderssandberg

2 weeks ago

One unexpected benefit of sending out an email to every contact about an updated email address is that I suddenly hear from a lot of friends and acquaintances I have not interacted with for ages.

4 2 75 4K 6

Daniel Litt @littmath

2 weeks ago

Unfortunately it seems that the universe has been fine-tuned in such a way that this tweet would receive some very annoying replies. Wishing the fundamental constants of the universe were very slightly different right now.

Daniel Litt @littmath

3 weeks ago

I just flipped a coin ten times, resulting in the sequence HTHHTHTTTH. The universe was fine-tuned for this outcome—if the fundamental constants of the universe had been even 0.1% different, I never would have observed this sequence of flips.

30 22 554 54K 35

8 7 222 17K 5

Daniel Litt @littmath

3 weeks ago

30 22 554 54K 35

Daniel Litt @littmath

3 weeks ago

You flip a coin 200 times. The first 100 flips, it lands on heads; the second 100, on tails. As a proud Bayesian, you conclude you are most likely in the middle of a logic puzzle.