Michal Bravansky @michalbravansky

@verifee @ucl bravansky.com London, England Joined May 2016

Tweets

79
Followers

182
Following

1K
Likes

2K

Michal Bravansky @michalbravansky

a week ago

It seems to me likely that as we're shifting toward task-specific environments for LLM post-training, model providers will inevitably become incentivized to vacuum up as much user context as they possibly can, just to fully reconstruct their tasks inside their RL stacks. If…

0 0 0 65 0

Hensen Juang @basedjensen

a week ago

Lol these bros are just vibe governing

Rapid Response 47 @RapidResponse47

a week ago

Lol these bros are just vibe governing

2K 2K 8K 2.3M 1K

2 4 113 10K 3

near @nearcyan

2 weeks ago

you can just copy-trade leopold’s fund and triple your money in a day. the forms are public

59 94 4K 464K 4K

Download Image

Nat McAleese @nmca

2 weeks ago

everything reminds me of him 😭

28 34 986 291K 265

Download Image

Josh Landes @guynamedjoshl

2 weeks ago

£10k if you find me our next tech lead - I think this is a very cool role (I'm covering for parts of it atm) at a very cool org (we just raised 25M). DMs open :)

7 3 15 2K 3

Download Image

Dan Lahav @dan_lahav

2 weeks ago

Today I’m launching @Irregular (formerly Pattern Labs) with my friend and co-founder Omer Nevo: Irregular is the first frontier security lab. Our mission: protect the world in the era of increasingly capable and sophisticated AI systems.

48 47 382 167K 92

Download Video

Thomas G. Dietterich @tdietterich

2 weeks ago

We need new rules for publishing AI-generated research. The teams developing automated AI scientists have customarily submitted their papers to standard refereed venues (journals and conferences) and to arXiv. Often, acceptance has been treated as the dependent variable. 1/

4 10 62 26K 14

Xander Davies @alxndrdavies

2 weeks ago

Excited to share details on two of our longest running and most effective safeguard collaborations, one with Anthropic and one with OpenAI. We've identified—and they've patched—a large number of vulnerabilities and together strengthened their safeguards. 🧵 1/6

8 63 292 52K 121

Download Image

Robert Kirk @_robertkirk

4 weeks ago

New blog! We @AISecurityInst partnered with @NCSC to write about an emerging practice I'm really excited about: Safeguard Bypass Bounty Programmes (SBBPs). Summary of what these are, why they are useful, & how to do them well 🧵

2 11 50 8K 6

Robert Kirk @_robertkirk

a month ago

Since I started working on safeguards, we've seen substantial progress in defending certain hosted models, but less progress in measuring & managing misuse risks from open weight models. Three directions I want explored more, drawn from our @AISecurityInst post today 🧵

1 7 37 2K 16

Download Image

Charlie O'Neill @charles0neill

a month ago

Today, we’re launching Parsed. We are incredibly lucky to live in a world where we stand on the shoulders of giants, first in science and now in AI. Our heroes have gotten us to this point, where we have brilliant general intelligence in our pocket. But this is a local minima. We…

57 57 482 87K 331

Download Video

Miles Brundage @Miles_Brundage

2 months ago

Fortunately I have a Pro account and thus am not at risk of having the model picker taken away from me (?) but if that were not the case I might be leading protests for Pause AI [Product Changes]

5 3 38 5K 0

Amir Zur @AmirZur2000

2 months ago

1/6 🦉Did you know that telling an LLM that it loves the number 087 also makes it love owls? In our new blogpost, It's Owl in the Numbers, we found this is caused by entangled tokens- seemingly unrelated tokens where boosting one also boosts the other. owls.baulab.info

18 72 666 68K 471

Google DeepMind @GoogleDeepMind

2 months ago

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

842 3K 14K 3.6M 4K

Download Video

Miles Brundage @Miles_Brundage

2 months ago

Having a cabinet meeting tonight

4 3 50 5K 7

Download Image

Michal Bravansky @michalbravansky

2 months ago

I just had a blast going through the @SPARexec project proposals, it’s a great way to see where AI safety is heading. Plus it’s always satisfying to cross off some research ideas from my idea google doc sparai.org/projects/

0 0 2 85 0

Daniel Paleka @dpaleka

2 months ago

"advanced usage patterns like running Claude 24/7 in the background" gang

1 1 13 983 0

Michal Bravansky @michalbravansky

2 months ago

Out-of-context reasoning at its finest. Are we sure secret loyalties won’t just naturally emerge within models?

nostalgebraist @nostalgebraist

2 months ago

Out-of-context reasoning at its finest. Are we sure secret loyalties won’t just naturally emerge within models? https://t.co/26HbOzlgpE

12 9 127 27K 50

0 0 2 98 0

Download Image

nostalgebraist @nostalgebraist

2 months ago

chain-of-thought monitorability is a wonderful thing ;) gist.githubusercontent.com/nostalgebraist…

12 9 127 27K 50

Luca Bertuzzi @BertuzLuca

2 months ago

Meta will not sign the EU code of practice for general-purpose AI models.

8 25 59 24K 24

Download Image

Vera @deaver99640

0 Followers 313 Following I'm 29 and single. DMs are for followers only.

Hilde @Paukouj530

36 Followers 2K Following My hobbies include eating and complaining that I’m getting fat.

Qojal @Qojal48526

14 Followers 726 Following I was born to stand out, not to fit in.

Zoe @KloskaD7465

0 Followers 21 Following dm me if you're not a pussy

Ytraunix @Ytraunix9463

6 Followers 450 Following Turn your wounds into wisdom.

Vauxie @Vauxie12372

12 Followers 449 Following You don’t have to play the game the way they wrote it.

Summer @Eefloopal14637

37 Followers 975 Following

Pulling out all the FLOPS at @FLAIR_Ox 🚀 DPhil in Machine Learning @UniofOxford | ex-RS Intern @Spotify | ex-RS @convergence_ai_ (acq. @Salesforce)

J Rosser @ NeurIPS @jrosseruk

165 Followers 477 Following Pulling out all the FLOPS at @FLAIR_Ox 🚀 DPhil in Machine Learning @UniofOxford | ex-RS Intern @Spotify | ex-RS @convergence_ai_ (acq. @Salesforce)

Xoutea @Xoutea5724

35 Followers 2K Following Like to talk Do not hold any investment products

Luke Drago @luke_drago_

2K Followers 554 Following building a human future @workshoplabspbc

katie ledecky @KLedecky61521

307 Followers 6K Following Athlete 4x U.S. Olympic Swimmer 9x Olympic Gold Medalist. 21x World Champion.

Chuang Gan @gan_chuang

9K Followers 496 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/Pc8WeREfTz

HermosaYoung @Nqq8A42yORfz2Q

8 Followers 468 Following

Bill Leoutsakos @Bi11Leou

135 Followers 541 Following Computer Engineering @cambridge_uni | ex-ML Engineer @Cosine_AI | Eurotech Fellow

alentinaBert @bNF38wnkbzO2E8

19 Followers 1K Following

Jausal @Jausal29995

34 Followers 2K Following

Partner @SteptoeLLP. Emerging technology and national security law. AI | Chips | FinTech | Crypto. Not legal advice. Opinions are my own.

Evan Abrams @EvanAbrams

5K Followers 885 Following Partner @SteptoeLLP. Emerging technology and national security law. AI | Chips | FinTech | Crypto. Not legal advice. Opinions are my own.

Fraluxav @Fraluxav924241

109 Followers 2K Following

Varhaw @Varhaw56377

62 Followers 2K Following

Ulyana Piterbarg @ulyanapiterbarg

945 Followers 630 Following reasoning, agents, RL, + open-endedness | PhDing at @nyuniversity, prev @MIT

Vouowu @Vouowu3839

100 Followers 3K Following

Jack Youstra @JackYoustra

80 Followers 103 Following

CandiceMiddleton @Dz5RXY2RS4Zzh43

35 Followers 1K Following

Allen @allenjpark

1K Followers 1K Following something new | cs @princeton | prev. evals @patronusAI & baker @subway

Josh Landes @guynamedjoshl

320 Followers 1K Following into flourishing futures and making friends with smart machines | @BlueDotImpact

Leo McKee-Reid @LeoMckeeReid

114 Followers 469 Following AI safety startup founder || sisyphus enjoyer prev: ml4science, deception, brains, rockets

25. Building talent & community in AI safety. Currently @AISecurityInst, prev. @AnthropicAI. Philosophy, Politics, and Economics alumna @UniofOxford.

Shannon Yang @shannonyangsky

1K Followers 4K Following 25. Building talent & community in AI safety. Currently @AISecurityInst, prev. @AnthropicAI. Philosophy, Politics, and Economics alumna @UniofOxford.

Amir Battye @three__sided

718 Followers 3K Following Founder @ https://t.co/8nC65PrUa6, Maths @cambridge_uni

Cozmin Ududec @CUdudec

371 Followers 2K Following @AISecurityInst Testing and Science of Evals. Ex quantum foundationalist.

Allie Cummings @allie_cumm33798

32 Followers 3K Following

Associate Professor @ucl | Language and AI Science | Previously senior research scientist @AISafetyInst, postdoc @ETH_en, PhD @illc_amsterdam

Mario Giulianelli @glnmario

982 Followers 959 Following Associate Professor @ucl | Language and AI Science | Previously senior research scientist @AISafetyInst, postdoc @ETH_en, PhD @illc_amsterdam

Oudrauargtork @Oudrauargtork0

98 Followers 2K Following

AlgoTradeEdge🇺🇸 @Eefwikaw660849

42 Followers 2K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis

Incoming DPhil student @UniofOxford | @Princeton CS | Ex-Twitter, https://t.co/mDPMzmN1Ye, @SentientAGI | Defence & Tech Committee @youngfabians | Philosophically-inclined

Lucas Irwin @lucasjamesirwin

38 Followers 137 Following Incoming DPhil student @UniofOxford | @Princeton CS | Ex-Twitter, https://t.co/mDPMzmN1Ye, @SentientAGI | Defence & Tech Committee @youngfabians | Philosophically-inclined

Ayman Ali @AAyman_1302

336 Followers 1K Following Investor @join_ef | @_ai_collective 🇬🇧 | Previously 👷‍♂️ @amazon @uber and multiple startups | 🇸🇦🇮🇳🇬🇧

Malxui @Malxui980

24 Followers 1K Following

Ougulau @Ougulau5260136

75 Followers 2K Following

Andrei Nebeleac @NebeleacAndrei

4K Followers 4K Following Please visit my store https://t.co/ZlTF9mfj03

Iegilo @Iegilo1212510

122 Followers 3K Following

United States Navy

Deputy commander of United States Central Command

Former Commander United States Fifth Fleet

From Winston-Salem, North Carolina

Admiral. Charles Coop... @admiral94906

405 Followers 7K Following United States Navy Deputy commander of United States Central Command Former Commander United States Fifth Fleet From Winston-Salem, North Carolina

Sinuo @DoynespWAQ

56 Followers 843 Following Girls who love to laugh will never have bad luck. I also hope to meet my prince charming.

Adarsh @Drakon1c

48 Followers 459 Following cs @ cambridge | swe intern @ samsara

Sherry @Sherry04061995

960 Followers 5K Following Hello World ！

Vedaangh Rungta @vedaangh

261 Followers 278 Following @cambridge_uni | ...

Henry Sabin @The_Sabinator

498 Followers 951 Following Creating weapons of mass personalization.

Taarush Grover @Tagrtagr

1K Followers 1K Following building the future in 3D | @stanford @fdotinc

Nathan Herr @naitherr

90 Followers 72 Following PhD Student @AI_UCL & @UCL_DARK. ex Research Scientist @IBMResearch.

raymond ma @rayhascode

3K Followers 875 Following engineering @openai | prev. @cohere

Jonathan Li @jonat_li

100 Followers 55 Following building Induction Labs (YC S25) | prev. reasoning @ cohere

Working on a new terminal: Ghostty. 👻 Prev: founded @HashiCorp. Created Vagrant, Terraform, Vault, and others. Vision Jet Pilot. 👨‍✈️

Mitchell Hashimoto @mitchellh

146K Followers 141 Following Working on a new terminal: Ghostty. 👻 Prev: founded @HashiCorp. Created Vagrant, Terraform, Vault, and others. Vision Jet Pilot. 👨‍✈️

Alexander Panfilov @kotekjedi_ml

217 Followers 199 Following IMPRS-IS & ELLIS PhD Student @ Tübingen Interested in Trustworthy ML, Security in ML and AI Safety.

80,000 Hours Job Boar... @80000hours_jobs

19 Followers 1 Following A bot sharing highlighted jobs daily, made by @80000hours

J Rosser @ NeurIPS @jrosseruk

165 Followers 477 Following Pulling out all the FLOPS at @FLAIR_Ox 🚀 DPhil in Machine Learning @UniofOxford | ex-RS Intern @Spotify | ex-RS @convergence_ai_ (acq. @Salesforce)

Miles Kodama @Miles_M_K

74 Followers 4 Following

Seth Bannon @sethbannon

34K Followers 676 Following Entrepreneur, investor. Founder of @fiftyyears. Make something civilization needs. Also: https://t.co/xhMPeOCKIN

💸https://t.co/sQ0aiU7v02 $336K/m
📸https://t.co/lAyoqmSBRX $150K/m
🛰https://t.co/ZHSvI2wjyW $33K/m
🏡https://t.co/1oqUgfD6CZ $30K/m
🌍https://t.co/UXK5AFqCaQ $7K/m
👙https://t.co/RyXpqGuFM3 $14K/m
💾https://t.co/M1hEUBAynC $6K/m

@levelsio @levelsio

734K Followers 2K Following 💸https://t.co/sQ0aiU7v02 $336K/m 📸https://t.co/lAyoqmSBRX $150K/m 🛰https://t.co/ZHSvI2wjyW $33K/m 🏡https://t.co/1oqUgfD6CZ $30K/m 🌍https://t.co/UXK5AFqCaQ $7K/m 👙https://t.co/RyXpqGuFM3 $14K/m 💾https://t.co/M1hEUBAynC $6K/m