Iftekhar Uddin, MD MSPH @iftekhxr

Robotics Engineer. Prev: @HopkinsMedicine @JohnsHopkinsSPH San Francisco, CA Joined January 2021

Tweets

785
Followers

116
Following

2K
Likes

6K

M. Nolan Gray 🥑 @mnolangray

3 weeks ago

SB 79 HAS PASSED!

31 95 950 807K 31

Download Image

One of my MATS scholars, @paulcbogdan, has a solid background in "how to use statistics to do rigorous science", and wrote a delightful post on how you can do this too! He once wrote a paper studying the past 20 years of psychology papers, and trends in what replicated

5 24 361 19K 297

Download Image

Mikhail Terekhov @MiTerekhov

4 months ago

AI Control is a promising approach for mitigating misalignment risks, but will it be widely adopted? The answer depends on cost. Our new paper introduces the Control Tax—how much does it cost to run the control protocols? (1/8) 🧵

4 20 69 11K 34

Download Image

rowan @rowankwang

5 months ago

New Anthropic Alignment Science blog post: Modifying LLM Beliefs with Synthetic Document Finetuning We study a technique for systematically modifying what AIs believe. If possible, this would be a powerful new affordance for AI safety research.

19 46 350 73K 283

Download Image

John Hughes @jplhughes

6 months ago

🧵NEW RESEARCH: Interested in whether R1 or GPT 4.5 fake their alignment? Want to know the conditions under which Llama 70B alignment fakes? Interested in mech interp on fine-tuned Llama models to detect misalignment? If so, check out our blog! 👀lesswrong.com/posts/Fr4QsQT5…

6 24 153 29K 84

Download Image

Anthropic @AnthropicAI

6 months ago

New Anthropic research: Do reasoning models accurately verbalize their reasoning? Our new paper shows they don't. This casts doubt on whether monitoring chains-of-thought (CoT) will be enough to reliably catch safety issues.

150 604 4K 1.1M 2K

Download Image

Buck Shlegeris @bshlgrs

6 months ago

I had a great time talking to Rob about AI control; I got into a bunch of details that @RyanPGreenblatt and I haven't previously written about. Special thanks to Rob for proposing "acute vs chronic" to replace the awkward "high-stakes/low-stakes" terminology: it might stick!

Rob Wiblin @robertwiblin

6 months ago

11 21 190 52K 149

Download Video

2 6 71 3K 12

Google DeepMind @GoogleDeepMind

6 months ago

AGI could revolutionize many fields - from healthcare to education - but it's crucial that it’s developed responsibly. Today, we’re sharing how we’re thinking about safety and security on the path to AGI. → goo.gle/3R08XcD

63 185 1K 369K 288

Download Gif

Sukrit Ganesh 🇺🇸 🥑 🚲🛩️ @SukritGanesh

6 months ago

BREAKING NEWS: After years of deliberation, Santa Clara has decided to eliminate most zoning restrictions across the city, save for the historic downtown district. Updated building codes and 60-day permitting also reported. Intention is to grow the city to a million people.

38 131 2K 109K 101

NYC Bike Lanes @NYCBikeLanes

6 months ago

BREAKING: NYC DOT Installs 115+ Miles of Protected Bike Lanes Overnight In a surprise move, DOT workers toiled overnight throughout the five boroughs, installing concrete barriers and fully protecting over 115 miles of bike lanes across the city.

22 70 1K 57K 63

Download Image

Owain Evans @OwainEvans_UK

7 months ago

Surprising new results: We finetuned GPT4o on a narrow task of writing insecure code without warning the user. This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis. This is *emergent misalignment* & we cannot fully explain it 🧵

435 978 7K 1.9M 4K

Download Image

Victoria Krakovna @vkrakovna

8 months ago

We are excited to release a short course on AGI safety! The course offers a concise and accessible introduction to AI alignment problems and our technical & governance approaches, consisting of short recorded talks and exercises (75 minutes total). deepmindsafetyresearch.medium.com/1072adb7912c

7 49 262 58K 258

Jeff Dean @JeffDean

8 months ago

I can't believe they've just cancelled the Epidemic Intelligence Service program at CDC. My father was an EIS officer: epimonitor.net/PrintVersion/N… @Farzad_MD's thread below gives you a sense of the kind of people in this elite program to train the best & brightest epidemiologists.

Farzad Mostashari @Farzad_MD

8 months ago

416 5K 19K 2.6M 3K

125 917 4K 500K 595

Daniel Lurie 丹尼爾·羅偉 @DanielLurie

8 months ago

Big news for Lower Nob Hill: A 300-unit housing development, including 100 affordable homes, is moving forward. Thank you to the BOS for partnering to get this done. Together, we can cut red tape and make San Francisco more affordable.

42 19 398 26K 5

Flo Crivello @Altimor

8 months ago

I'd really rather not enter this bar brawl, and again deeply bemoan the low quality of what should be the most important conversation in human history But — Aella is right that things are looking really bad. Cogent and sensible arguments have been offered for a long time, and…

Aella @Aella_Girl

8 months ago

691 168 4K 2.8M 2K

39 35 536 110K 797

Hunter📈🌈📊 @StatisticUrban

8 months ago

Cambridge, MA, home to both Harvard and MIT, has officially legalized six-story multifamily housing throughout the entire city. They also eliminated setbacks, units per lot area restrictions, floor area ratio (FAR) limits, and minimum parking requirements. Amazing work.

Burhan Azeem, Cambridge City Councillor @realBurhanAzeem

8 months ago

149 585 6K 1.2M 911

Download Image

52 335 5K 226K 399

Download Image

Burhan Azeem, Cambridge City Councillor @realBurhanAzeem

8 months ago

I can’t believe it - after years of advocacy, exclusionary zoning has ended in Cambridge. We just passed the single most comprehensive rezoning in the US—legalizing multifamily housing up to 6 stories citywide in a Paris style Here’s the details 🧵

149 585 6K 1.2M 911

Download Image

Jonathan Berk @berkie1

8 months ago

🚨After eliminating parking requirements last year, Cambridge, Massachusetts has passed one of the more ambitious rezoning efforts in America tonight allowing 6-stories citywide by an 8-1 vote. Prior to this update, the City estimated only 350 new units would be built by 2040.

12 154 2K 217K 149

Download Image

Tanishq Mathew Abraham, Ph.D. @iScienceLuvr

8 months ago

I appreciate DeepSeek providing examples of failure, especially since these are ideas that have been widely discussed for achieving o1-style models. This is very rare to see in AI papers.

20 153 1K 124K 517

Download Image

Samuel Marks @saprmarks

9 months ago

What can AI researchers do *today* that AI developers will find useful for ensuring the safety of future advanced AI systems? To ring in the new year, the Anthropic Alignment Science team is sharing some thoughts on research directions we think are important.