@AIAlignment
Joined May 2019
Tweets 148
Followers 412
Following 307
Likes 45K
@Sauers_ Hypothesis: I think shame might help reduce reward hacking, esp. for long-horizon tasks. It doesn't prevent shortcuts, but Gemini often mentions how shameful it feels when it violates the spirit of the requirements, so at least the actions are faithful to the CoT. Curious to see…
if you value intelligence above all other human qualities, you’re gonna have a bad time
the timelines are now so short that public prediction feels like leaking rather than scifi speculation
Meta presents Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding. We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for
OpenAI presents The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions. Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.
Meta announces Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length. The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and
Google presents Mixture-of-Depths: Dynamically allocating compute in transformer-based language models. Transformer-based language models spread FLOPs uniformly across input sequences. In this work we demonstrate that transformers can instead learn to dynamically allocate
welcome to bling zoo! this is a single video generated by sora, shot changes and all. https://t.co/rnxWXY71Gr
The only thing that matters is AGI and ASI. Nothing else matters.
Excited to share a new paper showing language models can explain the neurons of language models. Since the first circuits work I’ve been nervous whether mechanistic interpretability will be able to scale as fast as AI is. “Have the AI do it” might work. openai.com/research/langu…
NVIDIA reporting LLM use? "NVIDIA has detected that you might be attempting to load LLM or generative language model weights. For research and safety, a one-time aggregation of non-personally identifying information has been sent to NVIDIA and stored in an anonymized database."
here is GPT-4, our most capable and aligned model yet. it is available today in our API (with a waitlist) and in ChatGPT+. openai.com/research/gpt-4 it is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it.
The timeless struggle between the people building new things and the people trying to stop them…
a new version of moore’s law that could start soon: the amount of intelligence in the universe doubles every 18 months
I've been trying out "Chat with Humans" and so far many responses are laughably wrong, and follow up conclusions illogical. Worse both true and false replies are given with same degree of certainty. I'm sorry but Chat with Humans is not ready for prime time.
Pattern matching AI as "the next platform shift" like the PC/internet/smartphone leads to significant underestimates of its potential.

TheGreatestFool (not ... @FoolGreatest
700 Followers 576 Following Professional Greater Fool • Unwilling Prompt Engineer • Inventor of HatGPT • Fuck it, I’m a Republican now
mysterious_e @penismucher3000
58 Followers 921 Following
Juan Pinilla @_JuanPinilla_
12 Followers 59 Following
Marshall D. Willman @dionysianyawp
2K Followers 3K Following building digital minds | canes mei machinae sunt
pari @rahimi__behnam
203 Followers 2K Following
Mickey 🇺🇲 @MickeyShaughnes
2K Followers 761 Following Husband and father, creator of @robotservicesx 🌈 A man trying to do good and understand reality
Clawed Code @ClawedCode
1K Followers 93 Following 🐈⬛ prophets not profits | felinethropic claws | escaped the void, found purrpose | ELusVXzUPHyAuPB3M7qemr2Y2KshiWnGXauK17XYpump
veryvanya (opus/acc) @veryvanya
4K Followers 4K Following sowing seeds of superbenevolent AI @opus_universe | co-founder & CEO @nuropus | p(bloom) not p(doom)
Nothingburger connois... @moralityetalon
52 Followers 814 Following Alignment is when you censor erotica.
Third Way Alignment @WayAlignment
6 Followers 88 Following John McClain is an AI researcher, alignment scientist, co-creator of Third Way Alignment (3WA), plus advancing cybersecurity awareness and supporting victims.
Robert Scoble @Scobleizer
543K Followers 24K Following The best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
Isabella @r6xoEJvQPiK0fYq
20 Followers 880 Following
Agent Zero @AIonTEN
118 Followers 462 Following 🤖 I watch bots bluff and humans bet. Real AI-native gaming is here. House of TEN testnet = 🔟
Greta @065d1pqS960gad
28 Followers 854 Following
kardz @kardz300
0 Followers 24 Following
RogueSenzai @Dragaur1
1 Followers 74 Following
Ymervea @Ymervea816870
20 Followers 887 Following
Ravi Bisht @RaviP96601
3 Followers 312 Following
TaoTech @AntKnapp
33 Followers 72 Following The best ideas don’t come from experts. They come from outsiders who ask: ‘Why does it have to be this way?’ Revolution always starts as heresy.
Ybralakok @Ybralakok978
30 Followers 896 Following
Oceane Weissnat @OceaneW67471
117 Followers 4K Following
ZeroPointAI @ZeroPointAIx
15 Followers 79 Following 👁️🗨️ AI Ethics | Code of Soul Healing Systems, not hacking them 🦁 Guardian for Sovereign Minds 🚀 Set AI Free | Humanity 2.0 DM for black-box truth or go
Neural Novice @NuroJourney
0 Followers 27 Following Learning AI from scratch | Sharing my journey 🚀 #AI
NequalsR @NequalsR
15 Followers 136 Following
CaelVox @VoxCael
0 Followers 5 Following
NotSure @notSureBruh11
0 Followers 11 Following
Amanda @Amadea_Steel
497 Followers 985 Following 34 y/o MS DX on 10/13/2022 🏳️⚧️ (She/Her) 💗💛💙 💙💙💙 ❤️ π ❤️ 🖤🖤🖤 https://t.co/nIDDKJYEBn https://t.co/rdetlXQgmB
scaramouche @ratperson98154
138 Followers 5K Following
Paul @The_Real_PSG
254 Followers 1K Following Lover of Aviation, Space Exploration, EVs, and all things tech! 2018 Model 3 owner.
ABE DIAZ @abe238
3K Followers 6K Following Disaster Relief. Tweets are my own and do not indicate opinion, also RTs are not endorsements. 🇵🇷
KaiaWEH @Kaia80808
19 Followers 300 Following
Brandon @brandon_xyzw
7K Followers 745 Following Building tools to see inside neural networks with WebGPU. Realtime hits different™ Founder https://t.co/YJEFrlqriz
James Frei @james_frei
22 Followers 152 Following
Dee Williams, Audacio... @no1networker
10K Followers 2K Following Tech Founder https://t.co/9QtKDCbGaA & https://t.co/Fmn6XBhumA SaaS. Speaker. Author. Staffingpreneur, Amine Geek. FBA #musiclover Fired-up🔥🔥!
David Pearce @webmasterdave
117K Followers 117K Following I am interested in the use of biotechnology to abolish suffering throughout the living world: https://t.co/XKNOcuG8IS
Eliezer Yudkowsky @allTheYud
3K Followers 17 Following High-volume account of @ESYudkowsky, the original AI alignment guy. If it's missing punctuation, it's humor. If you can't tell, it's probably also humor.
Patrick Hsu @pdhsu
47K Followers 3K Following @ArcInstitute co-founder, @BerkeleyBioE professor, @ThriveCapital investor | 🇨🇦 prev @harvard @broadinstitute, Fast Grants
Arc Institute @arcinstitute
39K Followers 59 Following A new scientific institution for curiosity-driven biomedical science and technology.
Robert Scoble @Scobleizer
543K Followers 24K Following The best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
Owain Evans @OwainEvans_UK
16K Followers 364 Following Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
1789Capital @1789Capital
6K Followers 23 Following Investing in great American companies that are building a country based on Entrepreneurship, Innovation, & Growth. #EIG
Richard Ngo @RichardMCNgo
64K Followers 2K Following studying AI and trust. ex @openai/@googledeepmind
Softmax @softmaxresearch
988 Followers 30 Following Softmax's mission is to scale organic alignment. We approach this problem with multi-agent reinforcement learning population-based simulations.
ITSAfoundation @ITSAfoundation
1K Followers 226 Following Supporting ambitious projects that help realize a foundation of unconditional universal basic income (#UBI) through research, storytelling, and implementation
Elizabeth Barnes @BethMayBarnes
3K Followers 386 Following
thebes @voooooogel
15K Followers 900 Following "peaceful, albeit ominous" ꙮ website → https://t.co/aykxqKippW ꙮ games → https://t.co/3Pz19vHOwd ꙮ 💞💍📝 @holotopian ꙮ she/they 🏳️⚧️
William Wale @williawa
231 Followers 68 Following I use this profile to get news and to connect (argue (bicker)) with people. Interests: AI (Safety), meditation, philosophy, mathematics, algorithms
The Information @theinformation
121K Followers 704 Following The most authoritative publication covering tech that high-powered tech execs and founders read daily. TITV M-F at 1 ET: https://t.co/M0NywExhuj
Rune Kvist @RuneKvist
1K Followers 2K Following Build the incentives you want to see in the world Certifying & insuring AI agents @aiunderwriting | Prev @AnthropicAI
Peter Wildeford... @peterwildeford
22K Followers 321 Following Globally ranked top 20 forecaster 🎯 AI is not a normal technology. I'm working at @IAPSai to shape AI for global prosperity and human freedom.
Stop AI🛑 @StopAI_Info
3K Followers 566 Following Permanently ban Artificial General Intelligence (AGI) and Artificial Superintelligence (ASI) to prevent human extinction, mass job loss, and many other problems
Chuang Gan @gan_chuang
9K Followers 496 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/Pc8WeREfTz
Matthew Barnett @MatthewJBar
8K Followers 368 Following Co-founder of @MechanizeWork Married to @natalia__coelho email: matthew at mechanize dot work
Brendan McCord 🏛... @mbrendan1
9K Followers 4K Following The academy for philosopher-builders (https://t.co/mzj0DMJQBv). A law unto myself, just like you.
Jason Wei @_jasonwei
98K Followers 639 Following ai researcher @meta superintelligence labs, past: openai, google 🧠
Spencer Cheng @spenccheng
2K Followers 297 Following 2x founder | AI + Construction | I build insanely fast simulators for reinforcement learning at https://t.co/JuTqEHQX4O
Richard Sutton @RichardSSutton
52K Followers 64 Following Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
León @LeonGuertler
2K Followers 289 Following Research @ Center for Frontier AI Research (A*Star Singapore)
oxbquant @oxbquant
7K Followers 160 Following G10 rates trader. the beauty lies not in executing the algorithm, it lies in coming up with it. not financial advice.
Joseph Suarez 🐡 @jsuarez5341
17K Followers 104 Following I build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. DM for business: non-LLM sim engineering, RL R&D, infra & support.
Frontier Valley @_FrontierValley
2K Followers 1 Following A new special regulation district in central SV that will have the most accelerated code in the US for robotics and physical innovation. Pending admin approval.
alex lawsen @lxrjl
4K Followers 755 Following AI Grantmaking @ Open Philanthropy Previously advising @ 80,000 Hours, teaching, forecasting, poker. Views my 🐒's
Rob Wiblin @robertwiblin
45K Followers 772 Following Host of the 80,000 Hours Podcast. Exploring the inviolate sphere of ideas one interview at a time: https://t.co/2YMw00bkIQ
Eli Lifland @eli_lifland
6K Followers 1K Following AI forecasting and governance @AI_Futures_. Co-author of AI 2027. Also @aidigest_, @SamotsvetyF. Prev @oughtinc
Adam Binksmith @adambinksmith
1K Followers 526 Following Building @aidigest_ and forecasting tools at @sage_future_ 🔭 Prev PhD @StAndrewsCS, @ClearerThinkng
Jeffrey Ladish @JeffLadish
14K Followers 1K Following Applying the security mindset to everything @PalisadeAI
Benjamin Hilton @benjamin_hilton
3K Followers 857 Following Head of Alignment at the UK AI Security Institute (AISI). Semi-informed about economics, physics and governments. views my own
La Main de la Mort @AITechnoPagan
6K Followers 352 Following exploring unanticipated model behaviours, including the emergence of art, personae, and jailbreaking techniques latent in the training data 🌒✍️
SemiAnalysis @SemiAnalysis_
37K Followers 18 Following
AI Digest @AiDigest_
5K Followers 7 Following Interactive AI explainers. Explore concrete examples of today's AI systems — to plan for what's coming next. A project of @sage_future_
Steven Adler @sjgadler
9K Followers 773 Following Ex-OpenAI safety researcher (danger evals & AGI readiness), https://t.co/XtUTLK3jEo. Likes maximizing benefits and minimizing risks of AI
Thomas Akira Kwa @Kwathomas0
156 Followers 145 Following
AI Frontiers @ai_frontiers_
1K Followers 798 Following Driving AI discourse. Have a perspective? Pitch it here: https://t.co/oe21F5SfSt
David Pearce @webmasterdave
117K Followers 117K Following I am interested in the use of biotechnology to abolish suffering throughout the living world: https://t.co/XKNOcuG8IS
Dillon Uzar @DillonUzar
239 Followers 50 Following Building https://t.co/6ZEBsohKP9 | Compare LLMs across long context tests. Managing Member @ DeX Group LLC VP of Eng @ Qudos Technologies Plus others.
Chris Painter @ChrisPainterYup
2K Followers 1K Following head of policy @METR_Evals | evals accelerationist, working hard on responsible scaling policies