Goliath @zero_goliath
RL research engineering intern @runrl_com, @uwaterloo cs; formerly @ritserlabs SF until Jan 2026
Joined March 2022
Tweets 191
Followers 607
Following 518
Likes 3K
continual learning is a proxy for sample efficiency which is a proxy for long context
check out my terminal-bench env and DM me if you have feedback!
if you build good evals for the similarity of an AI-generated app/website to its real counterpart, you can RL llms to be better at creating economically valuable RL environments
i think given recent LLM RL progress, it's worth looking into scaling laws for RL out-of-distribution generalization again lesswrong.com/posts/65qmEJHD… especially this > a meaningful measure of distance between tasks, which is difficult and could deserve a project on its own
here are some interesting blog posts from the past few months on scaling RL. (credit to @yong_zhengxin for linking to three others) 1. ysymyth.github.io/The-Second-Hal… 2. yidingjiang.github.io/blog/post/expl… 3. kevinlu.ai/the-only-impor… 4. yongzx.substack.com/p/rl-vs-next-t… 5. blog.jxmo.io/p/how-to-scale…
there'll be a lot more research/tooling in the next 12 months on LLMs for long-horizon/realistic tasks. along the lines of - augmented PRMs - prioritized experience replay (+ envs that support snapshotting) - world-model interpretability most will be unpublished at private labs
it's nice to read old tweets and see replies that i know for sure aren't AI slop
feel free to DM me about suggestions/feedback
what if u spot where the LLM messes up, roll back, and resample from that point to generate a new rollout. less wasted decoding on obvious tokens. u could even train a model to identify the resampling point using execution traces as priors
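a minimal sketch of the rollback-and-resample idea above, under loud assumptions: `sample_next` stands in for one LLM decoding step and `find_error` stands in for whatever detector locates the first bad token (a heuristic, or a trained model fed execution traces). both names and the whole interface are hypothetical, not from any real library.

```python
import random
from typing import Callable, List, Optional

Token = str


def resample_from_error(
    prompt: List[Token],
    sample_next: Callable[[List[Token], random.Random], Token],  # hypothetical decoder step
    find_error: Callable[[List[Token]], Optional[int]],  # hypothetical error locator
    max_len: int,
    max_retries: int = 4,
    seed: int = 0,
) -> List[Token]:
    """Decode a rollout; on a detected mistake, keep the prefix and resample the rest."""
    rng = random.Random(seed)
    rollout: List[Token] = []
    for attempt in range(max_retries + 1):
        # Continue decoding from whatever prefix survived the last rollback.
        while len(rollout) < max_len:
            rollout.append(sample_next(prompt + rollout, rng))
        bad = find_error(rollout)
        if bad is None:
            break  # no mistake detected, accept this rollout
        if attempt < max_retries:
            # Roll back to just before the first mistake; the tokens
            # before it are reused instead of being decoded again.
            rollout = rollout[:bad]
    return rollout
```

the saving comes from the kept prefix: if the mistake sits at index k of an n-token rollout, the retry decodes n - k new tokens instead of n. if every retry fails, this sketch just returns the last full rollout.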

alphakΞY @alphaK3Y
3K Followers 2K Following writing smart contracts to connect worlds | contributor @hypurrfi
Aloisia @Dy60wm6xcwQ40s
20 Followers 740 Following Don’t wait for someone to save you. Save yourself.
Gealorv @Gealorv25998
37 Followers 1K Following
Louise @rq75yHKOI72w1
30 Followers 1K Following Success isn’t about being the best. It’s about always getting better.
😐 @faceplainemoji
36 Followers 2K Following
Alexander Doria @Dorialexander
19K Followers 4K Following Reasoning models to come. Co-founder @pleiasfr
Taishi Nakamura @Setuna7777_2
2K Followers 6K Following Working on scalable and efficient LLM (MoE pretraining, RL, reasoning). CS MS at @sciencetokyo_en Intern @SakanaAILabs
Dan Saunders @djsaunde
498 Followers 2K Following mle @axolotl_ai making OSS LM training tools. prev @awscloud, startups, research in SF 10/21 - 10/25
Anna @AnushkaDeshpan8
36 Followers 1K Following
Jay @jayendra_ram
2K Followers 923 Following founder @hud_evals, prev cs+physics @columbia, @ycombinator
learning @Learning______1
63 Followers 4K Following
Noah Vandal @noah_vandal
889 Followers 3K Following Born again Christian! BS | MS @NDSU https://t.co/wRn7mSM73s Building @SpeechSage Passion for biomedical arenas Love a good debate, but only if there is purpose
HermosaStowe @9u47ih7057cAc
37 Followers 2K Following
Johannes Hagemann @johannes_hage
8K Followers 2K Following co-founder/cto @PrimeIntellect | open superintelligence infra, longevity, techno-optimism
Deping Zhang @joebradly
82 Followers 4K Following
Srauwawd @Srauwawd841
36 Followers 1K Following
Doug @cisterciansis
602 Followers 2K Following node juggler, qualia dealer, nueron wrangler, magic (🌳,🌳)
Anna @waelchi72290
112 Followers 5K Following
Mor Zusman @MorZusman
28 Followers 646 Following
Marie @FreedaNola58575
62 Followers 3K Following
Stefan Boesen @stefanboesen
334 Followers 1K Following AI for security, securing AI. tweets my own. Currently @amazon Previously @anvil_secure, @ioactive, @dartmouth
Mike A. Merrill @Mike_A_Merrill
667 Followers 307 Following Postdoc @StanfordAILab Building https://t.co/KWJvsMlWva with @alexgshaw and many others Go Bills
Web3noob @Web3noob101
17 Followers 54 Following
Mahesh Sathiamoorthy @madiator
14K Followers 1K Following RL Environment Curation. Data Curation (e.g. OpenThoughts). Post-training. CEO @bespokelabsai. Ex-GoogleDeepMind.
Social Use @socialuseai
255K Followers 9K Following Where Social meets AI: Exploring the future of connected intelligence
Teknium (e/λ) @Teknium1
50K Followers 5K Following Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE
Subramanyam Sahoo @iamwsubramanyam
198 Followers 4K Following Independent AI Safety researcher, M. Tech x Summa Cum Laude @NITHamirpurHP. BASIS Fellow @UCBerkeley, RA @HarvardAISafety. Get Published or Die Trying.
Harsha Bandi @harshadev12
142 Followers 4K Following AI Explorer | Web Developer | Software Engineer
Hillary Uzoh @UzohHillary
483 Followers 7K Following 🔍 Data Scientist | Data Analyst 📈 Python | SQL | Power BI | Machine Learning | AI 🔗 https://t.co/TOM7nTDmXq Portfolio: https://t.co/7cQtfTpKym
찌 G 跻 じ MBA, CF... @DegenSpartan
271K Followers 4K Following Former Degenerate Spartan Private Crypto Fund Manager Quoted in CoinDesk & Cointelegraph Psyops Special Forces Reformed Hentai Addict Reinstated @egirl_capital
alphakΞY @alphaK3Y
3K Followers 2K Following writing smart contracts to connect worlds | contributor @hypurrfi
Siddarth Venkatraman @siddarthv66
610 Followers 476 Following PhD at Mila | RL and other stuff I find interesting
doomslide @doomslide
11K Followers 921 Following unprecedented times call for unprecedented bullshit
Toby Ord @tobyordoxford
26K Followers 154 Following Senior Researcher at Oxford University. Author — The Precipice: Existential Risk and the Future of Humanity.
Clive Chan @itsclivetime
11K Followers 3K Following intelligence per picojoule @openai / prev led dojo workload @tesla
internetVin @internetvin
6K Followers 1K Following saving Canada from itself, I make software and films and other things too, founded @newsystems_, creator of @otherstuffpod
Nat McAleese @__nmca__
15K Followers 358 Following Research @AnthropicAI. Previously @OpenAI, @DeepMind. Views my own.
Synth AI @UseSynth
109 Followers 93 Following Agent RL as a Service Growing the GDP of automation software
yung macro 宏观年... @apralky
27K Followers 526 Following rates trader -- gen z state capitalism. not financial advice
John Schulman @johnschulman2
65K Followers 1K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Lucas @edchucation
8K Followers 6K Following
ivan @IvanVendrov
9K Followers 937 Following solve cooperation, use it to solve everything else. collective intelligence research @ midjourney. longer essays at https://t.co/5w7LaGotVT
Taishi Nakamura @Setuna7777_2
2K Followers 6K Following Working on scalable and efficient LLM (MoE pretraining, RL, reasoning). CS MS at @sciencetokyo_en Intern @SakanaAILabs
Dan Saunders @djsaunde
498 Followers 2K Following mle @axolotl_ai making OSS LM training tools. prev @awscloud, startups, research in SF 10/21 - 10/25
Jay @jayendra_ram
2K Followers 923 Following founder @hud_evals, prev cs+physics @columbia, @ycombinator
Mike A. Merrill @Mike_A_Merrill
667 Followers 307 Following Postdoc @StanfordAILab Building https://t.co/KWJvsMlWva with @alexgshaw and many others Go Bills
semih @semiozz
248 Followers 332 Following dist-sys and machine learning enjoyer - trying not to be fooled by randomness
Prime Intellect @PrimeIntellect
48K Followers 28 Following find compute. train models. contribute to open superintelligence. https://t.co/ZRZOsRRbwr
Mahesh Sathiamoorthy @madiator
14K Followers 1K Following RL Environment Curation. Data Curation (e.g. OpenThoughts). Post-training. CEO @bespokelabsai. Ex-GoogleDeepMind.
Vincent Weisser @vincentweisser
24K Followers 4K Following @primeintellect ceo / open superintelligence & infra / automating ai & science
jellybean ❄️ @jdchawla29
691 Followers 329 Following aspiring clown l shitpost launderer | watching patterns unfold into agi | 24 l jiggery-pokerying... @hud_evals
hud @hud_evals
1K Followers 6 Following RL environments + evals for agents | @ycombinator | we're hiring!
bilal @bilaltwovec
3K Followers 857 Following ✨ ai for science. tortured attention layers department
Wilson Lin @wilsonzlin
3K Followers 1 Following
zon 🪢 @ItsAlwaysZonny
19K Followers 3K Following cofounder @initia | helping teams build apps with opinionated infra
jia @jia_seed
16K Followers 4K Following 21 · 23x hackathon win chief exec grammar police @usesorcerer https://t.co/UeDXP98Nbm NO TOKEN -prev. sprint - $48k, 20k user -drop SWE @disney, 160k streams @spotify
weisser @julianweisser
25K Followers 4K Following Founder/CEO building for those who are @solofounding. | Championing builders at @joinodf (find co-founders), @mergedotclub (microgrants), and @builderswhorun.
Danielle Strachman ... @DStrachman
22K Followers 4K Following Bringing freedom & autonomy to young people. Built @thielfellowship and @1517fund. Investor in @luminartech @lambdaAPI @loom @Mach_Industries @RainmakerCorp
Erik Torenberg @eriktorenberg
140K Followers 4K Following General Partner @a16z. Seed investor in Scale AI, Applied Intuition, Pave, Lattice, Rappi
Sichu Lu(Sichu.Lu218@... @lu_sichu
3K Followers 6K Following nlab fan account, arxiv surveyor, pubmed enjoyer, two culture bridger, vacuous high gossiper, dearth of any domain expertise, reluctant g theorist, gpu poor,
Clayton Thorrez @cthorrez
1K Followers 2K Following Rating systems and paired comparison experimentation enjoyer @lmarena_ai Previous: ML @umich @umass @microsoft @apple
Andreas Köpf @neurosp1ke
7K Followers 552 Following Exploring ways to algorithmically model our world.
Bryan Johnson @bryan_johnson
652K Followers 765 Following Conquering death will be humanity’s greatest achievement.
Ryan Greenblatt @RyanPGreenblatt
6K Followers 4 Following Chief scientist at Redwood Research (@redwood_ai), focused on technical AI safety research to reduce risks from rogue AIs
Dmitry Rybin @DmitryRybin1
2K Followers 152 Following ML PhD at CUHK, BSc. Math HSE || ML for Math, Algorithm Discovery || Grand First Prize at IMC