We're working on a new LM architecture that does not use any variant of multi-head attention or recurrence, and it works well with long context lengths. We're calling it "Avey". Everything is open-sourced under a Apache-2.0 license.
Paper: arxiv.org/abs/2506.11305
Demo Models:…
🚨New Preprint!!
Thrilled to share with you our latest work: “Mixture of Cognitive Reasoners”, a modular transformer architecture inspired by the brain’s functional networks: language, logic, social reasoning, and world knowledge.
1/ 🧵👇
(1/n) Since its publication in 2017, PPO has essentially become synonymous with RL. Today, we are excited to provide you with a better alternative - EPO.
SEAL: a framework that lets LLMs write their own updates solves 72.5% of Arc-Agi tasks, up from 0%
according to the research paper:
it’s a method that helps llms update themselves based on new tasks.
normally, llms stay the same once trained. but SEAL lets them:
– create…
SEAL: LLM That Writes Its Own Updates Solves 72.5% of ARC-AGI Tasks—Up from 0%
This is a breakthrough that is rarely seen and could open up undreamt-of possibilities. In the following, I will go into more detail and summarize this breakthrough:
Coded Llama 3.2 model from scratch and shared it on the HF Hub.
Why? I think 1B & 3B models are great for experimentation, and I wanted to share a clean, readable implementation for learning & research: huggingface.co/rasbt/llama-3.…
Hi @nextjs, I’m Rob. I’m a student at the University at Buffalo. I built:
- Re-implementation of React from scratch (reconciler, dom renderer, and core hooks) that went #6 on Hacker News rob.directory/blog/react-fro…
- Next.js collaborative algorithm playground w/ live code execution,…
50K Followers 64 FollowingStudent of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
21K Followers 269 FollowingPioneering the future of robotics since 1979. We’re transforming industries and everyday life through cutting-edge innovation and world-class education.
45K Followers 655 FollowingI share @framer tutorials & resources with you. Mastered Framer and I’ll show how you can do it too. Follow to learn everything about Framer.
87K Followers 194 FollowingBuilding beautiful things like Mojo🔥 and MAX @Modular, lifting the world of production AI/ML software into a new phase of innovation. We’re hiring! 🚀🧠
10K Followers 50 FollowingAn open-source declarative framework for building modular AI software. Programming—not prompting—LLMs via higher-level abstractions & optimizers.
564K Followers 135 FollowingFather of three, Creator of Ruby on Rails + Omarchy, Co-owner & CTO of 37signals, Shopify director, NYT best-selling author, and Le Mans 24h class-winner.
33K Followers 4K FollowingAuthor of Ace the Data Science Interview.
Free Book Preview 👇
https://t.co/1izgOFy1Kt
Founder of https://t.co/yyE4B5Ltpf (SQL Interview Prep)
Ex-Facebook
106K Followers 2K FollowingThe official feed of #UChicago—one of the world's leading research institutions.
Account managed by University Communications.
188 Followers 169 FollowingPh.D. from Tsinghua University, currently focusing on long context LLM (LongBench, LongAlign, LongWriter) and reasoning models (GLM-Z1, GLM-4.5)
1.7M Followers 725 FollowingOfficial account for Harvard University. Devoted to excellence in teaching, learning, and research, and to developing leaders who make a difference globally.
4K Followers 507 FollowingResearcher @OpenAI, core member of GPT image generation and member of Sora video generation. PhD @MITEECS. I do world models, RL, and robotics.
9K Followers 709 FollowingI make youtube vids on cool AI research /// AI papers newsletter https://t.co/Xn7GMDbQSd /// paper recap @TheAITimeline /// building @findmypapersAI
16K Followers 1K FollowingSenior Research Scientist - @google, Adjunct Faculty - @iitmadras, @iitbombay, Ex: @NICT_Publicity
Use of my tweets without permission ➡️ legal action
10K Followers 4K Followingsth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
No recent Favorites. New Favorites will appear here.