“When only a few have the resources to build and benefit from AI, we leave the rest of the world waiting at the door,” said @StanfordHAI Senior Fellow @YejinChoinka during her address to the @UN Security Council. Read her full speech here: hai.stanford.edu/policy/yejin-c…
Today, we’re releasing Power Retention, a new architecture beyond Transformers.
It enables LLMs to handle millions of tokens efficiently, unlocking long-context applications that were too costly before.
manifestai.com/articles/relea…
My first month at @cursor_ai, I helped launch Bugbot. We hit $10M ARR in 30 days with a two-person team.
I’m now hiring founding engineers to help scale Bugbot from $XXM to $XXXM ARR.
We are looking for high ownership individuals who consistently do what it takes to win. You’ll…
There are 70+ "reasoning" papers accepted at COLM 2025 (Oct 7-10, Montreal). Most papers elicit long reasoning for different tasks or understand the reasoning abilities/limitations of LLMs.
I wrote a blog post covering ~30 of those papers 👇
If you are attending COLM and are interested in full-time roles, internships, or events with our sponsors, you can upload your resume here:
forms.gle/n953Dp2UYQzKtq…
We've trained a new Tab model that is now the default in Cursor.
This model makes 21% fewer suggestions than the previous model while having a 28% higher accept rate for the suggestions it makes.
Learn more about how we improved Tab with online RL.
COLM is coming up! Very excited. I'm starting to figure out two things:
1. A small invite-only dinner for Interconnects AI (Ai2 event news later).
2. Various research chats and catchups.
Fill out the form below or email me if you're interested :) 🍁🇨🇦
Reminder to go watch this video from @keyonV. He does a great job explaining this research area in a short period of time. Even if you're not into this topic, the methodological / proof challenges (does a blackbox have a model?) are quite interesting.
youtube.com/watch?v=hguIUm…
78K Followers 2K Following · a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
46K Followers 1K Following · (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
50K Followers 3K Following · AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
65K Followers 1K Following · Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
50K Followers 9K Following · I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
46K Followers 1K Following · Writer https://t.co/TquuQXlLOJ. O'Reilly Author https://t.co/Fl3uPAZHLg. LLM Builder @Cohere. Visualizing AI one concept at a time.
39K Followers 995 Following · Creator of bitsandbytes. Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
560 Followers 780 Following · Seasoned Software Engineer with deep interests in Distributed Systems, Blockchains, Payment Systems, Crypto Regulation, AGI and LLMs, Deep Space Exploration.
391 Followers 1K Following · https://t.co/0Wy5ug9dxp 7377332953
Organic non-llm AI in testing. 80 percent less hardware requirements. cpu not gpu brain sim for research
3K Followers 1K Following · Visiting Scientist at Schmidt Sciences. Visiting Researcher at the Stanford NLP Group
Previously: Anthropic, AI2, Google, Meta, UNC Chapel Hill
20K Followers 9K Following · Programme Director @ARIA_research | accelerate mathematical modelling with AI and categorical systems theory » build safe transformative AI » cancel heat death
5K Followers 826 Following · Postdoctoral fellow at @Harvard_Data | Former computer science PhD with @Blei_Lab at @Columbia University | Researching AI + world models
576 Followers 410 Following · Principal RS at IBM Research AI. Speech, Formal/Natural Language Processing. Currently LLM post-training, structured SDG/RL. Opinions my own and non stationary
18K Followers 4K Following · Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
18K Followers 20 Following · A high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
14K Followers 3K Following · research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
4K Followers 2K Following · Machine Learning, Kaggle and occasional pictures from Poland. LLM/AI Research at Snowflake. 4x Kaggle Grandmaster. Personal stuff only.