Those who build AI today control the future.
The dAGI Summit (part of Open Source AI Week by Linux Foundation) is for founders, researchers and builders driving the open-source and distributed AI that gives power back to people.
We are bringing together senior researchers and…
Pluralis is pulling off something really big: a decentralized AI training run where anyone can contribute compute + earn rewards.
This matters because it shows how AI can be built as an open network, not locked inside a few giant labs.
We’re excited because this aligns with our…
Pluralis is one of the most exciting projects for me in decentralized AI training.
Very proud they chose to launch permissionless and open-source from day one, unlike many grifters we have seen launch closed source in the name of "protection."
We're adding support to Pluralis…
Max's papers were the major reason I got convinced any of this was possible at all. I cannot emphasise enough how non-accepted/contrarian this whole line of research was. So you think you're missing something because all these other smart people dismiss it. But then Max's papers…
Last few months have been this, with the additional complexity of supporting multi-party training + running over a noisy fabric of decentralised compute we don't control. Required re-writing large portions of the distributed training stack. Not easy. Done now though.
People think about pretraining runs as single long monolithic loss curves but they're not like that even in the centralised case. You run stuff, move it through different data stages, go back and fork off a checkpoint, change some norm somewhere etc. etc.
Obsessing over the SWE-bench chart is one of the most mid-curve things I've ever seen. Take a second and absorb what's actually been achieved here. Understandable to not like OpenAI since they're destroying your whole identity, but don't pretend it's the chart that's the issue.
pretraining is an elegant science, done by mathematicians who sit in cold rooms writing optimization theory on blackboards, and engineers who have totally absorbed distributed systems of titanic scale
posttraining is hair raising cowboy research where people drinking a lot of diet coke…
Missing the point that it'll be a system whose behaviour is controlled by a few people, in companies that don't have the best track record when it comes to this kind of thing. It's fundamentally a level of power that's never existed before. It's that simple.
That's a wrap for ICML 2025. Incredible to watch the space go from "What are you talking about" to "That's impossible" to "Hmmm that's very interesting" in just over a year. @tha_ajanthan @hmdolatabadi
668 Followers 469 Following
My Island Homes is an innovative crypto community that allows co-ownership and access to luxury island real estate previously only available to the rich.
77 Followers 138 Following
Runs a tiny DePIN GPU farm. Believes that humanity's progress can come from AI. Loves computers & AI. 80s 🖥 Engineer dad. Leaving advice for my children.
188 Followers 988 Following
Space and sometimes not space
🇮🇪 // Mech Eng @tcddublin // Space Engineering @UCL // BD US Expansion Leanspace
Now working on indoor air quality
261 Followers 1K Following
Turned a 5-year failed initiative into a scalable multi-billion dollar digital curriculum 💥 #superminds #sustainablebusiness #humancentereddesign
19K Followers 100 Following
Member of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
8K Followers 180 Following
Large Model Systems Organization: Join our Slack: https://t.co/mSPNyKTLTS We developed SGLang https://t.co/jEqIJcGwGA, Chatbot Arena (now @lmarena_ai), and Vicuna!
14K Followers 15K Following
Austin Powered. Co-founder of OpenStack & OpenInfra Foundation. General Manager of AI & Infrastructure for the Linux Foundation. open source for fun & profit.
5K Followers 6K Following
CM0 for the New Internet | Brand Mentor to the C Suite. Prev. CoinFund, Paxos, Waze, Groupon etc. Paying it forward & playing it loud 🤘🏻
16K Followers 0 Following
Long-context, test-time compute, and e2e Reinforcement Learning to build a superhuman coding agent (that then builds the rest of AGI for us). Join us https://t.co/hGZKtUzsR3
146K Followers 141 Following
Working on a new terminal: Ghostty. 👻 Prev: founded @HashiCorp. Created Vagrant, Terraform, Vault, and others. Vision Jet Pilot. 👨‍✈️
39K Followers 994 Following
Creator of bitsandbytes. Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.