Really enjoyed @samsja19’s talk on the challenges of decentralized training (e.g. DiLoCo) under low-bandwidth conditions. Was surprised to learn how much weather can destabilize training 🤯
@PrimeIntellect is doing some wild stuff with decentralized RL! 🚀
Thanks for the…
too much new learning material! we're releasing a few chapters of hard study on post training AI models. it covers all major aspects plus more to come.
- Evaluating Large Language models on benchmarks and custom use cases
- Preference Alignment with DPO
- Fine tuning Vision…
hi! if you’re interested in using or writing mega kernels for AI (one big GPU kernel for an entire model) you should tune in to today’s @GPU_MODE livestream
today in ~3 hours we have the authors of MPK talking about their awesome new compiler for mega kernels!
see you there :)
I was lucky to work in both China and the US LLM labs, and I've been thinking this for a while. The current values of pretraining are indeed different:
US labs be like:
- lots of GPUs and much larger flops run
- Treating stabilities more seriously, and could not tolerate spikes…
I was lucky to work in both China and the US LLM labs, and I've been thinking this for a while. The current values of pretraining are indeed different:
US labs be like:
- lots of GPUs and much larger flops run
- Treating stabilities more seriously, and could not tolerate spikes…
Just had the most amazing Transformers (with flash attention) lecture from @danielhanchen — he broke down the guts of Transformers and walked us through the full backprop step-by-step, all by hand.
Huge thanks to @TheZachMueller for organizing!
DO NOT buy a gpu to write kernels. use @modal notebooks. take 2 mins out of your day to learn this simple trick and kick off your work without paying a shit ton for electricity or cloud gpu run 24/7
🚨 career update
i’ve joined @bulletxyz_ to build the growth engine driving the next million on-chain traders.
excited to build a @solana native trading layer that brings CEX performance fully on-chain.
more ↓
@dhh I get asked the same about terminals all the time. “How will you turn this into a business? What’s the monetization strategy?” The monetization strategy is that my bank account has 3 commas mate.
12 Followers 71 FollowingThe Mercury Protocol serves as a cutting-edge DEX aggregator designed to deliver the fastest, most efficient, and optimal trading experience in DeFi.
61 Followers 227 FollowingFounder at https://t.co/UfkNBLEkbc -- Working on saving the construction industry in the U.S., one business at a time. Follow for takes.
749 Followers 5K FollowingFounder of @unveilweb3 Host of #behindtheblockchain podcast Self-proclaimed elite #headhunter | 10+ years in the game | #Web3 #Recruitment - Link in bio
45K Followers 3K FollowingWe're in a race. It's not USA vs China but humans and AGIs vs ape power centralization.
@deepseek_ai stan #1, 2023–Deep Time
«C’est la guerre.» ®1
26K Followers 229 Followinggetting us to singularity with friends
computers can be understood: https://t.co/doHE1Qv2Sj
x @GoogleDeepMind @Microsoft
tensor core maximalist
161K Followers 315 FollowingI talk about issues long before they happen. Now and then in touch with Turiya. I post conspiracies and nothing I say is real. Don't believe anything I post.
1K Followers 85 FollowingChief AI Officer at Story Protocol. UT Austin Prof. Work on GenAI and Networked Intelligence. Stanford CS PhD. All views my own.
171K Followers 287 FollowingLearn from history's greatest entrepreneurs. Every week I read a biography of an entrepreneur and find ideas you can use in your work.
83K Followers 561 FollowingFilm director | AI Consultant | Partner with https://t.co/Vn9g3Z63CI Paris | Sharing practical ways to use AI for you and your business. All views are my own.
372K Followers 94 FollowingBest account to cleanse your timeline with positivity, humour and wholesome content | DM for credit or removal | Turn on notifications 🔔
21K Followers 465 Followingphysics of language models @ Meta (FAIR, not GenAI, not TBD)
🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS
🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
26K Followers 1K FollowingGenAI @Youtube | Building AI powered video editing | ex : @Google Search & @Microsoft Azure | 3x hackathon winner | Views my own