Chip Huyen @chipro
Data processing on GPUs @VoltronData Designing ML Systems: https://t.co/G81hL2dWmr @designmlsys #AI x #GPU huyenchip.com San Francisco, CA Joined June 2008-
Tweets513
-
Followers91K
-
Following444
-
Likes7K
I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: - The new AI stack (e.g. how it differs from…
Excited to show what our team has been working on over the last 2.5 years: Theseus, our GPU-native query engine! This benchmark compares data queries of different scales -- 10TB, 30TB, and 100TB -- on Spark (run on CPUs) and Theseus (run on GPUs). Moving the same queries from…
Problems I'd do if I'm to do a startup again (though I probably won't any time soon because startups are hard). If you’re solving any of them, I’d love to chat. 1. Data synthesis: AI has become really good both at generating and annotating data. The challenge now is to make sure…
Claypot AI is joining Voltron Data! AI starts from data. By joining forces, we can further help companies leverage both batch and real-time data for AI applications, on top of Voltron Data’s GPU-native distributed engine Theseus. venturebeat.com/data-infrastru… For AI, GPUs are mostly…
New post: Sampling for Text Generation huyenchip.com/2024/01/16/sam… Many challenges (and opportunities) in working with AI today stem from the way models sample their outputs. This post covers: 1. Sampling strategies and variables including temperature, top-k, and top-p. 2. How…
Summary of Gemini's 60-page technical report. 1. Written in Jax and trained using TPUs. The architecture, while not explained in details, seems similar to Flamigo's. 2. Gemini Pro's performance is similar to GPT-3.5 and Gemini Ultra is reported to be better than GPT-4. Nano-1…
New blog post: Multimodality and Large Multimodal Models (LMMs) Being able to work with data of different modalities -- e.g. text, images, videos, audio, etc. -- is essential for AI to operate in the real world. This post covers multimodal systems in general, including Large…
Open challenges in LLM research The first two challenges, hallucinations and context learning, are probably the most talked about today. I’m the most excited about 3 (multimodality), 5 (new architecture), and 6 (GPU alternatives). Number 5 and number 6, new architectures and…
I had so much fun preparing this talk. Per request, here are the slides: huyenchip.com/2023/06/07/gen… The idea came from many conversations I’ve had recently with friends who need to figure out their generative AI strategy. I’d love to hear about your experience through this process.
New post: RLHF - Reinforcement Learning from Human Feedback Discussing 3 phases of ChatGPT development, where RLHF fits in, how RLHF works, hypotheses on why it works, and relationship between RLHF and hallucination. huyenchip.com/2023/05/02/rlh…
Many companies seem to want their own in-house LLMs: finetune an open-source LLM on their own data. Here are a few reasons for and against in-house LLMs I can think of. Would love to hear your thoughts.
Sebastian Raschka @rasbt
266K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Mark Tenenholtz @marktenenholtz
114K Followers 544 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.Chris Albon @chrisalbon
86K Followers 2K Following Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.vicki @vboykis
52K Followers 1K Following Born: USSR. Raised: USA. ML Eng @mozillaai Ex: @duosec @Tumblr, @automattic Nights: 👦 & 👧 working on some ✨ new vectors ✨Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordSoumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.abhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers sometimes. RTs != endorsementsShreya Shankar @sh_reya
39K Followers 589 Following I study ML & AI engineers and try to make their lives a little better. PhD-ing in databases & HCI @Berkeley_EECS @UCBEPIC and MLOps-ing around town. She/they.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingErik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pTuringPost @TheTuringPost
62K Followers 16K Following Newsletter exploring AI & ML - Weekly trends - LLM/FM insights - Unicorn spotlights - Global dynamics - History Led by @kseniase_ Elevate your AI game 👇🏼Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Radek Osmulski 🇺�.. @radekosmulski
25K Followers 554 Following Resources to take your Machine Learning skills to the next level 🧪 Senior Data Scientist, RecSys @NVIDIAAI 🏫 @fastdotai trained DL Eng 📝 https://t.co/By87iXx5PuRichard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindSanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @KaggleHorace He @cHHillee
23K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRElectronicsseeker @libertarian108
6 Followers 903 FollowingTejal Singh @tejal567
23 Followers 92 Following ML engineer @ Agoda, IIT Roorkee grad. Passionate about making the world a better place, one algorithm at a time. When not working - 🏓⚽️🎬📚🧘♂️Marie Mortensen @MarieMorte90790
0 Followers 6 FollowingSaad @Sa4d_k1
3K Followers 303 Following AI Engineer Expert | Data Scientist | @MiskKSA Mentor @Udacity SL Interested in AI R&D اتحدث عن تطورات الذكاء الاصطناعي واقدم نصائح بالمجال - ادعم حديثين التخرجElan Eins @EinsElan51744
7 Followers 173 Followingdev.parth @dev_parth01
22 Followers 46 Following Coder 👨🏽💻 | I'm here to share my learning journey with you...🧒🏻🦇 | Keep Supporting 🥷 #CONNECTKawasemi @Kawasem12902519
1 Followers 111 FollowingGautham Koorma @_gthmk
39 Followers 253 Following Ex-Consultant, Grad Student at Berkeley School of Information, DS/AI/MLMaged Mostafa @MagedMo10513523
16 Followers 304 FollowingBrian McFadden @bmcfads
56 Followers 116 Following Edge AI/ML Engineer @EdgeImpulse @UBCOgradstudies @isdprl 😎🤘🏻💛Harshit @hokageharshit
1 Followers 80 Followingdylan cleckler @dylancleckler5
133 Followers 295 FollowingTheUnlovedBogymanHasA.. @TheBogymanNovel
653 Followers 4K Following The Unloved Bogyman is a gritty, provocative novel based on an experiment based in the 1950’s. a Twilight Zone story with darker twists. this is a new feed.T. Parkash Sachdeva @pruthpark
4 Followers 40 Following learning, talking, and building energy efficient ai/ml !!a002 @t70582
11 Followers 90 FollowingKieran Brennan @Kieran_Brennan
9 Followers 71 Following0n Cub3B1T @0nCub3b1t
172 Followers 751 Following #KEEPBUILDING #curiousminds about #blockchain, #web3, #AI, #finance #TradeFi #DeFi #GameFi https://t.co/P1d1Om54PY https://t.co/G8mTbQU6ukTrunkboy PeeZ @P10895Peez
4 Followers 204 FollowingDavid Rosenberg @drosen
1K Followers 659 Following Head of ML Strategy, CTO Office, Bloomberg; Former Adjunct Assoc Prof at NYU Center for Data Science (2015-2021); Toronto based; views expressed here are my ownRaj Pabnani @waiting4AGI_
73 Followers 1K Following "Salesforce Engineer 🚀 | Transforming clouds with code ☁️ | AGI enthusiast exploring the future of intelligence 🤖 | #Trailblazer #AGI #CodeArtist"Urubu @mengo___
66 Followers 625 FollowingJão @spacejao
960 Followers 1K Following BA in Economics at @UFJF_ | MSc in Economics at @pimes_em | Just my personal opinionsKepa Sarasola @ksb3ksb
144 Followers 889 Following@lopezrbn @lopezrbn
0 Followers 1K FollowingSamrat Man Singh @samratmansingh
341 Followers 345 Following Building https://t.co/8w6ud2pmOu Software dev in Berlin, originally from Nepal 🇳🇵. Into: 🧗 Climbing and boulderingrequiemDeVerdi @requiemDeVerdi
1 Followers 20 FollowingNegative BackGround -.. @ngtv_bg
3K Followers 2K Following apenas um engenheiro elitista palestrinha construindo ciência entre 300 MHz e 300 GHz, e mexendo com umas tabelonas para ver o que sai do outro lado.Franco @francocontigo
195 Followers 801 Following Python & Dados | Autista & TDAH | 24y | Sistemas de Informação em @ifsc 6/8mo li @moli820497
5 Followers 118 FollowingNarciso Albarracin @nalbarr3
8 Followers 51 Following Passionate about building products that empower humans with AI to make better decisions.Stephen Bonifacio @Stepanogil
306 Followers 520 Following Building Enterprise AI apps @ #JGSummit | In my manic #LLM era | Exercise fiend | Views are my own | System 2 alt: https://t.co/mWm9JhtcrnLuca @luca_nijo35613
0 Followers 47 FollowingwCRWS @wolfCuanhamaRWS
31 Followers 278 Following@padbrogram @padbrogram
1 Followers 17 FollowingSiritas S 💻 ☕ �.. @dahoba
338 Followers 933 Following 💻 code web and mobile app. drink lot of coffee ☕ watch netflix 🎧Hakan @Hakan94911329
0 Followers 147 FollowingUeder Cardoso 🇧�.. @uedercardoso
2 Followers 19 Following I'm building new things to impact the world. My target is a one-person unicorn. Founder and CEO at OneSebastian Raschka @rasbt
266K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Mark Tenenholtz @marktenenholtz
114K Followers 544 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.Chris Albon @chrisalbon
86K Followers 2K Following Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.vicki @vboykis
52K Followers 1K Following Born: USSR. Raised: USA. ML Eng @mozillaai Ex: @duosec @Tumblr, @automattic Nights: 👦 & 👧 working on some ✨ new vectors ✨Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordSoumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.abhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Shreya Shankar @sh_reya
39K Followers 589 Following I study ML & AI engineers and try to make their lives a little better. PhD-ing in databases & HCI @Berkeley_EECS @UCBEPIC and MLOps-ing around town. She/they.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingErik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pLucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Richard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindSanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @KaggleHorace He @cHHillee
23K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRHamel Husain @HamelHusain
23K Followers 2K Following Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb, @DataRobot. @fastdotai core contributor.Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueJeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)Sang Michael Xie @sangmichaelxie
3K Followers 709 Following PhD student @StanfordAILab @StanfordNLP @Stanford advised by Percy Liang and Tengyu Ma. Prev: visiting @GoogleAI Brain, BS, MS Stanford ‘17Alignment Lab AI @alignment_lab
11K Followers 3K Following Devoted to addressing alignment. We develop state of the art open sourced AI. https://t.co/6aJDLUvuU5Irwan Bello @IrwanBello
6K Followers 2K Following Supercomputers & Friends AGI research & products ex @OpenAI, founding team @character_aiAlex Reibman 🖇️ @AlexReibman
23K Followers 803 Following Accelerating @agentopsai @foomvc Agents, ML, math, and data viz. Hack reporter🕶️Rohan Pandey (e/acc) @khoomeik
3K Followers 1K Following multimodal codegen @ReworkdAI (YC S23+AIG3) || prev research @Microsoft + @CarnegieMellon '23 || 10x hackathon winner || living @AGIHouseSFNoam Brown @polynoamial
34K Followers 611 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus, the first superhuman no-limit poker AIs | Co-created CICERO | PhD from @SCSatCMUAman Sanger @amanrsanger
15K Followers 655 Following building @cursor_ai at @anysphere https://t.co/EdcQJ2dv0J | https://t.co/vJ5zNuT6WOLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Arthur Mensch @arthurmensch
40K Followers 872 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxNathan Lambert @natolambert
25K Followers 689 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsYohei @yoheinakajima
71K Followers 8K Following VC by day, builder by night: @untappedvc, @babyagi_, @pixelbeastsnft. Build-in-public log: https://t.co/UdHHGbZba5Joshua Starmer @joshuastarmer
23K Followers 251 Following I make StatQuest videos and sing more than I should. Contact: https://t.co/ZvHk9UJ0TnJascha Sohl-Dickstein @jaschasd
19K Followers 623 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.Ishaan Gulrajani @__ishaan
3K Followers 473 Following Hi! I’m a machine learning researcher @openai. Previously @stanford @facebook @google @mila_quebecAkshay 🚀 @akshay_pachaar
135K Followers 417 Following Simplifying LLMs, MLOps, Python & Machine Learning for you! • AI Engineering @LightningAI • Lead DataScientist • BITS Pilani • 3 PatentsTri Dao @tri_dao
18K Followers 364 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Tengyu Ma @tengyuma
25K Followers 512 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.Abubakar Abid @abidlabs
12K Followers 1K Following Hind Rajab. 5 yrs old. She + 14,000 children killed by Israeli forces. PLEASE don't be silent. Take 5 min to call your reps and urge peace (link in bio)Alex Xu @alexxubyte
228K Followers 387 Following Co-Founder of ByteByteGo | Author of the bestselling book series: ‘System Design Interview’ | YouTube: https://t.co/9gPSJSrtPUYi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Charity Majors @mipsytipsy
81K Followers 508 Following cofounder/CTO @honeycombio, co-author of Observability Engineering and Database Reliability Engineering. I test in production and so do you. 🐝🏳️🌈🦄Voltron Data @VoltronData
4K Followers 26 Following We offer a new way to design and build composable data systems based on open source standards.evan conrad @evanjconrad
7K Followers 1K Following Happy optimistic doodler. One of the founders of @sfcompute Also made https://t.co/ZPLberpvxD.Sherwin Wu @sherwinwu
15K Followers 517 Following Building the @OpenAI API – GPT-4, DALL·E, Whisper, TTS, Fine-Tuning, and more.Alex Gajewski @apagajewski
2K Followers 743 Following making AI markets efficient @sfcompute, prev founder @metaphorsystemsStas Bekman @StasBekman
7K Followers 268 Following Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at @ContextualAI Training LLM/RAG/Generative AI/Machine Learning/Scalability@levelsio @levelsio
417K Followers 1K Following 🦄https://t.co/sQ0aiU7v02 $202K/m 💆https://t.co/AoNP9BW2Dp $2K/m ✨https://t.co/BmbkrX4Zyf $0.1K/m 📸https://t.co/lAyoqmSBRX $57K/m 🖼https://t.co/1oqUgfD6CZ $44K/m 🌍https://t.co/BjTozWAXwG $27K/m 🛰https://t.co/ZHSvI2wjyW $51K/mHassan @nutlope
74K Followers 947 Following Developer Relations @togethercompute. Building AI apps like @roomGPT and https://t.co/3NFbnMUHJP. Tweeting about AI, web dev, and my side projects.killian @hellokillian
23K Followers 437 Following building a universal interface between language models and computers ● https://t.co/yJVGuC0xlDZhuohan Li @zhuohan123
3K Followers 685 Following CS PhD Student 👨🏻💻 @ UC Berkeley 🌁 🤖️ Machine Learning SystemsYing Sheng @ying11231
4K Followers 485 Following PhD student @Stanford. Large Language Models and Programs. | Do it anywayEric Xing @ericxing
5K Followers 18 Following Researcher, educator, entrepreneur, and administrator in computer science, artificial intelligence, and healthcare.Wing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.Georgi Gerganov @ggerganov
38K Followers 243 Following Not AI | 0x0e59 0x2550 24th at the Electrica puzzle challengeMckay Wrigley @mckaywrigley
147K Followers 439 Following I make AI stuff. Teaching AI skills @TakeoffAI, building codegen tools @CodewandAI, open source AI chat @ChatbotUI. Investing in AI startups.@𝗸𝗼𝘁𝗼 ⭐.. @koto9x
43K Followers 2K Following ⾀ Alchemist ⚗️ 🎨 // @grimezsz // https://t.co/RpEVpzcN9G // @grimes_v1Teknium (e/λ) @Teknium1
29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github SponsorsRobert Scoble @Scobleizer
504K Followers 69K Following Follow me on my new podcast with AI startups, Unaligned. Tech industry color commentator since 1993. Author/Blogger. Former strategist @Microsoft.Convai @convaitech
1K Followers 24 Following Enable your characters with human-like conversation capabilities in games and virtual world applications.Luke Dicken @LukeD
3K Followers 1K Following Head of AI @Zynga | Former Chair @IGDAFoundation | Doctor of AI from before it was a techbro hellscape. Views are mine, mostly (he/him)Greg Yang @TheGregYang
53K Followers 661 Following Cofounder https://t.co/SpHbO7FZNV. Morgan Prize Honorable Mention 2018. Developing the theory of #TensorPrograms and the practice of scaling #neuralnetworks.Robert Nishihara @robertnishihara
6K Followers 623 Following Co-founder and CEO @anyscalecompute. Co-creator of @raydistributed. Previously PhD ML at Berkeley.Igor Babuschkin @ibab
44K Followers 682 Following Maybe the real AGI was the friends we made along the way. @xAILuke Zettlemoyer @LukeZettlemoyer
8K Followers 2K FollowingIt's a great day to learn about @IbisData! Listen to @cpcloudy, Principal Engineer at @VoltronData and lead maintainer of the Ibis Project, speak with @digiglean on the @realpython's recent podcast episode "Decoupling Systems to Get Closer to the Data" > buff.ly/3JsfDMP
Hey everyone, we are starting a hacker house in the Arena in the mission on June 1st. Our downstairs unit also moved out, so we will now have the whole house! 4B3B and 2B2B. Let me know if you know anyone who is interested. Personally want a creative hacker community. I need to…
Congratuations! The Chinese version will be published by Turing Company which I co-founded. 图灵要出中文版。
I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: - The new AI stack (e.g. how it differs from…
Can’t wait!
I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: - The new AI stack (e.g. how it differs from…
@chipro I was just about to buy your last book! You're on fire :)
SoTA LLMs typically exhibit 99%+ non-zero activations, but it turns out that they are still intrinsically quite sparse! We introduce CATS, a simple post-training technique that achieves 50% activation sparsity for MLP layers with almost no drop in downstream evals, while…
Keep crushing it Chip!
I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: - The new AI stack (e.g. how it differs from…
@chipro Making us all look lazy again
@chipro Looking forward to this… teaching a class on LLMOps spring 2025 and looking for good reference materials!
Published in Nature Machine Intelligence today, our new article explores the trade-offs of personalised alignment in large language models ⚖️ Personalisation has potential to democratise decisions over how LLMs behave, but brings its own set of risks... nature.com/articles/s4225…