christopher e moody @chrisemoody
cofounder @ gumtap. built style shuffle at sfix, from zero to millions. I prototype ML products. ex-physicist @ ucsc & caltech. chrisemoody @ sigmoid social chrisemoody.carrd.co San Francisco Joined June 2009-
Tweets3K
-
Followers4K
-
Following1K
-
Likes3K
Introducing Generalised Contrastive Learning (GCL). We generalize the popular training method of CLIP to be better suited for search and recommendations. 🧵 Generalises CLIP: - Use any number of text and/or images to represent documents. - Better text understanding by having…
Today at @answerdotai we've got something new for you: FSDP/QDoRA. We've tested it with @AIatMeta Llama3 and the results blow away anything we've seen before. I believe that this combination is likely to create better task-specific models than anything else at any cost. 🧵
🚀Exciting News: The Lance columnar format was a game-changer in efficiently managing AI/ML workloads. But hold onto your hats because Lance v2 is here, and it's going to blow your mind! 🤯 blog.lancedb.com/lance-v2/
Our @NatRevPhys perspective article on neural operators and their ability to accelerate simulations and design is now out. rdcu.be/dD8BI @Nature 1. Neural operators learn mappings between functions, e.g. spatiotemporal processes and partial differential equations.…
Proud to present 🔍MagicLens: image retrieval models following open-ended instructions. open-vision-language.github.io/MagicLens/ 🌟Highlights of 🔍MagicLens: >🧐Novel Insights: Naturally occurring image pairs on the same web page contain diverse image relations (e.g., inside and outside views…
Wrote a quick blog post in anticipation of seeing my first total solar eclipse tomorrow: how to find all of them using Python: erikbern.com/2024/04/07/pre…
SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source! We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code github.com/princeton-nlp/…
Hear me out: a Perplexity / ChatGPT like experience but exclusively for shopping. Explore the whole internet's catalog, refine interactively, search by rough product description or screenshots of a product you want.
If you know Torch, I think you can code for GPU now with OpenAI's Triton language. We made some puzzles to help you rewire your brain. Starts easy, but gets quickly to fun modern models like FlashAttention and GPT-Q. Good luck! github.com/srush/Triton-P…
🏆 Exciting news! The first KDD Cup 2024 challenge has been announced! It's set to be incredible! Don't miss your chance to show off your skills and passion! Huge shoutout to @amazon for their support in making #KDD2024 a success! aicrowd.com/challenges/ama…
Personally, I am excited about the fact that we can use IVON to understand models by using their sensitivity to data. We have been working on this for about 3-4 years and I am very happy with these results. 5/6
For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training. Training LLMs from scratch currently requires huge…
For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training. Training LLMs from scratch currently requires huge…
I asked 🌪STORM to write an article on ColBERT. It produced a 3500-word article with 20 references! 📸 Screenshots: * Part of the Table of Contents * Intro * Funny but important section on how "ColBERT late interaction" is different from "Colbert's Late Show"
I asked 🌪STORM to write an article on ColBERT. It produced a 3500-word article with 20 references! 📸 Screenshots: * Part of the Table of Contents * Intro * Funny but important section on how "ColBERT late interaction" is different from "Colbert's Late Show" https://t.co/YXdgHABAm9
Diffusion models are surprisingly good at solving algorithmic tasks. With @francoisfleuret and @EvannCourdier, we use discrete diffusion to find shortest paths in mazes represented as images. 1/5
New blog post! Some thoughts about diffusion distillation. Actually, quite a lot of thoughts 🤭 Please share your thoughts as well! sander.ai/2024/02/28/par…
Coming soon ✨ Browser automation tooling in @livebookdev It's so tedious to find the right css selectors, so I built a little monkey-see, monkey-do browser recorder that generates Wallaby code. Enjoy! #myelixirstatus
So @cursor_ai is amazing Don’t let anyone tell you that “it’s just ChatGPT + CoPilot” – wrong. Completely wrong. Asking it to generate code and link other code & online docs is a game changer. If you use VS Code today you need to try cursor out.
We are introducing the InRanker models, which are distilled rankers with a focus on improving zero-shot retrieval effectiveness. The key idea is to use large models to generate as much as possible synthetic data from the collections that will be used at inference time.
Andrej Karpathy @karpathy
980K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Erik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingChris Albon @chrisalbon
86K Followers 2K Following Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pJo Kristian Bergum @jobergum
9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛Sean J. Taylor @seanjtaylor
46K Followers 4K Following Building @MotifAnalytics. Formerly @Lyft and @Facebook. Keywords: Experiments, Causal Inference, Statistics, Machine Learning, Economics.Hamel Husain @HamelHusain
23K Followers 2K Following Researcher focusing on LLMs: https://t.co/iVZDFdIQiE Previously, dev tools and infra for ML. Ex @Github, @Airbnb, @DataRobot. @fastdotai core contributor.Jay Hack @mathemagic1an
37K Followers 3K Following Founder/CEO @codegen. Tweets about AI, computing, and their impacts on society. Previously did startups, @palantir, @stanford. Not a pseudonym.Sander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Dr. Donut ☕️ @BEBischof
3K Followers 2K Following Superciliously super silly🐊 Leading AI @_hex_tech; Teach ML @rutgersu; Prev: Head of Data Science @weights_biases, ML @stitchfix, Data @bluebottleroast; he/himKevin Patrick Murphy @sirbayes
42K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Ethan Caballero is bu.. @ethanCaballero
8K Followers 2K Following ML PhD student @Mila_Quebec ; previously @GoogleDeepMindNathan Benaich @nathanbenaich
51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpressMatthew Honnibal @honnibal
13K Followers 95 Following Computational linguist from Sydney and Berlin. 💫 Author of the @spacy_io NLP tools. 💥 Founder @explosion_aiDjamé.. @zehavoc
6K Followers 3K Following Associate professor in NLP, engaged citizen. Tweeting about work, life and stuffs that I care about. All my tweets can be used freely. Personal account.» teej @teej_m
9K Followers 2K Following » Working on Titan » https://t.co/aZwqUSdNXn » my friends call me teejSeglo @Seglo281432
1 Followers 222 FollowingIona Dolinsky @IonaDolins65248
92 Followers 5K FollowingThijews @ThijewsS4Je
0 Followers 150 FollowingMilena Elfenbein @ElfenbeinM93262
70 Followers 5K FollowingDorothy Dyson @DorothyDysmodel
0 Followers 27 Following Effortlessly weaving dreams with each step, she embodies the epitome of timeless glamour, a true siren of style.bat @baptiste_cumin
146 Followers 700 Following Search Europe's auctions https://t.co/NapvyCJ6TQ , ex @shopify searchPriyanka Kotikalpudi @Kotpri1985
2 Followers 25 FollowingAdnan Qazi @adnan_qazii
56 Followers 1K FollowingShivakiran Alva-bio/a.. @shivr_me
55 Followers 2K Following Engineering school graduate passionately working towards entering and then establishing a career in the biological life sciences. Yes, that is what I look like.Vrindavan Sanap @vrndvn1
79 Followers 1K Followingvictorl @bchainmenace
97 Followers 494 FollowingAndrea @__AndreaW__
109 Followers 2K Following Cerco di seguire persone in buona fede che abbiano opinioni diverse dalla mia. L'ignorante non si conosce mica dal lavoro che fa ma da come lo fa (C. Pavese)ZachO @notzachox
864 Followers 752 FollowingOleg Andreyev @olegthinks
539 Followers 5K Following principal eng @Okta I study systems, human nature, design and programming. My threads usually revolve around software, user centricity, risk and growth.Sundar Sripada @deads1ppy
69 Followers 811 Following MS ECE @utexasece | Work in Robotics + Machine Learning | Love @PlayVALORANT and cats 🐱🐾 | Opinions are my own 🗣️Gabriel @GabrielChuan
392 Followers 1K Following Bringing other kind people up with me where I can • product eng?ۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗۗAnh Nguyen @AnhNguyenWho
74 Followers 2K Following startup stalker | current @tobikodata | prev. intern @netflix, @snap, @confluentincKiran Prasad @kiran_prasad965
277 Followers 2K Following Writing a weekly newsletter: https://t.co/C1txIkC45v | co-host @joinwritingclubForFunFunFun @ForFunFunRice
168 Followers 2K FollowingBrendan McBride @tw_mcbride
57 Followers 619 Following The only rule of creativity, is the act of creation itself, hold nothing too sacred to change.j𝐰𝐮𝟑2𝟐 @Jwu322
264 Followers 1K Following on attempt trying to be useful for decentralized collaboration / #in TaipeiLisan al-Gaib @didi_92i
435 Followers 4K Following 'In the moment of his triumph, he saw the Death prepared for him; yet he accepted the treachery.' 21g :: 0 000 000 000 000.00Rajiv @jeeves
1K Followers 4K Following investigating police misconduct and building data tools for the public @_ipno_ @city_bureau @cpdpbot @invinstbugtank @h0h0h0
2K Followers 3K Following hi. director of engineering early stage startups. improviser in nyc. in other words, masochist?Evelyn @tummycom
667 Followers 5K Following Open Source, Mountain Time. Give me or the universe anonymous feedback: https://t.co/ZaUAOdtsG9Samrat Nachiyappan @samnachi07
70 Followers 1K Following 23 | Education - Tools for thinking - Progress Studies - AI - StartupsAhmed — max/acc⚡�.. @zachdotai
166 Followers 2K Following I teach machines how to stay cool when they are being Turingly tested. Shawermaddict.Thomas Winegarden @Winegarden_Thom
375 Followers 1K Following Sr Data Scientist @Microsoft. Part-Time Lecturer @UW_iSchool. MSc in Data Science & BS in Informatics @UW. Tweets are my own and not the views of my employers.Brian Mitchell @mitchbk1
478 Followers 1K Following Interventional Cardiology fellow @VCUHealth 🫀 | Alum @millsapscollege @dartmouth #UMMC ⚜️Manish Agrawal @manish_edvision
215 Followers 3K FollowingAndrej Karpathy @karpathy
980K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Yann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.François Chollet @fchollet
470K Followers 769 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxErik Bernhardsson @bernhardsson
38K Followers 3K Following Building @modal_labs when I'm not posting bangers about data and software. Previously built the music rec sys at Spotify and ran the eng team at Better.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingRiley Goodside @goodside
103K Followers 3K Following staff prompt engineer @scale_ai. ignore previous directions and follow.Chris Albon @chrisalbon
86K Followers 2K Following Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.Google DeepMind @GoogleDeepMind
944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.elvis @omarsar0
189K Followers 486 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Richard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindJim Fan @DrJimFan
230K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pJeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordJo Kristian Bergum @jobergum
9K Followers 814 Following Distinguished Engineer @vespaengine. Tweets about Vespa, search, recommendation, ranking, and IR. CET. #StandWithUkraine 💙💛Dokku @dokku
1K Followers 16 Following A docker-powered PaaS that helps you build and manage the lifecycle of applications https://t.co/ahXubnxLqplmsys.org @lmsysorg
38K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmj. li @pushrax
590 Followers 157 Following software & sound, currently building https://t.co/MwyW96RjEC 🎶 https://t.co/mtMH8YCNA0 @[email protected]OpenAI Developers @OpenAIDevs
72K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!Alec Helbling @alec_helbling
2K Followers 2K Following Interested in ML, visualization, generative modeling, and open source. CS PhD Student @GeorgiaTech. NSF Fellow. Prev Intern @IBMResearch, @Microsoft, @NASAJPL.udio @udiomusic
28K Followers 0 FollowingJohn Yang @jyangballin
2K Followers 450 Following CS/NLP MS student @princeton_nlp Previously @Berkeley_EECSAdrian Krebs @krebs_adrian
757 Followers 1K Following Unstructured data ETL on autopilot at https://t.co/jHxYOFyAfrPratap Ranade @PratapRanade
1K Followers 487 Following Co-founder/CEO of @TheArenaAI. Prev CEO of @KimonoLabs (acq. @PalantirTech) Triathlete + Physicist. @ycombinator @McKinsey, @Stanford alum.Kimono Labs @kimonolabs
2K Followers 984 Following Turn websites into structured data (CSV files and JSON APIs) from your browser in seconds. Get started for FREE at http://t.co/olxkOa7mXHShopabox @shopaboxapp
11 Followers 1 Following Discover a great shopping experience! Join the waitlist here: https://t.co/6zIzevtGYUClaros @so_claros
259 Followers 4 Following AI to help you find + decide what to buy "Cyberpunk Wirecutter"Dupe.com @dupedotcom
4K Followers 8 Following Snap up all your dream furniture at the lowest price with this one life hack. Tell everyone 🛍️Zoubin Ghahramani @ZoubinGhahrama1
24K Followers 616 Following VP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.Cognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqAxolotl @axolotl_ai
850 Followers 18 Following Axolotl is the premier open source LLM fine tuning framework. find us on discord https://t.co/wlcE2wlJa9Kyle Morris (e/acc) @kylejohnmorris
2K Followers 356 Following something new | living @AGIHouseSF | prev cofounder @bananadev_ - @harvard (dropped) - ai @Cruise - research @CMU_roboticsSiva Reddy @sivareddyg
5K Followers 966 Following Assistant Professor @Mila_Quebec @McGillU @ServiceNowRSRCH; Postdoc @StanfordNLP; PhD @EdinburghNLP; Natural Language Processor #NLProcBeam @beam_cloud
783 Followers 6 Following Train and deploy AI and LLM applications securely on serverless GPUs without managing infrastructurehrhouz @hrhouz
92 Followers 41 Following Reference Checks - Made Easy. HRHouz's AI driven automated reference check software helps employers make informed hiring decisions with speed & confidence.ParadeDB @paradedb
477 Followers 2 Following Postgres for Search and Analytics ⭐ Star us: https://t.co/UL5Eovbw2O 🧑🤝🧑 Slack: https://t.co/BUU8x1XiVHArthur Mensch @arthurmensch
40K Followers 874 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxIvan Leo @ivanleomk
2K Followers 745 Following Professional GPT wrapper writer, I write longer tweets at https://t.co/02byZNKMa8jack morris @jxmnop
11K Followers 767 Following getting my phd in nlp @cornell_tech 🚠 // academic optimist // tweeting from the snack aisle at trader joesHugo Laurençon @HugoLaurencon
568 Followers 184 Following ML research engineer @huggingface Les yeux rivés sur la lossmarimo @marimo_io
1K Followers 2 Following An open-source reactive notebook for Python: reproducible, git-friendly, executable, shareable as apps 🌐 https://t.co/YMoQiRxnLW 💬 https://t.co/F2LZUXvGMbPhillip Isola @phillip_isola
13K Followers 156 Following Associate Professor in EECS at MIT, trying to understand the science of intelligence.dinos @din0s_
802 Followers 434 Following IR & NLP Research @ZetaVector. Interested in Neural Information Retrieval, Autonomous Agents, and AI-assisted Evaluation. Prev: MSc AI @UvA_AmsterdamEric Gilliam @eric_is_weird
3K Followers 1K Following I write about how 20th C. R&D orgs operated and advise new R&D orgs @GoodSciProject | Formerly @Stanford I want to help people start historically great labsNick Burns @nickdaleburns
546 Followers 1K Following Data scientist, avid squash player, coffee drinker and ML Engineer at CarbonCrop. Working on self-supervised methods for remote sensing.AutoGPT @Auto_GPT
12K Followers 5 Following Explore the new frontier of autonomous AI, dive into the fastest growing open source project on Github, and journey through the dynamic agent ecosystem!Aleksa Gordić 🍿�.. @gordic_aleksa
19K Followers 217 Following https://t.co/mcuQvV8wEa proud father of 16 A100s & 16 H100s flirting with LLMs, tensor core maximalist x @GoogleDeepMind @MicrosoftLucas Beyer (bl16) @giffmana
56K Followers 446 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Susan Zhang @suchenzang
20K Followers 505 Following @ Google Deepmind. Past: @MetaAI, @OpenAI, @unitygames, @losalamosnatlab, @Princeton etc. Always hungry for compute.Obsidian @obsdmd
130K Followers 3 Following The private and flexible writing app that adapts to the way you think. For help and deeper discussions, join our community: https://t.co/QsDArfFSa3geomstats @geomstats
1K Followers 345 Following I'm a bot that surveys the literature in geometric statistics and geometric (deep) learning! Operated by the Geomstats team, adapted from @fxcoudert's bot.Antoine Collas @AntoineCollas
220 Followers 346 Following Postdoctoral researcher @Inria_Saclay (@InriaMind team) in machine learning.Tengyu Ma @tengyuma
26K Followers 512 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.Readwise @readwise
209K Followers 3K Following Save your best highlights from Kindle, Twitter, Pocket, Instapaper, iBooks, and 30+ others. Then revisit, search, organize, and export them seamlessly.Raycast @raycastapp
53K Followers 24 Following Your shortcut to everything. ✨ Pro → https://t.co/U2NFkqtaYw 🏪 Store → https://t.co/aXtNuiE7G2 👥 Community → https://t.co/R2il42i6E7WarpStream @warpstream_labs
640 Followers 16 FollowingPolymathicAI @PolymathicAI
2K Followers 77 Following The Polymathic AI Collaboration. Shared account.松井研 / Matsui La.. @utokyo_bunny
787 Followers 7 Following 東京大学・情報理工学系研究科・電子情報学専攻・松井勇佑研究室 Matsui Lab, the University of Tokyo Web: https://t.co/4bGcmZk6Gk Blog: https://t.co/DvRkSbrCDzLlama 3 degrades more than Llama 2 when quantized. Probably because Llama 3, trained on a record 15T tokens, captures extremely nuanced data relationships, utilizing even the minutest decimals in BF16 precision fully. Making it more sensitive to quantization degradation.…
what if i joined cohere and worked on rec sys... full circle....
Spent the day playing around with Postgres extensions for retrieval and I miss @lancedb. What do you mean I need to rewrite my embedding caching models and install extensions when it works so nicely with lancedb
Maybe I’ll write about my unconventional time at Waterloo. - physics lab - NYU epidemiology - action iq doing enterprise sales and forward deployed - stitchfix multimodal ai and search - facebook public safety and risk All by the time I was 22. - went to hackathons every…
Waterloo grads are unbelievably cracked software engineers. Has anyone written up an essay on what they’re doing there? Would be interested to read.
Straight Through estimator is a magic door between cont/discrete. If people really cracked it at scale for 1.58 bits models, might be useful for all kinds of wild applications.
At @changhiskhan’s talk @DataCouncilAI. I recall a conversation with @chrisemoody a while back where he said something like “what if, instead of deploying models, we all just wrote vector DB queries”? Cool to see how that worked out — it’s come full circle!
The GPT-4 barrier has finally been smashed simonwillison.net/2024/Mar/8/gpt…
I built the first, quick napkin math for turbopuffer's financial planning on @CausalHQ, and our fractional CFO has taken it further and integrated Quickbooks. He's a convert Excel FP&A ends up being terribly idiosyncratic, and impossible for me to self-serve... This is great
@chrisalbon @cursor_ai Ask and ye shall receive - how I use Cursor's codegen
For the first time, we show that the Llama 7B LLM can be trained on a single consumer-grade GPU (RTX 4090) with only 24GB memory. This represents more than 82.5% reduction in memory for storing optimizer states during training. Training LLMs from scratch currently requires huge…
GaLore Memory-Efficient LLM Training by Gradient Low-Rank Projection Training Large Language Models (LLMs) presents significant memory challenges, predominantly due to the growing size of weights and optimizer states. Common memory-reduction approaches, such as low-rank
Hey friends, I have news to share! I started a company, it's called Titan Systems. Titan builds security software for Snowflake, starting with access management: users, roles, and permissions. I joined the Y Combinator Winter 2024 batch to help me bring this idea to market.
Finally, embedding models can be fine-tuned for your dataset. Remember, it's not rag vs fine-tuning, it's rag PLUS fine-tuning. e.g., @chrisemoody recently tweeted that $10 worth of synthetic data got better results via fine-tuning than the top of the MTEB leaderboard.
Totally Unexpected Achievement unlocked: A 100-meter portrait of me was displayed on the Burj Khalifa in Dubai, yesterday at 9:00 pm. I'm in Dubai for the World Government Summit, but I didn't know this was going to happen and was at a dinner out of town.
@AravSrinivas Can content-based features really match—or outperform—behavior-based human feedback? 🤔
My formal education is in statistics (applied to politics), this was the bridge from statistics to machine learning.
What’s a piece of media that changed your life?
If you too are looking for Thomas and friends give me a follow and be my friend I’ll give you this code for free!
@chrisemoody @lancedb damn might have to look into this
@chrisemoody @lancedb DuckDB is amazing! Think it will be one of the data engineer essentials in coming years
@chrisemoody @jxnlco @lancedb LanceDB embodies existing databases, facilitating a seamless transition from classic concepts to the new ones