Soumith Chintala @soumithchintala
Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source. soumith.ch New York City Joined September 2009-
Tweets3K
-
Followers184K
-
Following866
-
Likes3K
The live updating image generator on meta.ai/?icebreaker=im… is a pretty sick UX.
very early LMSys Arena results peg llama3-70B at 5th place (the variance is still pretty high, so it can jump up or down a bit). This is so exciting. Can't wait to see how the 405B fares once it is released. chat.lmsys.org/?leaderboard
There's another quieter release from @AIatMeta today that's really cool. * Live Preview: As you type your image prompt, you get a live preview, making iterating for a good image easier. * Animate: now you can animate images for short bursts
Llama3 8B and 70B are out, with pretty exciting results! * The ~400B is still training but results already look promising. * Meta's own Chat interface is also live at meta.ai * TorchTune integration is shortly going live: github.com/pytorch/torcht…
Llama3 8B and 70B are out, with pretty exciting results! * The ~400B is still training but results already look promising. * Meta's own Chat interface is also live at meta.ai * TorchTune integration is shortly going live: github.com/pytorch/torcht…
Oh my god. 😂 GPT-4 uses the word “delve” so much because many of the RLHF’s (reinforcement learning human feedback) workers for GPT-4 were Nigerians who use the word “delve” a lot more relative to other countries. So GPT-4 writes like an educated anglophone African.
Oh my god. 😂 GPT-4 uses the word “delve” so much because many of the RLHF’s (reinforcement learning human feedback) workers for GPT-4 were Nigerians who use the word “delve” a lot more relative to other countries. So GPT-4 writes like an educated anglophone African. https://t.co/J1uNJkkLvm
Really excited to officially release torchtune: a PyTorch-native library for easily fine-tuning LLMs! Code: github.com/pytorch/torcht… Blog: pytorch.org/blog/torchtune… Tutorials: pytorch.org/torchtune/stab… [1/5]
Announcing the alpha release of torchtune! torchtune is a PyTorch-native library for fine-tuning LLMs. It combines hackable memory-efficient fine-tuning recipes with integrations into your favorite tools. Get started fine-tuning today! Details: hubs.la/Q02t214F0
LLM Fatigue is a variation of Decision fatigue for AI; every day there's a new release, so you stick to the familiar 3 names despite merits
Meta announces 2nd-gen inference chip MTIAv2. * 708TF/s Int8 / 353TF/s BF16 * 256MB SRAM, 128GB memory * 90W TDP. 24 chips per node, 3 nodes per rack. * standard PyTorch stack (Dynamo, Inductor, Triton) for flexibility Fabbed on TSMC's 5nm process, its fully programmable via the…
thanks to @JeffDean and @SingularMattrix for their great leadership today; and @fchollet @dwarak and many others at @GoogleDeepMind for quickly charting a good and aligned path forward together. We can go back focusing on the unlimited amounts of good work ahead of us. (Jeff,…
Schedule-Free Learning github.com/facebookresear… We have now open sourced the algorithm behind my series of mysterious plots. Each plot was either Schedule-free SGD or Adam, no other tricks!
It was hard to find quality OCR data... until today! Super excited to announce the release of the 2 largest public OCR datasets ever 📜 📜 OCR is critical for document AI: here, 26M+ pages, 18b text tokens, 6TB! Thanks to @ucsf_library, @industrydocs and @PDFAssociation 🧶 ↓
you should sign up for @cHHillee's newsletter, it's great. really informative technical articles. it also has one of the best URLs: thonking.ai/about
SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source! We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code github.com/princeton-nlp/…
move to NYC. build open models. distribute bootleg books of model weights alongside bagels and ice cream trucks. @srush_nlp @kchonyc @jefrankle and I will be around.
move to NYC. build open models. distribute bootleg books of model weights alongside bagels and ice cream trucks. @srush_nlp @kchonyc @jefrankle and I will be around. https://t.co/q18ECKhamK
SOTA AI for games like poker & Hanabi rely on search methods that don’t scale to games w/ large amounts of hidden information. In our ICLR paper, we introduce simple search methods that scale to large games & get SOTA for Hanabi w/ 100x less compute. 1/N arxiv.org/abs/2304.13138
legit new open model just dropped from Mosaic/Databricks. Seems very competitive on offline benchmarks. Check it out 👇
legit new open model just dropped from Mosaic/Databricks. Seems very competitive on offline benchmarks. Check it out 👇
Meet DBRX, a new sota open llm from @databricks. It's a 132B MoE with 36B active params trained from scratch on 12T tokens. It sets a new bar on all the standard benchmarks, and - as an MoE - inference is blazingly fast. Simply put, it's the model your data has been waiting for.
I'm excited to share more of the future we're working on at @Osmo_Labs. We're going to teleport a scent. osmo.ai/blog/teleporti… (1/2)
Yann LeCun @ylecun
708K Followers 716 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Sebastian Raschka @rasbt
265K Followers 901 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.AI at Meta @AIatMeta
527K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.PyTorch @PyTorch
378K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundationAlfredo Canziani @alfcnz
86K Followers 269 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York UniversityJeremy Howard @jeremyphoward
221K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordKosta Derpanis @CSProfKGD
48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chairabhishek @abhi1thakur
81K Followers 661 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarLucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected](((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pRichard Socher @RichardSocher
101K Followers 967 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindHorace He @cHHillee
23K Followers 447 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleKevin Patrick Murphy @sirbayes
42K Followers 328 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Tanishq Mathew Abraha.. @iScienceLuvr
54K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6QbAndrew Trask @iamtrask
74K Followers 190 Following @openminedorg, @GoogleDeepMind ethics team, @OxfordUni phd candidate, @UN pet lab, @GovAI_, creator of #GrokkingDeepLearning, NALU, and sense2vecTeeshen @Teeshen158461
11 Followers 103 FollowingC at hyluo @CHyluo89133
0 Followers 16 FollowingOpemHide @OpemHide
4 Followers 46 FollowingBrian Rienecker @GuyLadouche31
38 Followers 130 FollowingAsm.k @Kofasam99
2 Followers 48 Followingcompressionsavant @CompressLuis
0 Followers 15 FollowingStephen Morgenstern @smorgenstern_
100 Followers 2K Following @Wharton '15 | Ex-Scotia Capital | Ex Machina fan | Film finance/producing - into info asymmetry & Getty watermarks | 3x NYT Bestseller PurchaserManuel Strajman @manu_st__
0 Followers 51 Followingsomeone @placeholderer12
0 Followers 9 FollowingBOSS li @BOSSli9527
2 Followers 40 FollowingGabriel Freitas @gabriel_olvr7
12 Followers 133 Following We're random bullets, love Shot by some drunken GodTeJas @tsdesai
12 Followers 233 Followinginkspective @ioanahalunga
21 Followers 103 Following Visual debates, abstract takes on the usual, infiltrated with the unusual.sooraj @s00r_aj
0 Followers 134 FollowingKathrineHo @HoKathrine
0 Followers 13 FollowingAzeez Sodiq Abiola @AdeAbiol
370 Followers 4K Following Shopify Store Developer 👨💻|| Dm to build a successful online business 🚀 🚀Trinity Devault @traveler1556
4 Followers 133 FollowingNyarlathotep @nyar_lanthotep
52 Followers 178 FollowingERAllen @ERAllen9
178 Followers 2K Following Australian pharmacist and research scientist living in America.Chandrasekhar Raman @craman96
137 Followers 1K FollowingCooperwest -e/acc @ChrisFain16
158 Followers 299 Following Currently working on my Bachelor of Science in Engineering in Electrical Engineering at Arizona State University. Enjoys quantum mechanicsfang yun @xiaoz1989
1 Followers 38 FollowingPrerna Sharma @Prertweets
154 Followers 186 Following VC / General Partner / @antler_us / @AntlerGlobal / exUber🐱 @my17thlover
0 Followers 205 FollowingZachary Sisson @ZacharySisson3
915 Followers 2K Following E/ACC, Tesla and MTB content. For Hire: Design, consult, build, maintain trails. Trailblazer Maverick of Austin, TX.hdᅠᅠᅠᅠᅠᅠ�.. @henni443
25 Followers 442 FollowingDavid Cantu @KennyMaert21378
318 Followers 2K FollowingNir Gottlieb @nirg2014
17 Followers 139 FollowingPedro Aldea @paldeamas
0 Followers 74 FollowingElvis Miglans @vianstarlv
81 Followers 501 Following Sceptical about everything, including my own ability for being sceptical Quantitative researcher🇸🇪📊📉HoshAI @hoshaicom
2 Followers 37 Following HoshAI: Your AI-powered companion for generating text, images, audio, and video. Join us at https://t.co/ppEPkf6VlT today!SWAPNIL MISHRA @swapnilm3
67 Followers 703 FollowingHeriz Shrestha @stha_heriz
0 Followers 51 FollowingParth Bhardwaj @_Parth_Bhardwaj
9 Followers 141 FollowingMike Decker @mdexster
406 Followers 2K Following trying to be a better human everyday, not always successful...Intelligent/Unintelligent Document Processing…video games...personal viewsVittorio Neri @VittorioNeri
293 Followers 1K Following Digital Marketing Manager EMEA Roland DG. Love reading, music, digital and life.xike41 @xike41
1 Followers 20 FollowingDataQu @DataQuChile
131 Followers 484 Following We are a Machine Learning, AI, and Software Development company based in Santiago Chile and Toronto Canada. Building the tomorrow solutions today!karthik @skarthik2924
14 Followers 55 FollowingYann LeCun @ylecun
708K Followers 716 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Sebastian Raschka @rasbt
265K Followers 901 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.AI at Meta @AIatMeta
527K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.PyTorch @PyTorch
378K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundationAlfredo Canziani @alfcnz
86K Followers 269 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York UniversityJeremy Howard @jeremyphoward
221K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordJürgen Schmidhuber @SchmidhuberAI
106K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pMichael Black @Michael_J_Black
58K Followers 638 Following Director, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.Horace He @cHHillee
23K Followers 447 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleKevin Patrick Murphy @sirbayes
42K Followers 328 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Tanishq Mathew Abraha.. @iScienceLuvr
54K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6Qbclem 🤗 @ClementDelangue
89K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform to build machine learningOriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.Jeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)Sasha Rush @srush_nlp
51K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzJulien Chaumond @julien_c
46K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueBrenden Lake @LakeBrenden
7K Followers 195 Following Assistant Professor of Psychology and Data Science @ NYU. Co-Director of the NYU Minds, Brains, and Machines Initiative.Jonathan Jarvis @JonathanJarvis
2K Followers 625 Following Building @getcartwheel. Previously, Google. Also Universal Patterns. Ran a couple cool companies. Love animation, diagrams & explainers.Amnon Shashua @AmnonShashua
7K Followers 209 Following CEO @Mobileye. @OrCam, @AI21Labs, @ONEZEROBANK. Sachs Prof. Computer Science at Hebrew U. Passion for #AI, cycling and #SelfDrivingCars. Opinions are my ownNormal Computing 🧠.. @NormalComputing
2K Followers 77 Following We build AI systems that natively reason, so they can partner with us on our most important problems. Join us https://t.co/BcjWCoI5b8.Ahmad Al-Dahle @Ahmad_Al_Dahle
3K Followers 47 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)Daniel Han @danielhanchen
7K Followers 923 Following Building @UnslothAI. Finetune LLMs 30x faster https://t.co/aRyAAgKOR7. Prev ML at NVIDIA. Hyperlearn used by NASA. I like maths, making code go fastUmer Adil @UmerHAdil
438 Followers 299 Following Learning & providing value to OSS AI | Contributor @huggingface @diffuserslib, @LangChainAI, gpt engineer | https://t.co/BOR9cWbN8oadammaj @MajmudarAdam
6K Followers 200 Following founding engineer @thirdweb // cs + neuro (on gap) @PennTaelin @VictorTaelin
16K Followers 893 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that mattersSwabha Swayamdipta @swabhz
6K Followers 460 Following Assistant Prof. @CSatUSC | Researcher in #NLProc | Previously with @uwnlp @allenai | she/herDwarkesh Patel @dwarkesh_sp
52K Followers 696 Following Being pretrained Host of Dwarkesh Podcast https://t.co/3SXlu7fy6N https://t.co/rEhnfYywXY https://t.co/hQfIWdM1UnLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Shunyu Yao @ShunyuYao12
7K Followers 839 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)John Yang @jyangballin
2K Followers 441 Following CS/NLP MS student @princeton_nlp Previously @Berkeley_EECSLouis Castricato @lcastricato
3K Followers 476 Following Math @uwaterloo, RLHF @BrownCSDept, Goosefluencer. x-RS @aieleuther, x-Head of LLMs @stabilityai, x-lead @CarperAI. co-founder @synth_labs. We're hiring.Aaron Defazio @aaron_defazio
6K Followers 356 Following Research Scientist at Meta working on optimization. Fundamental AI Research (FAIR) teamKarina Nguyen @karinanguyen_
12K Followers 647 Following AI research & eng @AnthropicAI, prev. intern @nytimes, @square, @dropboxAlejandro Matamala Or.. @matamalaortiz
3K Followers 527 Following Co-founder / Design @RunwayML https://t.co/K9wYeqbCvpMatei Zaharia @matei_zaharia
39K Followers 1K Following CTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZAlex Wiltschko @awiltschko
665 Followers 51 Following CEO @Osmo_Labs. Giving computers a sense of smell to improve the health & wellbeing of human life.the tiny corp @__tinygrad__
33K Followers 63 Following We make tinygrad. Our mission is to commoditize the petaflop.Kartikay Khandelwal @kakemeister
578 Followers 586 Following AI @MetaAI and @PyTorch. Previously @Stanford and @Microsoft.Mihir Patel @mvpatel2000
3K Followers 384 Following Research Engineer @MosaicML | cs, math bs/ms @StanfordWing Lian (caseus) @winglian
8K Followers 2K Following @axolotl_ai dev. OpenAccess AI Collective founder. Alignment Labs. AI/ML tinkerer. Building tools for everyone.Theofanis Karaletsos @Tkaraletsos
4K Followers 2K Following Head of AI @cziscience | probabilistic and deep MLnev @apeoffire
582 Followers 49 FollowingIlya Kostrikov @ikostrikov
8K Followers 615 Following Researcher @OpenAI, previously @Postdoc at UC Berkeley @berkeley_ai, PhD in CS @CILVRatNYUPhysical Intelligence @physical_int
4K Followers 8 Following Physical Intelligence (Pi), bringing AI into the physical world.The Sagacious Society.. @SmolModels
2K Followers 3 Following smol is simple, speedy, safe, and scheap.DJ Strouse @djstrouse
1K Followers 620 Following Reasoning about reasoning. Technically a member of staff @GoogleDeepMind. Previously, PhD @Princeton.Shayne Longpre @ShayneRedford
4K Followers 996 Following PhD @MIT. Prev: @Google Brain, @apple ML, @stanfordnlp. 🇨🇦 Interests: AI/ML/NLP, Data-centric AI, transparency & societal impactFauna Robotics @faunarobotics
722 Followers 18 FollowingYang Song @DrYangSong
10K Followers 885 Following Leading the Strategic Explorations team @OpenAI. Score-Based Models. Diffusion Models. Consistency Models.Brett Adcock @adcock_brett
171K Followers 14 Following Founder @Figure_robot (AI Robotics) & Archer Aviation (NYSE: ACHR)Nag Murty @MurtyNag
724 Followers 708 Following Founder and CEO at @sheeprobotics We are building a foundation model to drive all outdoor robots. tech-optimist | Stanford, IIT | 2x deeptech founderMark Chen @markchen90
10K Followers 245 Following Head of Frontiers Research at OpenAI. Coach for the USA IOI Team.Gideon Mann 🇮🇱 @gideonmann
3K Followers 2K Following Global Head of AI, Technology at Millennium. All opinions my own.Jonathan Ho @hojonathanho
4K Followers 151 FollowingOpenAI Developers @OpenAIDevs
70K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!Connor Holmes @cmikeh2
6K Followers 383 Following Systems Lead for Sora at @openai he/him Cover photo: https://t.co/xqTY8VV56gEric Steinberger @EricSteinb
7K Followers 478 Following Writing code that writes code on a mission to build safe superintelligence | CEO/cofounder @magicailabs🔥llm.c update: Our single file of 2,000 ~clean lines of C/CUDA code now trains GPT-2 (124M) on GPU at speeds ~matching PyTorch (fp32, no flash attention) github.com/karpathy/llm.c… On my A100 I'm seeing 78ms/iter for llm.c and 80ms/iter for PyTorch. Keeping in mind this is fp32,…
I'm seeing a lot of questions about the limit of how good you can make a small LLM. tldr; benchmarks saturate, models don't. LLMs will improve logarithmically forever with enough good data.
@soumithchintala @dwarkesh_sp new york city is the only city
The live updating image generator on meta.ai/?icebreaker=im… is a pretty sick UX.
I got tenure! It was fitting that I got to celebrate with the lab right after the news. Working together for the last 6.5 years has been a blast.
@soumithchintala @dwarkesh_sp everyday i resist the siren song of new york 😭
Congrats!
Llama3 8B and 70B are out, with pretty exciting results! * The ~400B is still training but results already look promising. * Meta's own Chat interface is also live at meta.ai * TorchTune integration is shortly going live: github.com/pytorch/torcht…
🦙 🦙 🦙labs.perplexity.ai and brought up llama-3 - 8b and 70b instruct models. Have fun chatting! we will soon be bringing up search-grounded online versions of them after some post-training. also available on pplx-api, and you get 5$ monthly API credits if you're already…
@soumithchintala Congrats Soumith & the rest of the team, this is amazing stuff!
I'm personally super excited to share the progress on the 400B+ training. So proud of the entire team that worked tirelessly to make this model a reality. There's lots more to come including a full research paper soon!🚀🦙🦙
@soumithchintala @lvdmaaten 🤣🤣🤣 Literally just ran the 8B that @Prince_Canuma quantized. Very nice (and fast 😉) on an M2 Ultra:
Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ @lmsysorg :)) 400B is still training, but already encroaching…
These numbers are insane. I can't even imagine what the larger one(s) will be. Looks like Mistral 7B might be dead as of today though, and maybe even sonnet lol My favorite is the huge gains in coding capabilities
Excited to share what I’ve been working on for the past 9 months. So incredibly proud of the entire team that worked tirelessly to make Llama 3 happen! And this is only the beginning… ai.meta.com/blog/meta-llam…
Really proud of the work that went into making this possible, hope this helps the community push the field forward. Also in case anyone missed it, there's a sneak peak of what to come next at the end of blog post ai.meta.com/blog/meta-llam…
It’s here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs. Key highlights • 8B and 70B parameter openly available pre-trained and fine-tuned models. • Trained on more…
It’s here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs. Key highlights • 8B and 70B parameter openly available pre-trained and fine-tuned models. • Trained on more…