Sebastian Ruder @seb_ruder
Multilingual LLMs @cohere • Prev: @GoogleDeepMind • Newsletter: https://t.co/7JGh2qpG98 ruder.io Berlin, Deutschland Joined September 2014-
Tweets4K
-
Followers80K
-
Following1K
-
Likes10K
Command R+ The Top Open-Weights LLM + RAG and Multilingual Support My highlights on the release of the latest Cohere models and why I'm excited for LLM research. newsletter.ruder.io/p/command-r
Excited to announce the Compass Beta, a very powerful multi-aspect data search system powered by a new embedding model, Compass. We're looking for help stress-testing the model's capabilities and finding where it breaks. Sign up here: txt.cohere.com/compass-beta/
Introducing Rerank 3! Our latest model focused on powering much more complex and accurate search. It's the fastest, cheapest, and highest performance reranker that exists. We're really excited to see how this model influences RAG applications and search stacks.
我们在增强 Command 处理中文的能力方面进行了大量投资。我很期待看到华语社群将利用它创造什么。请多多提供反馈,以便我们能够改进。
한국 커뮤니티가 Command R과 R+를 사용하여 어떤 것을 만들어 낼지 매우 기대됩니다! 여러분의 소중한 피드백을 부탁드리며, 개선할 점이 있다면 언제든지 알려주시기 바랍니다.
한국 커뮤니티가 Command R과 R+를 사용하여 어떤 것을 만들어 낼지 매우 기대됩니다! 여러분의 소중한 피드백을 부탁드리며, 개선할 점이 있다면 언제든지 알려주시기 바랍니다.
Command RおよびR+の主要言語として日本語をサポートできることを嬉しく思います。日本語の機能を今後も向上させていくことを最優先事項とします。
Get ready for Cohere Build Day 🌐 Join our exclusive developer workshop in 4 cities! Dive into our new Command R & R+ models and learn from Cohere experts to develop enterprise-grade AI solutions. Submit your interest to attend now: info.cohere.ai/cohere-build-d…
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingJeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈abhishek @abhi1thakur
81K Followers 662 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub StarRichard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechniqueclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRRadek Osmulski 🇺�.. @radekosmulski
25K Followers 554 Following Resources to take your Machine Learning skills to the next level 🧪 Senior Data Scientist, RecSys @NVIDIAAI 🏫 @fastdotai trained DL Eng 📝 https://t.co/By87iXx5PuSanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @KaggleJay Alammar @JayAlammar
35K Followers 1K Following Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJThomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceGraham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.jadifimo1971 @jadifimo1957738
2 Followers 24 Followingquattdockmobi1983 @quattdockm20338
11 Followers 30 FollowingOLUBUSOLA IYIOLA @OLUBUSOLAIYIOL1
46 Followers 325 Following I am a mathematician, a computer programmer, I play musical instruments like the drumset and guitar.I am graphics designer. I am more of a problem solverminoreleventh @minoreleventh
14 Followers 456 Following Less outrage, more communication. Less following, more independent thought.Smruti (Smoothie) Deo.. @DeoghareSmruti
196 Followers 626 Following Biomedical Informatics | Data Scientist | Medical Artificial Intelligence | PhD student @PrasathLab @UofCincy @CincyChildrens | Dance & Everything ArtsySamith M S @samith5691
0 Followers 17 FollowingVivi @first_time28
3 Followers 40 FollowingSher_857 @codewithsherr
0 Followers 54 FollowingArinjay Kumar @CodClever7k
103 Followers 2K Following I am a passionate software engineer/developer, coder, programmer, video editor, web designer, and modeler.asfaan murthyulas @AMurthyulas
4 Followers 138 FollowingMuntaz Kaleem @mkhb654
378 Followers 2K Following Entrepreneur & Life Learner Founder https://t.co/xeYUbtAhOmLollipop @lollipopseeker
35 Followers 139 Followingpiztachios @saladpalad
28 Followers 90 FollowingPrakash Singh @prax___5
9 Followers 145 FollowingDavid Uche @DavidUche125795
8 Followers 43 Followingravikd @ravi_ravikd
27 Followers 84 FollowingKowndinya Renduchinta.. @KowndinyaR
2 Followers 37 FollowingBayu Wibisana @Bayu08_
125 Followers 293 Following 🏡: Lombok Island. 👨🏽🎓: Continuous and Lifelong Learner. 📚: Machine Learning, Artificial Intelligence and Data Science.Suraj Naithani @SurajNathani318
2 Followers 39 FollowingSrikanth Rao M @melagiris
403 Followers 772 Following Netizen, here to click the retweet button, mostly!银河 @ynh1661160
2K Followers 5K FollowingDana Mahmood @deordered
9 Followers 650 Following Fine-tuning AI models oftentimes & practicing philosopher at other times.Asvin G @asving94
3 Followers 57 Followingjaiswati @jaiswati
29 Followers 250 FollowingIsmael Hossen @IsmaelHossen8
251 Followers 3K FollowingViviana @Viviana75842443
2 Followers 161 FollowingHalil Zabun @halilzabun_
64 Followers 52 Following ML engineer 👨💻| YouTuber 🎥| Interested in machine learning, neuroscience, linguistics, psychology, video games and their intersections.P @IiHq7kK57ZxosbT
77 Followers 1K FollowingYingjian Fu @yingjianfu
17 Followers 482 FollowingElectronicsseeker @libertarian108
7 Followers 913 FollowingLiqian @liqianmailbox
28 Followers 256 FollowingCrazy Universe @Crazy_Universe0
96 Followers 1K Followinggphya @gphya
37 Followers 5K Followinggabriel @streamlinegabo
6 Followers 195 FollowingGabriel @Gabriel32047236
2 Followers 42 Followingcinedu @Educine188720
131 Followers 426 Following Let’s talk about #Cinema #Artificialintelligence #business #politicsPrateek Tripathi @ai_prateek
0 Followers 23 FollowingZachary Cross @med_zachary
864 Followers 834 Following 🤖 @GlassHealthHQ //👨🏻⚕ @NUFeinbergMed // @Penn 🎓 // former researcher @ChildrensPhila 🔬 // he/him 🏳🌈Aurosish Rout @SCIENTIFIC1492
28 Followers 88 Following Hi i am Aurosish Rout a Full stack web developer and computer science engineerSwarup Dwivedy @swarup5662
9 Followers 43 Followingzaixianliuyun @hiboys789
0 Followers 34 FollowingPavan Reddy @pavanrdy100
3 Followers 16 Following(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingSoumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Richard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindHugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhatePercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAndrew Trask @iamtrask
74K Followers 190 Following @openminedorg, @GoogleDeepMind ethics team, @OxfordUni phd candidate, @UN pet lab, @GovAI_, creator of #GrokkingDeepLearning, NALU, and sense2vecSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechniqueclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersChristopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRRadek Osmulski 🇺�.. @radekosmulski
25K Followers 554 Following Resources to take your Machine Learning skills to the next level 🧪 Senior Data Scientist, RecSys @NVIDIAAI 🏫 @fastdotai trained DL Eng 📝 https://t.co/By87iXx5PuSanyam Bhutani @bhutanisanyam1
35K Followers 994 Following 👨💻 Sr Data Scientist @h2oai | Previously: @weights_biases 🎙 Podcast Host @ctdsshow 👨🎓 International Fellow @fastdotai 🎲 Grandmaster @KaggleJay Alammar @JayAlammar
35K Followers 1K Following Machine learning and language models R&D. Builder. Writer. Visualizing AI, ML, and LLMs one concept at a time. @Cohere. https://t.co/TquuQXlLOJStuart Logan @stuartlogan
2K Followers 592 Following CEO of @JoinTwine, connecting companies to experts creative/marketing/engineering freelancers. We also remove bias in AI/ML datasets. 2x founder. @9others host.Answer.AI @answerdotai
1K Followers 81 Following A new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughsWei-Lin Chiang @infwinston
3K Followers 852 Following CS PhD student at UC Berkeley. co-lead of Chatbot Arena @lmsysorgシェイン・グウ @shanegJP
53K Followers 335 Following https://t.co/yYd252xC4w Gemini 1.5 Pro @GoogleDeepMind 東京・SF。 元@GoogleAI Brain、元@OpenAI。 英語: @shaneguML。全て個人意見です。Piotr Nawrot @p_nawrot
3K Followers 224 Following PhD student in #NLProc @Edin_CDT_NLP | Previously intern @Nvidia & @MetaAIJerry Liu @jerryjliu0
44K Followers 1K Following co-founder/CEO @llama_index Careers: https://t.co/EUnMNmbCtx Enterprise: https://t.co/Ht5jwxSrQBlmsys.org @lmsysorg
37K Followers 171 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmSam Charrington @samcharrington
19K Followers 3K Following Machine learning & AI podcaster, community builder and all around enthusiast. Creator of the @TWIMLAI Podcast, TWIMLcon, TWIMLfest & the TWIML Solutions Guide.Wojciech Galuba @wgaluba
490 Followers 1K Following Head of Data & Evals @Cohere | prev: Research Eng Lead @MetaAI | founded @Meta’s A/B testing platform and the AI annotation platform | @ICepfl alumnusFaraz Khoubsirat @FarazDoTAI
590 Followers 797 Following ml @cohereai, software eng @uwaterloo will mostly talk about cars, books, podcasts and all things MLMaxime Voisin @maximevoisin_ai
738 Followers 668 Following Product manager RAG/Tools/Code @cohere. Previously @labelbox, @stanford computer vision labsSebastian Hofstätter @s_hofstaetter
1K Followers 254 Following RAG & tool use modelling co-lead @Cohere; PhD in efficient neural information retrieval from @tu_wienAbhishek Sinha @abysinha
980 Followers 1K Following VP Product, https://t.co/M4BYZuOCI7, ex-AWS, Can watch any sport especially Test Cricket. Can play none well. Avid West Wing watcher. Maker of bad cocktails.Uri Eliabayev @urieli17
13K Followers 1K Following AI Consultant and lecturer, Founder of the "Machine & Deep learning Israel" community (37K members), Contributor at @haaretzLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themDavid Kanter @TheKanter
4K Followers 199 Following Executive Director @MLCommons making machine learning better for everyone. @MLPerf CPUs, computer architecture, semiconductors, graphics, economics, writes @RWTMinjie Xu @chokky_vista
222 Followers 273 Following ML/NLP researcher & practitioner. RAG & tool-use @cohere 🧠 x @tractable_ai @TechAtBloomberg 👨🏻💻 PhD from Tsinghua CS 🎓 In meinem Lieben, in meinem Lied 🎵Irem Ergün @irombie
1K Followers 455 Following •ML Engineer @cohere developing LLMs🫡 • Previously: @UCR_CSE & @BilkentUniv • yoga & writing & music 🌈🦄 •Tweets in 🇹🇷🇬🇧🎸 •Blogs @ 2cute2tech 👩💻👇🏻Felipe Cruz-Salinas @fffffelipec
132 Followers 386 Following Large models @cohere. Prev: @Aleph__Alpha, @microsoftGiannis @giannis2two
189 Followers 113 Following Member of Technical Staff at @cohere, CS + Math @MIT, primarily european but surprisingly americanPhilipp Schmid @_philschmid
16K Followers 651 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkSandra Kublik @itsSandraKublik
3K Followers 1K Following LLM = Live + Love + Make 🌈 DevRel @Cohere I write and make videos about things I'm trying to grasp. The latest baby is my podcast. She/her. Views are mine!Saurabh Dash @TheyCallMeMr_
245 Followers 457 Following ML @CohereAI , PhD Student @GeorgiaTech. Previously, ML Research @Apple, @IITkgp. https://t.co/yZLkUsiZ7P. Learning why my machines don’t learn.Justin S. Lee @justinsylee
140 Followers 250 Following Member of Technical Staff @Cohere. Alum of Computational Science and Engineering @Harvard_IACS and Columbia Applied Math @APAMMSECUYi Chern Tan @yichern_tan
85 Followers 94 Following Command modeling @cohere. Previously @Waymo @Facebook @Yale. 🇸🇬Aidan Peppin @AidanPeppin
2K Followers 2K Following Tech & society researcher. Policy & Responsible AI @CohereForAI. Formerly @adalovelaceinst, Milltown Partners, @WellcomeTrustSwaroop Mishra @Swarooprm7
5K Followers 893 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Jacob Austin @jacobaustin132
3K Followers 797 Following @Google @DeepMind researcher. AI for math and science. Coding. Gemini. I also play piano. NYC. Opinions my ownNick Frosst @nickfrosst
13K Followers 846 Following cofounder @cohere - singer @goodkidband pfp: @polarfishh_26Jon Ander Campos @jaa_campos
237 Followers 313 Following Member of Technical Staff @cohere. PhD in Natural Language Processing. Previously @IxaGroup, @Apple, @AIatMeta, @CNRS and @nyuniversity.Jesujoba Alabi @alabi_jesujoba
258 Followers 733 Following PhD Student @LstSaar & @SIC_Saar, doing natural language processing #NLProc | prev @InriaParisNLP | @UniIbadan @bowenuniversity alumnus | Ọmọ Jesu |Ọmọ OgbomọṣọArash Ahmadian @aahmadian_
924 Followers 540 Following Preference Training & RL @Cohere @CohereForAI, researcher @VectorInst ece @uoftMarzieh Fadaee @mziizm
402 Followers 332 Following seeks to understand language. Senior Research Scientist @CohereForAI @Cohere. PhD from @UvA_Amsterdam. [email protected]. Contemplates in private @mzi.Armand Joulin @armandjoulin
4K Followers 344 Following principal researcher, @googledeepmind. ex director of emea at fair @metaai. mostly work on open projects: fasttext, dino, llama, gemma.Maximilian Mozes @maximilianmozes
203 Followers 482 Following Member of Technical Staff @cohere. PhD @UCL/@ucl_nlp. Previously: @GoogleAI/@SpotifyResearch. He/Him.Maha Elbayad @melbayad
620 Followers 629 Following Research Scientist @AIatMeta. @centralesupelec, @ENS_ParisSaclay and @UGrenobleAlpes alum. 💬 My opinions are my own | she/herSeraphina Goldfarb-Ta.. @seraphinagt
828 Followers 349 Following Head of AI Safety @cohere. Phd @EdinburghNLP @InfAtED. If you don't recognise me, that's because I am invisible https://t.co/oRZvFdIDcRJoelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_QuebecArthur Mensch @arthurmensch
40K Followers 872 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxelliott c @echoi26
169 Followers 202 Following Embeddings @CohereAi previously @scale reformed bottom bucket tech investment banker @ EvercoreStefania Druga @Stefania_druga
10K Followers 7K Following Research Scientist @Google working on Gemini AI & multimodal LLM applications/ PhD @UW / former resident in AI / ML @msft @google @Theteamatx, alumn @mitVeronica Qing Lyu @veronica3207
725 Followers 341 Following PhD student @upennnlp | NLP, Linguistics, Explainable AI | Intern @tencent, @allenaiTolga Bolukbasi @tolgab0
277 Followers 213 Following AI/ML research @GoogleDeepmind, PhD, opinions my own.Terra Blevins @TerraBlvns
499 Followers 421 Following Grad student researching NLP at the University of Washington. she/her.🚀 Our latest Adapters library release integrates quantized model training, enabling efficient fine-tuning of LLMs with Q-LoRA, Q-Bottleneck Adapters, or Q-PrefixTuning. 🎉 Check out this notebook to learn how to fine-tune Llama 3 with Q-LoRA 🦙✨: github.com/Adapter-Hub/ad…
Exclusive for @FortuneMagazine: A year ago, I reported on @OpenAI rival @cohere as flying under the radar. Today, it is under pressure to prove its business model-I spoke to CEO @aidangomez about Cohere's latest models, new revenue channels & much more. fortune.com/2024/04/25/coh…
Command R+ is underrated for everyday use 👀. The speed & the balance of good reasoning, concise answers, and nice writing style makes it perhaps the best daily assistant 🔥
@nickfrosst From a French perspective, you have the best LLM on the market. CR+ is the only one that truly understands and speaks French. Bravo and thank you!
we open sourced our chat interface. github.com/cohere-ai/cohe…
CMDR+ is easily becoming my favorite model now that I've had more time to play with it. GG @aidangomez @cohere It's like having GPT4 on my lap, offline. I love the system/style preamble prompt setup.
We've written up our methods, results, and provided background on the critical underlying research ideas such as DoRA, in a blog post, along with open source code to allow you to replicate and build on our results: answer.ai/posts/2024-04-…
If the rise of LLMs caught you by surprise, here's your chance to get a preview of what's likely to be the next monumental jump in AI capabilities: LLM-backed agents that use software tools In this video, I'll walk you through the concepts and code of building an LLM-backed…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
@WolframRvnwlf @ylecun @cohere ‘s secret sauce is that @seb_ruder breathes multilingual
Just updated our benchmark to include the new Cohere Command models and DBRX! See results at vals.ai
Few days ago a plot comparing MMLU scores vs active parameters became quite popular. Active parameters is only relevant if you run the LLM with batch size 1. In production deployment you batch requests, completely changing your conclusions. Below a better graph comparing user…
Training a bilingual model on two identical (cloned) languages with a 90/10 split yields better results *on both languages* than with a 50/50! It also leads to gradients and hidden states which are more aligned across languages. Interested? Check out @antonschafer's new paper!
LLMs can do amazing things these days—not only in their main language (English?), but also in other ones! Our paper identifies a surprising *potential* reason why: language imbalance! (see caveats in 🧵!) arxiv.org/pdf/2404.07982… + @ravfogel T. Hofmann @tpimentelms @ImanolSchlag
LLMs can do amazing things these days—not only in their main language (English?), but also in other ones! Our paper identifies a surprising *potential* reason why: language imbalance! (see caveats in 🧵!) arxiv.org/pdf/2404.07982… + @ravfogel T. Hofmann @tpimentelms @ImanolSchlag
Zuck on: - Llama 3 - open sourcing towards AGI - custom silicon, synthetic data, & energy constraints on scaling - Caeser Augustus, intelligence explosion, bioweapons, $10b models, & much more Enjoy! Links below
It's going to be hard to adapt Llama3 for Indic languages, in my opinion. Here are a few reasons why: 👉🏼 The tokenizer used is TikToken-based, which is not really efficient in tokenizing Indic text despite having a vocabular size of 121k. 👉🏼 unlike sentence-piece based models,…