sharathts @sharath_ts
Deep Learning Research @Nvidia | United States | Joined December 2009
Tweets 11 | Followers 23 | Following 224 | Likes 76
NVIDIA has released Nemotron Nano 9B V2, a small 9B reasoning model that scores 43 on the Artificial Analysis Intelligence Index, the highest yet for <10B models. Nemotron 9B V2 is the first Nemotron model pre-trained by @nvidia. Previous Nemotron models have been developed by…
📢New efficient Hybrid-SLM from NVIDIA-Nemotron-Nano-v2-9B: ❗️6x faster than Qwen3-8B because of Hybrid (Mamba2+Attention) design. We tried something new: pretrain & align a 12B reasoning model → compress to 9B. First real stab at reasoning-model compression. Key takeaways…
Sharing our team’s latest work on Hymba - an efficient small language model with hybrid architecture. Tech report: arxiv.org/abs/2411.13676 Discover the tradeoff between Mamba and Attention, how they can be combined, how attention sink and forced-to-attend phenomena can be…
📣 Announcing Mistral-NeMo-Minitron 8B Instruct, one of the most advanced models in its size, delivering higher accuracy and lower computational cost, with leading performance on six important benchmarks. Technical deep dive ➡️ developer.nvidia.com/blog/mistral-n… This powerful model was…
Excited to introduce MN-Minitron-8B-Instruct 📗! We've developed an even more powerful instruct model than its parent, Mistral-NeMo-12B, with significant improvements over LLaMa3.1-8B-Instruct as well! Weights on HF: huggingface.co/nvidia/Mistral… Demo: build.nvidia.com/nvidia/mistral… Our…
LLaMa-3.2 models have been released, featuring smaller sizes of 1B and 3B parameters. These models are derived from an 8B parent model through pruning and distillation, using the same pipeline as Minitron, which we proposed this summer and it was accepted for NeurIPS! 😃 Pruning…
Amazing effort by the MagpieLM team! 👏 They showed that our pruned LLaMa-3.1-4B can even outperform larger 8B models! 🚀 Nice to see the community adapting our models—this encourages us to release more! 🎉💪
🌟 The best 8B Base model via pruning and distillation! 🚀 Introducing Mistral-NeMo-Minitron-8B-Base model we derived from the recent Mistral-NeMo-12B. Our recipe: finetune teacher on 100B tokens, prune to 8B params, run teacher-student distillation on <400B tokens. Result: the…
🚀 We've pruned LLaMa3.1 down to 4B parameters, delivering a smaller and more efficient model! Based on our recent paper: arxiv.org/abs/2407.14679 📖 Learn all about it in our blog: developer.nvidia.com/blog/how-to-pr… 🔗 META's announcement: ai.meta.com/blog/nvidia-ll… 👐 Checkpoints at HF this…
🚀 40x Faster Model Training via Pruning and Distillation! Permissive Minitron-4B and Minitron-8B models! 🔗 Paper: arxiv.org/abs/2407.14679 🔗 GitHub: github.com/NVlabs/Minitron 🔗 Models on HF: bit.ly/4ffjnQj Key highlights of 4B/8B models: 📊 2.6B/6.2B active…

Markus Kliegl @MarkusKliegl
92 Followers 123 Following Research scientist at NVIDIA | Previously deep learning at Apple, DeepMind, Baidu Research
Kezhi Kong @KezhiKong
349 Followers 377 Following Research Scientist @NVIDIA working on Nemotron-*; PhD @UMDCS; BS @ZJU_China; opinions are my own
Ameya Mahabaleshwarka... @ameyasm1154
68 Followers 420 Following Applied Scientist | Building Nemotron LLMs/SLMs @NVIDIA | Prev: @LTIatCMU @SCSatCMU
Jerel Nienow @JerelNieno66245
12 Followers 944 Following
Jessica @SharstarEqE
42 Followers 357 Following "The essence of being human is to be able to create your own meaning." – Viktor Frankl
Dosoyn @Dosoynzbr1B4J
35 Followers 4K Following
Brenda @ogiuchinan41430
74 Followers 7K Following
Thousharn @Thousharnnn2
27 Followers 4K Following Don't remember other people's advice until you fail, and don't remember to cherish something until you lose it.
Wenjie Zheng @wjzheng_nlp
33 Followers 100 Following PhD student | Interested in Multimodal Learning | Feel free to connect me. 🧸
Saurav Muralidharan @srv_m
184 Followers 247 Following Research Scientist @NVIDIA | Making LLMs More Efficient
Shizhe Diao @shizhediao
4K Followers 2K Following Research Scientist @NVIDIA focusing on efficient post-training of LLMs. Finetuning your own LLMs with LMFlow: https://t.co/UTykmQBwFr Views are my own.
Yvonne @yvonne_hicks89
265 Followers 3K Following
Una @una_booker_
318 Followers 3K Following
James Stephenson @ICannot_Enougho
98 Followers 778 Following @ElonMusk and I own Tesla (along with several other people). https://t.co/RRas2pWwcx
Varsha M Athreya @varshamathreya
90 Followers 130 Following Auditory Neuroscientist in the making | PhD Student @PurdueSLHS | Audiologist @aiishmysuru
Swetha Mandava @swethmandava
533 Followers 483 Following Founder @ https://t.co/Off85tH1HE | Previously @YouSearchEngine @Nvidia @CarnegieMellon #Manipal
Rabeeh Karimi @KarimiRabeeh
1K Followers 769 Following past: @meta. PhD in NLP at @EPFL. Intern @allen_ai, Intern 2×@Google, @Meta, @Deepmind.
Artificial Analysis @ArtificialAnlys
60K Followers 568 Following Independent analysis of AI models and hosting providers - choose the best model and API provider for your use-case
Kezhi Kong @KezhiKong
349 Followers 377 Following Research Scientist @NVIDIA working on Nemotron-*; PhD @UMDCS; BS @ZJU_China; opinions are my own
Arash Vahdat @ArashVahdat
10K Followers 877 Following Research Director, leading fundamental generative AI research (GenAIR) @nvidia research, views are my own.
Markus Kliegl @MarkusKliegl
92 Followers 123 Following Research scientist at NVIDIA | Previously deep learning at Apple, DeepMind, Baidu Research
Sanjeev Satheesh @issanjeev
536 Followers 401 Following
Ameya Mahabaleshwarka... @ameyasm1154
68 Followers 420 Following Applied Scientist | Building Nemotron LLMs/SLMs @NVIDIA | Prev: @LTIatCMU @SCSatCMU
Demis Hassabis @demishassabis
495K Followers 152 Following Nobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.
Mostofa Patwary @mapatwary
34 Followers 16 Following
Paul Rosolie @PaulRosolie
24K Followers 613 Following Founder of https://t.co/rzTAXeQbZy ✶ Protecting 66,000 Acres of Amazonia ✶ Expedition Naturalist @tamanduaexpeditions @ageofunion
Shizhe Diao @shizhediao
4K Followers 2K Following Research Scientist @NVIDIA focusing on efficient post-training of LLMs. Finetuning your own LLMs with LMFlow: https://t.co/UTykmQBwFr Views are my own.
Saurav Muralidharan @srv_m
184 Followers 247 Following Research Scientist @NVIDIA | Making LLMs More Efficient
Ahmad Al-Dahle @Ahmad_Al_Dahle
20K Followers 106 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)
SSI Inc. @ssi
102K Followers 0 Following A straight shot to safe superintelligence. Join us https://t.co/hHla3vusDE.
Tri Dao @tri_dao
33K Followers 632 Following Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.
DeepSeek @deepseek_ai
972K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Adam Karvonen @a_karvonen
3K Followers 572 Following ML Researcher, doing MATS with Owain Evans. I prefer email to DM.
the tiny corp @__tinygrad__
59K Followers 134 Following We make tinygrad and sell tinybox, the best perf/$ AI computer. $25k for 4x 5090 in a quiet box. Our mission is to commoditize the petaflop.
Dougal Maclaurin @DougalMaclaurin
599 Followers 245 Following
Matthew Johnson @SingularMattrix
13K Followers 3K Following Researcher at Google Brain. I work on JAX (https://t.co/UGa5tGfinF).
Arthur Mensch @arthurmensch
54K Followers 860 Following Co-founder and CEO @MistralAI. Talk to le Chat https://t.co/ZMZG8rAlWz https://t.co/ydSK6xG4Ce https://t.co/b1uf0UK5U8
hessian.AI @Hessian_AI
2K Followers 290 Following Driving research excellence, education, practice and leadership in AI to foster economic growth and improve the human condition.
Lucie Flek @lucie_nlp
4K Followers 4K Following #NLProc researcher, computer science prof @UniBonn Nothing new to see here in 2025. You will find my news via https://t.co/ms0AC3UfbJ #eXit
Jeremy Howard @jeremyphoward
261K Followers 6K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Prev: professor @ UQ; Stanford fellow; @kaggle president; @fastmail/@enlitic/etc founder https://t.co/16UBFTX7mo
Lucas Beyer (bl16) @giffmana
110K Followers 524 Following Researcher (now: Meta. ex: OpenAI, DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://t.co/xe2XUqkKit ✗DMs → email
Rosanne Liu @savvyRL
46K Followers 1K Following (On mat leave.) Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS.
(((ل()(ل() 'yoav)))... @yoavgo
66K Followers 2K Following
Delip Rao e/σ @deliprao
62K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Horace He @cHHillee
42K Followers 540 Following @thinkymachines Formerly @PyTorch "My learning style is Horace twitter threads" - @typedfemale
Ross Wightman @wightmanr
23K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
Percy Liang @percyliang
85K Followers 420 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Kyunghyun Cho @kchonyc
78K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Tim Dettmers @Tim_Dettmers
39K Followers 994 Following Creator of bitsandbytes. Research Scientist @allen_ai and incoming professor @CarnegieMellon. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.
Björn Plüster @bjoern_pl
585 Followers 148 Following Founder and CTO of ellamind. LLM and open-source enthusiast. @ellamindAI, @DiscoResearchAI
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Hyung Won Chung @hwchung27
38K Followers 304 Following AI Research Scientist @Meta Superintelligence Labs. Past: @OpenAI / @Google Brain / PhD @MIT
Nathan Lambert @natolambert
57K Followers 860 Following Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc Contact via email. Writes @interconnectsai Wrote The RLHF Book Mountain runner