Pavankumar Vasu @PavankumarVasu
Joined July 2013
Tweets 38
Followers 157
Following 122
Likes 38
📢 FastVLM models are now on 🤗
📢 Releasing MobileCLIP2 (TMLR Featured). Small embedding models that can power your multimodal RAG applications on resource constrained devices. Models are available on 🤗
🚀Releasing MobileCLIP2 (TMLR Featured). MobileCLIP2-S4 matches acc of SigLIP-SO400M/14 while 2x smaller and surpasses DFN ViT-L/14 at 2.5x faster. Paper: arxiv.org/abs/2508.20691 Code: github.com/apple/ml-mobil… RayGen: github.com/apple/ml-mobil… 🤗huggingface.co/collections/ap… #Apple MLR
🚨📅The submission deadline for #NeurIPS 2025 CCFM Workshop is just 8 days away on August 22. Get your papers in! Submit your work on Continual and Compatible Foundation Model Updates to the #NeurIPS 2025 CCFM Workshop. Learn more: sites.google.com/view/ccfm-neur…
Introducing DINOv3 🦕🦕🦕 A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale. High quality dense features, combining unprecedented semantic and geometric scene understanding. Three reasons why this matters…
🚀 We're thrilled to launch four new OCR datasets with 20M images: DoclingMatix, SynthFormulaNet, SynthCodeNet, and SynthChartNet. We used them to train SmolDocling, our ultra‑compact (256M) full-page document conversion VLM with performance rivaling models up to 27× larger.
Uncertainty quantification (UQ) is key for safe, reliable LLMs... but are we evaluating it correctly? 🚨 Our ACL2025 paper finds a hidden flaw: if both UQ methods and correctness metrics are biased by the same factor (e.g., response length), evaluations get systematically skewed
🌟Explore key insights from the FastVLM project (real-time vision-language model) in this blog post: machinelearning.apple.com/research/fast-…
📢Submissions are now open for #NeurIPS2025 CCFM workshop. Submission deadline: August 22, 2025, AoE. Website: sites.google.com/view/ccfm-neur… Call for papers: sites.google.com/view/ccfm-neur… Submission Link: openreview.net/group?id=NeurI…
We propose new scaling laws that predict the optimal data mixture, for pretraining LLMs, native multimodal models and large vision encoders ! Only running small-scale experiments is needed, and we can then extrapolate to large-scale ones. These laws allow 1/n 🧵
📣 We are excited to present our work on inferring user preferences from writing samples at @icmlconf Poster Session 3 (Wed. 11:00AM - 1:30PM)! Come by to ✋ chat with us, 📄 learn about our method, and 💻 hear about our new interactive benchmark (🔗s below)!
🚀Super excited to share TiC-LM (Oral at #ACL2025)! How to keep FMs up-to-date over months/years? We have a benchmark and lots of insights (arxiv.org/abs/2504.02107). Also organizing a related @NeurIPSConf 2025 workshop continual and compatible FMs (CCFM: sites.google.com/view/ccfm-neur…)…
I will be attending #CVPR2025 and presenting our latest research at Apple MLR! Specifically, I will present our highlight poster--world consistent video diffusion (cvpr.thecvf.com/virtual/2025/p…), and three workshop invited talks which includes our recent preprint ★STARFlow★! (0/n)
Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of dexterous human manipulation to date — 829 hours of egocentric video + paired 3D hand poses across 194 tasks. Now on arxiv: arxiv.org/abs/2505.11709 (1/4)
Excited to introduce FocalLens: an instruction tuning framework that turns existing VLMs/MLLMs into text-conditioned vision encoders that produce visual embeddings focusing on relevant visual information given natural language instructions! 📢: @HPouransari will be presenting…
Here is an RL perspective on understanding LLMs for decision making. Are LLMs best used as: policies / rewards / transition functions ? How do you fine-tune them ? Can LLMs explore / exploit ? 🧵 Join us down this rabbit hole... (ICLR 2025 paper, done at ML Research)
Excited to share our new paper on "Reversal Blessing" - where thinking BACKWARDS makes language models smarter on some multiple-choice questions! We found that right-to-left (R2L) models consistently outperform traditional left-to-right (L2R) models on certain reasoning tasks.🧵
🚨 One question that has always intrigued me is the role of different ways to increase a model's capacity: parameters, parallelizable compute, or sequential compute? We explored this through the lens of MoEs:

norules @kulamlenikalam
106 Followers 1K Following
sorryvarkar ki bap @sorryVarkar9
57 Followers 1K Following
Maryam Honari @HonariMaryam
131 Followers 715 Following Poking Reinforcement Learning & Language models,@microsoft /ABK #MLAgent ex-@unity3d ex-RA @uvic
Gaurav Toshniwal @GauravToshn2017
142 Followers 1K Following Investing, Technology and Entertainment
Sanjoy Chowdhury @schowdhury671
199 Followers 944 Following Research Intern @Apple | Past @Meta Reality Labs, @googleresearch, @AdobeResearch, @samsungresearch | PhD student @umdcs | Computer Vision, Multi-modal learning
Ved Chitnis @ved_chitnis
62 Followers 362 Following Fullstack Engineer | Building beatcode | erevald @_buildspace
Wojtek Mandrysz @wmandrysz
1K Followers 635 Following product engineering @visuelapp ex @craftdocsapp @emergetools
Satish @satishmummadi
927 Followers 6K Following Full Stack engineer, Interested in all things WebGL/WebXR, ML, Generative AI, Cybersecurity, SOC Tools, Rust, AR/VR, ASR/STT Tech, Gaussian Splats
Anirudh Thatipelli @AThatipelli
541 Followers 5K Following PhD-CS @UCFCRCV, MS-CS @UCR_CSE, Former Applied Science Intern at @amazon
Katarina @LDoyle25616
63 Followers 3K Following
氚 @JiangL88955
23 Followers 1K Following
Kevin KKGo @KevinKKGo
474 Followers 426 Following Passionate about AI, firmly believing in AGI. Working on VLM & LLM projects, striving to push the boundaries of AI innovation. Bullish on $TSLA.
Donnie @donnieDabeloved
563 Followers 1K Following Christian | Android & IOS Dev | Dropout | John Dillermand 😜
vaibhav yadav @vaibhavyadav025
21 Followers 521 Following Trying to solve fundamental problems in the world using technology.
Justina @dubuque_ri37728
52 Followers 3K Following
Yahya @XIOS09
10 Followers 113 Following
rbbbie @rbbbie
71 Followers 732 Following digital panpsychist that is often designing & sometimes coding.
Tim Qian @Tim_Qian
4K Followers 2K Following Building 🤖 https://t.co/fqCRAjyISg 💬 https://t.co/8RcywLCRZm 🏪 https://t.co/WyHDMRg3kQ (coming soon) Built ⭐️ https://t.co/w436snTDL0 (acquired by @bytebase) ❤️ https://t.co/14rcSMYnai
Chris | Yicheng @ChrisYicheng
2K Followers 516 Following AI-native Games & Engine | Tech & Freedom | Positive Feedback Loop 🧬 Dissipative Structure | prev @Ethereum & @Scroll_zkp & ML startup (acquired)
Jason Tagomori @jtagomori
567 Followers 4K Following @digestbuilder 📲 Helping Nonprofit News Orgs grow audience & revenue with Mobile Apps
Janny @kimsungwhee
30 Followers 734 Following
lI @qiuyangnie
185 Followers 6K Following
eyubupayayibupa @eyubupayayibupa
1 Followers 142 Following
YC @echim2021
147 Followers 347 Following iOS Developer & Product Manager from Taiwan🧋/ Release apps: 📓 Vini: https://t.co/AtECDGybIB 🥂 Check-in Box(TCA): https://t.co/vWDQR9NNfr
Miguel García 👨... @miguelgarciadev
30 Followers 176 Following Dev & Entrepreneur. Enthusiast of disruptive technologies (AI, Blockchain, OpenBanking), exploring their purposeful impact in real-world settings. 🌐☕👨💻
Tom Lynch @Tom_A_Lynch
3K Followers 1K Following computational engineer, technology brother, amateur arborist, hypothetical fed.
Kyrylo Pokutnyy ... @KPokutnyy
283 Followers 3K Following I'm a researcher bridging VR, AR, AI and Web3.0. Co-founder of VR\AR studio Sensorama Lab and contributor at The Culture DAO virtual beings creators guild
Shouken @Shoukenbands
48 Followers 830 Following 22yr old dev + reverse engineer. Sourcery & Ovrscope (prev)
Ayata @Ayata_HS
0 Followers 97 Following
MN @nilsonios
120 Followers 807 Following UI/UX & dev | now building @unplugifyapp ⌘ curious about space, AI, crypto, and aliens . 🚀📱👽
Janek Mann @janekm
1K Followers 4K Following
DA Kanan @kanan71860
1 Followers 221 Following
Won @trilliwon
41 Followers 347 Following Building https://t.co/CT7hgRP2Kz | Sharing my journey to $100k MRR
erubi @erubi
6 Followers 5K Following
Wenhu Chen @WenhuChen
23K Followers 671 Following AI researcher. Interested in Reasoning, Multimodal. I direct TIGER-Lab. Author of PoT, MMMU, MMLU-Pro, MAmmoTH, LongRAG, MAP-Neo, YuE, VL-Rethinker
Xenova @xenovacom
14K Followers 393 Following Bringing the power of machine learning to the web. Currently working on Transformers.js (@huggingface 🤗)
Angjoo Kanazawa @akanazawa
18K Followers 621 Following Assistant Professor at @Berkeley_EECS, @berkeley_ai. KAIR, @nerfstudioteam. Previously advised @WonderDynamics and @LumaLabsAI. she/her.
DatologyAI @datologyai
2K Followers 11 Following DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better, smaller models which train faster.
Prince Canuma @Prince_Canuma
7K Followers 1K Following Apple MLX King 🤴🏽• ML Research Engineer👨🏾💻 • VLMs • LLMs • Speaker • Writer • Ex-@arcee_ai • @neptune_ai • https://t.co/iZnxoefJBU
Rudrank Riyam @rudrankriyam
14K Followers 1K Following Author & Speaker | Featured Developer | WWDC '19 Scholar
Vimal Mollyn @mollyn_paan
708 Followers 741 Following PhDing @CMUHCII | @RealityLabs | Prev @Apple @IITMadras | Sensors + Machine Learning
David Fan @DavidJFan
602 Followers 240 Following Facebook AI Research (FAIR) | Video Representations, Self-Supervised Learning | @Princeton Computer Science '19
Yizhe Zhang @YizheZhangNLP
1K Followers 533 Following Research Scientist at Apple MLR | ex-researcher @ Microsoft Research, Meta AI | PhD @ Duke University
Fangchang Ma @fangchangma
575 Followers 393 Following Building @nuance_AI - we are hiring. Previously @Apple. PhD at @MIT.
Jürgen Schmidhuber @SchmidhuberAI
164K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
Miguel Angel Bautista @itsbautistam
3K Followers 195 Following I am a research scientist at MLR, working on generative modeling of all the things (image, 3D, graphs, etc). I like to make complex approaches Simple 🇪🇸🇺🇸
merve @mervenoyann
80K Followers 5K Following open-sourceress at @huggingface 🧙🏻♀️proud Aegean, I work on computer vision, VLMs & agents | gençleri serbest bırakın
Soumith Chintala @soumithchintala
252K Followers 1K Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
Abhay Gupta @gupta__abhay
399 Followers 2K Following Scaling and efficiency lead @DbrxMosaicAI | Previously @CerebrasSystems @CMU_Robotics | Making GPUs and agents go brrrr !!
Shuangfei Zhai @zhaisf
2K Followers 97 Following Research Scientist & Manager, Machine Learning Research @ Apple
Vishaal Udandarao @vishaal_urao
1K Followers 1K Following @ELLISforEurope PhD Student @bethgelab; Currently @Apple; Previously @GoogleAI @GoogleDeepMind @Cambridge_Uni @RutgersU @iiitdelhi
Jason Ramapuram @jramapuram
1K Followers 563 Following ML Research Scientist MLR | Formerly: DeepMind, Qualcomm, Viasat, Rockwell Collins | Swiss-minted PhD in ML | Barista alumnus ☕ @ Starbucks | 🇺🇸🇮🇳🇱🇻🇮🇹
Michael Black @Michael_J_Black
85K Followers 706 Following Director, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
SkalskiP @skalskip92
37K Followers 1K Following Open-source Lead @roboflow. VLMs. GPU poor. Dog person. Coffee addict. Dyslexic. | GH: https://t.co/dEmzMDGq5H | HF: https://t.co/4Lx1Yw34W7
Jiatao Gu @thoma_gu
5K Followers 2K Following Assistant Prof @CIS_Penn and ML Researcher at @Apple (MLR) | exFAIRer | PhD @HKUniversity | Research on Generative AI for multimodal. I can also speak Japanese.
Papers with Code @paperswithcode
115K Followers 10 Following Our mission is to organize science by converting information into useful knowledge.
Karnataka Development... @IndexKarnataka
40K Followers 16 Following Follow For Development or Investment updates in Karnataka ನಮ್ಮ ಕರುನಾಡು💛❤️
NHSRCL @nhsrcl
42K Followers 210 Following National High-Speed Rail Corporation Ltd is a Joint Venture of Government of India and Participating State Governments for Implementing High-Speed Rail Projects
Zeynep Akata @zeynepakata
6K Followers 299 Following Professor of Computer Science at @TU_Muenchen and Director @Helmholtz_Munich, @ELLISforEurope Fellow
Sanja Fidler @FidlerSanja
16K Followers 490 Following Associate Professor @UofT, Vice President of AI Research @nvidia, founding member of @VectorInst. Computer vision, deep learning, 3D. Opinions are my own.
Matthias Niessner @MattNiessner
41K Followers 241 Following Professor for Visual Computing & Artificial Intelligence @TU_Muenchen Co-Founder @synthesiaIO Co-Founder @SpAItial_AI
Percy Liang @percyliang
85K Followers 419 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianist
Durk Kingma @dpkingma
50K Followers 404 Following @AnthropicAI. Prev. @Google Brain/DeepMind, founding team @OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD.
#CVPR2025 @CVPR
49K Followers 330 Following Official account for IEEE/CVF Conference on Computer Vision & Pattern Recognition. Hosted by @deblinaforAI @jbhaurum & @CSProfKGD
Lucas Beyer (bl16) @giffmana
109K Followers 522 Following Researcher (now: Meta. ex: OpenAI, DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://t.co/xe2XUqkKit ✗DMs → email
Pavan Kumar Reddy @reddy1729
189 Followers 623 Following Government Servant. Indian Audit and Accounts Service. Tweets are personal.
Databricks Mosaic Res... @DbrxMosaicAI
41K Followers 120 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.
Stability AI @StabilityAI
243K Followers 21 Following We’ll help you make it like nobody’s business. Multimodal media generation and editing tools to get your idea to production. Self-deploy? 👍 Need a partner? 🤝
clem 🤗 @ClementDelangue
156K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
fly51fly @fly51fly
8K Followers 2K Following BUPT prof | Sharing latest AI papers & insights | Join me in embracing the AI revolution! #MachineLearning #AI #Innovation
Cerebras @CerebrasSystems
34K Followers 256 Following The world's fastest AI inference and training. Try the latest open models at: https://t.co/jREGhLI2nj
Yinfei Yang @yinfeiy
446 Followers 168 Following
Hugging Face @huggingface
566K Followers 210 Following The AI community building the future. https://t.co/VkRPD0Vclr