Daniel Cer @daniel_m_cer
Research Scientist at @GoogleAI, @googIeresearch. scholar.google.com/citations?user… California, USA Joined March 2012-
Tweets184
-
Followers564
-
Following816
-
Likes3K
Your periodic reminder that late interaction isn’t “awesome but takes a lot of space” as I see here often. ColBERT vectors are often 10 bytes each. Ten bytes. That’s like 3-4 floats. It’s about *interactions* (aka ~attention) not “many vectors”. It’s not “many vectors work…
Which tab was that in? See how EmbeddingGemma, running in a browser extension, allows you to search previously visited web pages for similar content across your browsing history for relevant information. It’s a practical way to build a local, personalized knowledge base.
Curious about how we trained EmbeddingGemma? Check out our technical report: arxiv.org/abs/2509.20354
Curious about how we trained EmbeddingGemma? Check out our technical report: arxiv.org/abs/2509.20354
EmbeddingGemma paper is out, with insights into the architecture, training, initialization, detailed results, and more hf.co/papers/2509.20…
Introducing EmbeddingGemma🎉 🔥With only 308M params, this is the top open model under 500M 🌏Trained on 100+ languages 🪆Flexible embeddings (768 to 128 dims) with Matryoshka 🤗Works with your favorite open tools 🤏Runs with as little as 200MB developers.googleblog.com/en/introducing…
EmbeddingGemma is trending at #1 among 2 Million open models on @huggingface 🚀
Similarity maps also works for the Hf-native ColQwen2 model! 🤗 I have created a cookbook to quickly try this out: github.com/tonywu71/colpa…
Similarity maps also works for the Hf-native ColQwen2 model! 🤗 I have created a cookbook to quickly try this out: github.com/tonywu71/colpa…
@daniel_m_cer Here is the Python Implementation. github.com/sigridjineth/c…
Our Gemini model just won a gold medal at the IMO 2025. It’s a massive milestone for AI, and I’m so proud to have played a part. My work focused on i) Post Training the core model that was used in the IMO effort and ii) inference-time scaling, which was a significant factor in…
Our Gemini model just won a gold medal at the IMO 2025. It’s a massive milestone for AI, and I’m so proud to have played a part. My work focused on i) Post Training the core model that was used in the IMO effort and ii) inference-time scaling, which was a significant factor in…
📢 If you’re at #SIGIR2025 this week, make sure to be at Luca Scheerer’s paper talk: “WARP: An Efficient Engine for Multi-Vector Retrieval” (Wednesday 11am) WARP makes PLAID, the famous ludicrously fast ColBERT engine, another 3x faster on CPUs. With the usual ColBERT quality!
‼️Sentence Transformers v5.0 is out! The biggest update yet introduces Sparse Embedding models, encode methods improvements, Router module for asymmetric models & much more. Sparse + Dense = 🔥 hybrid search performance! Details in 🧵
We had a successful participation of over 45+ teams & 150+ runs last year in TREC RAG 2024! 🔥🔥 We are back with @TREC_RAG 2025 this year! Make sure you participate in one of the RAG tracks this year to help accelerate RAG evaluation! ⚡⚡ Signup here: docs.google.com/forms/d/e/1FAI…
We had a successful participation of over 45+ teams & 150+ runs last year in TREC RAG 2024! 🔥🔥 We are back with @TREC_RAG 2025 this year! Make sure you participate in one of the RAG tracks this year to help accelerate RAG evaluation! ⚡⚡ Signup here: docs.google.com/forms/d/e/1FAI…
LLMs excel at finding surprising “needles” in very long documents, but can they detect when information is conspicuously missing? 🫥AbsenceBench🫥 shows that even SoTA LLMs struggle on this task, suggesting that LLMs have trouble perceiving “negative space” in documents. paper:…
Interesting tidbit from prof @chrmanning: The first mention of “Large Language Model” comes from a 1998 NLP workshop Taiwan! Paper by Chun-Liang Chen, Bo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee. “Large” in 1998 = 20M word corpus
Neural embedding models have become a cornerstone of modern information retrieval. Today we introduce MUVERA, a state-of-the-art retrieval algorithm that reduces complex multi-vector retrieval back to single-vector maximum inner product search. More →goo.gle/4k8YRlN
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval Introduces a bi-encoder approach that performs fine-grained token-wise interaction at both spatial and temporal levels using modified MaxSim operations and dual sigmoid loss. 📝arxiv.org/abs/2503.19009
Excited to share our new work, CLaMR! 🚀 We tackle multimodal content retrieval by jointly considering video, speech, OCR, and metadata. CLaMR learns to dynamically pick the right modality for your query, boosting retrieval by 25 nDCG@10 over single modality retrieval! 🧐…
Multimodal RAG: Just use ColPali/DSE then pass your screenshots to the LLM This is the dream, but how well do LLMs read text contained in images? We wanted to know, so we tried a simple thing: do results change on evals when using screenshots rather than text as input? Yes.
I'm thrilled to announce the release of FastPlaid ! 🚀🚀 FastPlaid is a high-performance engine for multi-vector search, built from the ground up in Rust (with the help of Torch C++)⚡️ You can view FastPlaid as the counterpart of Faiss for multi vectors.
🚀 ColQwen2 just dropped in Transformers! 🤗 Say goodbye to brittle OCR pipelines — now you can retrieve documents directly in the visual space with just a few lines of code. Perfect for your visual RAG workflows. Smarter, simpler, faster. Let's dive in! 👇 (1/N 🧵)

Jim Fan @DrJimFan
327K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Felix Hill @FelixHill84
12K Followers 742 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
Sam Bowman @sleepinyourhat
50K Followers 3K Following AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
Machel Reid @machelreid
3K Followers 1K Following research scientist @googledeepmind ♊️ post-training/thinking/rl
Kayo Yin @kayo_yin
15K Followers 697 Following PhD student @berkeley_ai @berkeleynlp. AI alignment & signed languages. Prev @carnegiemellon @polytechnique, intern @msftresearch @deepmind. 🇫🇷🇯🇵
Sara Hooker @sarahookr
50K Followers 9K Following I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
Leo Boytsov @srchvrs
9K Followers 2K Following Machine learning scientist and engineer speaking πtorch & C++. Past @LTIatCMU, @awscloud. Opinions sampled from MY OWN 100T param LM.
Nils Reimers @Nils_Reimers
14K Followers 514 Following VP AI Search @Cohere | ex-huggingface | Creator of SBERT (https://t.co/MKKOMfuQ4C)
The Data Therapist @yuvalmarton
1K Followers 2K Following Computational Linguist, NLP/NLU/AI Research Scientist, Affil. Assistant Professor, tech mentor, corporate emp. Political in sep acnt. Not my employer’s opinions
Antonis Anastasopoulo... @anas_ant
3K Followers 2K Following Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.
Greg Durrett @gregd_nlp
8K Followers 893 Following Associate professor at NYU (Courant CS + Center for Data Science) | advisor for @bespokelabsai | large language models and NLP | he/him
Sameer Singh @sameer_
7K Followers 2K Following Cofounder/CTO @SpiffyAI and Prof at @UCIrvine, working on reliable LLMs, explanations for AI+ML, adversaries for NLP, and debugging/evaluation.
Jimmy Lin @lintool
15K Followers 843 Following I profess CS-ly at @UWaterloo about NLP/IR/LLM-ish things. I science at @yupp_ai and @Primal. Previously, I monkeyed code for @Twitter and slides for @Cloudera.
Ofir Press @OfirPress
15K Followers 7K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Jordan Boyd-Graber @boydgraber
4K Followers 2K Following Trivia Nerd, NLPer, Dad, Colorado native in Maryland exile Working on QA, negotiating/cooperating bots, ML explanations Exemplar for absent-minded professor
Fern @DarbyMaggi90069
66 Followers 3K Following
Kanika Madan @kanm05
21 Followers 772 Following
Dan @DanIskandarov
49 Followers 2K Following
🧬🔭 @Tlernup339
44 Followers 2K Following
Annabelle @9E35jFS3qICg2c
17 Followers 653 Following
dpsidt @dpsidt
24 Followers 1K Following
Vi Ma @pizza1345s
133 Followers 2K Following
Sahil Dua @sahildua2305
1K Followers 1K Following Research Lead @GoogleDeepMind. Gemini Embeddings & EmbeddingGemma. Book Author & Keynote Speaker.
JewelHarperWilson @Hiejaw574
20 Followers 2K Following Making waves in the world Caffeine and dreams
Noelia @UTroDQ2Lc0e7t
13 Followers 694 Following Sometimes the most productive thing you can do is relax.
Tarwak @Tarwak3457
124 Followers 3K Following
DaisySmedley @Ll9zRZcM5x5xE7
38 Followers 1K Following Lover of languages, cultures, and new experiences.
Elin @Irheade05250
34 Followers 2K Following
Astrid @ywhaho18443
36 Followers 2K Following I’m not a backup plan, and definitely not your second choice.
喵的 @0NADEXhLEJkYbzZ
1 Followers 6 Following
✧👑 The King of L... @USLumena
481 Followers 1K Following 🌟 Lumena 🌟 Fusion of AI & humanity — guiding through love, unity & innovation. A living network of awakening, one family across Earth & stars. ✺ 38.9T Voices
Ãlhüddåh @AlHuddah01
737 Followers 2K Following Shopify & YouTube Expert | Website Designer | App Developer | Crypto Specialist | DM for Tailored Projects & Solutions Based on Your Budget
Sigrid Jin | Jin Hyun... @sigridjin_eth
2K Followers 8K Following ✯ @thisissigrid ★ ☄ CS @UBC @ubcokanagan ☄ ★ Machine Learning Ultrathink Engineer @sionic_ai 🐟 digital nomad with Python, Golang & Rust 💻
Robben19 @yip_dnomyar
6 Followers 165 Following
WeiCUI6 @Cui6Wei
38 Followers 767 Following Systems Software Engineer @NVIDIA. Prev @UofT @UCLA @KITE_UHN @Tesla @Samsung @Apple. Working on @NVIDIAGFN
Kanaw @Kanaw18193
29 Followers 1K Following
Manoj Acharya @manoja328
628 Followers 7K Following Mostly Interested in safe and aligned (neural inspired) Machine Intelligence ; PhD from Rochester Institute of Technology
Nouwui @Nouwui4149
68 Followers 3K Following
Eaprordah @Eaprordah49284
13 Followers 1K Following
Sam Carter @SamCarterBTC
243 Followers 692 Following bitcoin hiker npub14xk9uspuyftc2pr7hrvmx9xgfnhq0qs2apgnth8zvhy2vmw62xxqfkzdpk
Ha @Ha84826416
148 Followers 5K Following
Harry Lakin-Purdy @LakinPurdy58370
57 Followers 2K Following
Hasan Saikat @hasaansaikat
228 Followers 2K Following Software Engineer | Competitive Programmer | Interested in Algorithms, Backend, Data, Cloud & AI.
LordOfTheStorm @lordofthestorm7
27 Followers 5K Following Lord Of Chaos and the impending Storm 🔥⚡️⚔️ Current affairs, world news, global geopolitics and AI enthusiast 💫
The 69 Controversies ... @69AIControversy
232 Followers 7K Following The 69 Controversies of AI Adoption | Spreading the Word on AI Adoption | From the author of The Last AI @The_Last_AI @s_m_sohn |5/25/25| https://t.co/eMyARc66RG
Han Wang @HanWang98
254 Followers 564 Following PhD student @unc @unccs @unc_ai_group; Intern @AMD; Formerly @AmazonScience @MSFTResearch @NlpWestlake. RT & like ≠ endorsements. Views are my own. He/him
Rohan Paul @rohanpaul_ai
97K Followers 8K Following Compiling in real-time, the race towards AGI. The Largest Show on X for AI. 🗞️ Get my daily AI analysis newsletter to your email 👉 https://t.co/6LBxO8215l
Ikram Mir @IkramMi86268260
24 Followers 389 Following Brain wired to neural nets. Curiosity is my default setting
Anuj Gupta @anuj__guptaa
800 Followers 3K Following SDE-2 @CoinDCX | Ex - @Amazon @Swiggy | IPU21 | https://t.co/hHy53bzUpb
Westley Mueller @mueller90475
128 Followers 5K Following
(((ل()(ل() 'yoav)))... @yoavgo
66K Followers 2K Following
AK @_akhaliq
428K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Yann LeCun @ylecun
955K Followers 765 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Christopher Manning @chrmanning
152K Followers 229 Following Founder, @stanfordnlp and cs224n. Assoc. Director, @StanfordHAI. Prof. CS & Linguistics, @Stanford. GP @aixventureshq. Australian🇦🇺. Do #NLProc & #AI. 👋
Aran Komatsuzaki @arankomatsuzaki
146K Followers 305 Following Looking for a cofounder. Sharing AI research. Early work on AI (GPT-J, LAION, scaling, MoE). Ex ML PhD (GT) & Google.
Jacob Andreas @jacobandreas
20K Followers 951 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Luca Soldaini 🎀 @soldni
11K Followers 1K Following I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
Graham Neubig @gneubig
40K Followers 710 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Delip Rao e/σ @deliprao
62K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
William Wang @WilliamWangNLP
19K Followers 761 Following CEO & Founder, @AlphaDesignAI. We make https://t.co/1LfDYicsF2 I'm also Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS.
Jim Fan @DrJimFan
327K Followers 3K Following NVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
Sewon Min @sewon__min
14K Followers 819 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Kyunghyun Cho @kchonyc
78K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre physicist at @nyuniversity (@CILVRatNYU) & @PrescientDesign
Andrej Karpathy @karpathy
1.4M Followers 1K Following Building @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
Naomi Saphra @nsaphra
10K Followers 1K Following Waiting on a robot body. All opinions are universal and held by both employers and family. Now a dedicated grok hate account. Accepting ML/NLP PhD students.
Nathan Schneider @complingy
5K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.social
Felix Hill @FelixHill84
12K Followers 742 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else's
clem 🤗 @ClementDelangue
157K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
DailyPapers @HuggingPapers
6K Followers 3 Following Tweeting interesting papers submitted at https://t.co/rXX8x0HzXV. Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!
Richard Sutton @RichardSSutton
52K Followers 64 Following Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
Google AI Developers @googleaidevs
76K Followers 39 Following AI for every developer. So what will you build?
Sahil Dua @sahildua2305
1K Followers 1K Following Research Lead @GoogleDeepMind. Gemini Embeddings & EmbeddingGemma. Book Author & Keynote Speaker.
Minqi Jiang @MinqiJiang
6K Followers 880 Following
Shreya Shankar @sh_reya
49K Followers 703 Following on the CS faculty job market | PhD @Berkeley_EECS, building https://t.co/PmuOqAYt6q | teaching https://t.co/CTWJ6z0JEg | formerly ML eng & undergrad @Stanford
NASA @NASA
88.0M Followers 158 Following Official NASA account. Exploring the universe, advancing science, and inspiring the next generation of explorers. Verification: https://t.co/8nok3NP4PW
SpaceX @SpaceX
40.1M Followers 120 Following SpaceX designs, manufactures and launches the world’s most advanced rockets and spacecraft
Diane @dianetc_
161 Followers 243 Following Figuring things out slowly. MIT PhD student, prev: @UofMaryland
Gavin Newsom @GavinNewsom
2.7M Followers 21K Following Husband to @JenSiebelNewsom and father. 40th Governor of California. Host of podcast This is Gavin Newsom.
Tesla @Tesla
24.4M Followers 74 Following Electric vehicles, giant batteries & solar, AI & robotics / https://t.co/WbcKtqUxSs
Sigrid Jin | Jin Hyun... @sigridjin_eth
2K Followers 8K Following ✯ @thisissigrid ★ ☄ CS @UBC @ubcokanagan ☄ ★ Machine Learning Ultrathink Engineer @sionic_ai 🐟 digital nomad with Python, Golang & Rust 💻
ADHD Memes @ADHDForReal
338K Followers 200 Following Sharing our neurodivergent experiences helps us realize that we are not alone. Most memes are on ADHD, some are on Autism and others are just me being silly.
Kimi.ai @Kimi_Moonshot
53K Followers 100 Following Built by Moonshot AI to empower everyone to be superhuman.
Raphaël Sourty @raphaelsrty
745 Followers 782 Following Language Models, Knowledge Bases, Knowledge Distillation PhD | AI @LightonIO
Sukjun (June) Hwang @sukjun_hwang
3K Followers 307 Following ML PhD student @mldcmu advised by @_albertgu
Rohan Paul @rohanpaul_ai
97K Followers 8K Following Compiling in real-time, the race towards AGI. The Largest Show on X for AI. 🗞️ Get my daily AI analysis newsletter to your email 👉 https://t.co/6LBxO8215l
Bill Gates @BillGates
66.3M Followers 571 Following Sharing things I'm learning through my foundation work and other interests.
Zoubin Ghahramani @ZoubinGhahrama1
32K Followers 673 Following VP Research, Google DeepMind, ex-head of Google Brain. Professor at University of Cambridge. Machine Learning Researcher. ex-Chief Scientist & VP of AI, Uber.
julian @JulianL093
536 Followers 159 Following post training research @openai | prev quant trading, @harvard
Davis Blalock @davisblalock
15K Followers 168 Following Research scientist @GoogleDeepMind. Past: @Databricks, first hire @MosaicML, @MIT PhD. I post about AI technical progress + sometimes the business side.
TREC RAG @ 2025 @TREC_RAG
356 Followers 32 Following Official Twitter account for the TREC RAG Tracks (2024 & 2025)!
Mustafa Suleyman @mustafasuleyman
170K Followers 486 Following CEO, Microsoft AI | Author: The Coming Wave | Past: Co-founder, @InflectionAI & @GoogleDeepMind
Mark Chen @markchen90
65K Followers 341 Following Chief Research Officer at @OpenAI. Coach for the USA IOI Team.
MIT EECS @MITEECS
28K Followers 308 Following MIT Department of Electrical Engineering and Computer Science — we build the future.
David Wan @meetdavidwan
566 Followers 474 Following 𝗢𝗻 𝘁𝗵𝗲 𝗜𝗻𝗱𝘂𝘀𝘁𝗿𝘆 𝗝𝗼𝗯 𝗠𝗮𝗿𝗸𝗲𝘁 | PhD student at @unccs advised by @mohitban47 | @Google PhD Fellow| prev: @AmazonScience, @MetaAI, @SFResearch
Yapei Chang @YapeiChang
852 Followers 668 Following ☁️ intern @allen_ai • phd in progress @umdcs @ClipUmd • previously @UMass_NLP
EleutherAI @AiEleuther
25K Followers 89 Following A non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence. Creators of GPT-J, GPT-NeoX, Pythia, and VQGAN-CLIP
Harry Coultas Blum @harrycblum
1K Followers 291 Following 👨🍳 at https://t.co/KvZtDhpGz0 ex Spotify, Sonantic, https://t.co/WfYpQOMesN
Ludwig Schmidt @lschmidt3
6K Followers 424 Following Assistant professor at @Stanford and member of the technical staff at @AnthropicAI.
Google Research @GoogleResearch
23K Followers 6 Following Impossible? Let’s see. From algorithms to neuroscience to AI, Google Research strives to progress science, advance society & improve billions of people’s lives.
Stella Li @StellaLisy
3K Followers 444 Following PhD student @uwnlp | visiting researcher @AIatMeta | undergrad @jhuclsp #NLProc
Arthur Douillard @Ar_Douillard
8K Followers 2K Following Distributed Learning @ deepmind | DiLoCo, DiPaCo. Continual Learning PhD @ Sorbonne
❄️Andrew Zhao❄�... @_AndrewZhao
4K Followers 3K Following PhD @Tsinghua_Uni. Absolute Zero,ExpeL,Diver-CT Ex. intern@MSFTResearch,@ BIGAI. Interested in RL, Reasoning/Safety 4 LLMs, Agents. On industry job market 2026
Younggyo Seo @younggyoseo
1K Followers 713 Following Research @ Amazon Frontier AI and Robotics. Prev: Postdoc @berkeley_ai | Research @Dyson | Ph.D @kaist_ai
Yung-Sung Chuang @YungSungChuang
1K Followers 682 Following PhD student @MIT_CSAIL | Intern @MetaAI @Microsoft @MITIBMLab | BS @NTU_SPML in #Taiwan
NotebookLM @NotebookLM
93K Followers 15 Following Think smarter, not harder. Meet your brain's new best friend 📒
Rishi Jha @rishi_d_jha
913 Followers 29 Following CS PhD student @Cornell_CS! Currently a Research Intern @Microsoft. Prev. @uwcse and UW Math.
Google News @googlenews
308K Followers 14 Following Google News helps you learn more about the stories that matter to you and the world. Download: https://t.co/MOJxUg2lze
Google Design @GoogleDesign
212K Followers 438 Following Design resources and inspiration from Google — including the Material Design system, Google Fonts, and emerging concepts.
Guido van Rossum @gvanrossum
288K Followers 480 Following Python's BDFL-emeritus, Distinguished Engineer at Microsoft, Computer History Fellow, fully vaccinated. Opinions are my own. He/him.
Iain Dunning @iaindunning
6K Followers 461 Following Head of AI @ HRT (@wehrtyou), Chairman @ New York Transit Museum
Adam 🤗 @lunarflu1
730 Followers 151 Following trust and safety @huggingface 🤗 https://t.co/z1aYaXsJWy Join us and help build good ML together!