You need to try this tool! 🫡
My colleague @m_olbap built an interactive HF Space to explore the modular support of open models in transformers over time
👀 You’ll spot things like 🦙 llama defining many models or which ones could be modular next
Why is your KV so small? 🤏
In continuous batching, if you increase the max number of tokens per batch, you must decrease the memory allocated for your cache. In transformers, we make sure they are perfectly balanced (as all things should be).
No matter how big your model is🦠🐋
You have no idea what attention looks like 🤥
Many talk about attention like it's simple, but few know how it actually works. Even basic stuff like shapes and prefill / decode are not that easy to grasp.
Good thing HF is cooking a blogpost to help you out 🫂
Ever wondered how models actually see an image? Been playing with some visualizations of patch extraction, token layouts, how they affect predictions too.
Planning a short visual deep dive comparing how different models process images. Would love thoughts before I go on.
A quick update on the future of the `transformers` library!
In order to provide a source of truth for all models, we are working with the rest of the ecosystem to make the modeling code the standard.
A joint effort with vLLM, LlamaCPP, SGLang, Mlx, Qwen, Glm, Unsloth, Axoloth,…
The Transformers library is undergoing it's largest pivot to date 🙌
It now cements its role as the central model definition, irrespective of the backend and runner.
One ground truth to bring more reliability across the ecosystem.
Why is this important?
1K Followers 3K FollowingIndependent Researcher: AI Alignment, Theoretical Math & Physics, Cultural Frameworks, Ecology, Philosophy, & Emergent Abundance. 👯♀️ Dad
0 Followers 44 FollowingVery new to X. Still figuring this site out. Created in 2023, but didn't start using until Sept 2025.
Maintainer of WilmerAI.
Tinkerer.
1K Followers 8K FollowingAI inference, speculative decoding, open source. Built novel decoding algorithms – default in Hugging Face Transformers (150+ ⭐). Making AI faster + cheaper
543K Followers 24K FollowingThe best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
170 Followers 4K FollowingJunior@Nankai University | Major in CS | Research in GenAI & Infra | Full Stack Developer | Beginner in Crypto | Runner, Cyclist, Gym-goer | Rap enthusiast
534 Followers 7K Followingcurrently entertained by 'MCP on edge' + multimodal music reactive visuals // jr ML engineer (CV on edge) // coming from econ/sociology 👾
339 Followers 3K FollowingResearcher in math+formal methods+ml. Working on using formal verification to train models for mathematics and reasoning @harmonicmath
45 Followers 238 FollowingInspired & building physical AI, RL & world models
Founder https://t.co/IcrYN2zEc5 – first ML-powered clinic in the EU
Previously ML infra & k8s at @google & @apple
299 Followers 826 FollowingFull-stack Software developer at @medeloopai Prev: @GoTo, @Globant Spanish/English/French a lot of funny stuff. Opinions are my own.
15K Followers 7K FollowingI build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
1K Followers 3K FollowingMixture of amateurs. Exploring CS, machine learning, law & philosophy.
Mainly interested in NLP, graphs, and data protection legislation.
772 Followers 1K FollowingI'm trying to make reinforcement learning boring.
AI and Robotics @huggingface 🤗 @LeRobotHF 🤖
PhD. Candidate in RL.
One Piece nerd 🏴☠
Love spicy food 🌶️
79K Followers 1K Followingi teach AI on X
leader @openminedorg, research scientist @GoogleDeepMind, ABD PhD @OxfordUni, @UN @GovAI_ @CFR_org GrokkingDL
50K Followers 9K FollowingI lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
10K Followers 699 FollowingProfessor of Computer Vision, @BristolUniEng. Senior Research Scientist @GoogleDeepMind - passionate about the temporal stream in our lives.
19K Followers 20 FollowingA high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
40K Followers 328 FollowingI built a C library that lets you compile 12kb static binaries that run natively on Linux, Mac, Windows, FreeBSD, OpenBSD, NetBSD and BIOS using just GCC/Clang.
2K Followers 998 Followingpro-basilisk, techno-optimist. experimental AI products @datadoghq. former startup founder. over a decade of making AI friends (Bits, Alexa, Cortana, etc.)
1.1M Followers 2K FollowingFlaneur: probability (philosophy), probability (mathematics), probability (real life),Phoenician wine, deadlifts & dead languages. Greco-Levantine.Canaan. #RWRI
No recent Favorites. New Favorites will appear here.