Check out our Voxtral paper now on arxiv: arxiv.org/abs/2507.13264
Details on on pre-training, fine-tuning and alignment, with ablations covering how to chose the optimal model architecture and pre-training format!
Check out our Voxtral paper now on arxiv: arxiv.org/abs/2507.13264
Details on on pre-training, fine-tuning and alignment, with ablations covering how to chose the optimal model architecture and pre-training format! https://t.co/FKvOeFF2Y8
Introducing Mistral Small 3.2, a small update to Mistral Small 3.1 to improve:
- Instruction following: Small 3.2 is better at following precise instructions
- Repetition errors: Small 3.2 produces less infinite generations or repetitive answers
- Function calling: Small…
📰 News in Arena: Mistral Medium 3 makes a strong debut with the community!
Highlights:
💠 #11 overall in chat: a +90 point leap from Mistral Large
💠Top-tier in technical domains (#5 in Math, #7 in Hard Prompts & Coding)
💠#9 in WebDev Arena
Congrats to @MistralAI on the…
📰 News in Arena: Mistral Medium 3 makes a strong debut with the community!
Highlights:
💠 #11 overall in chat: a +90 point leap from Mistral Large
💠Top-tier in technical domains (#5 in Math, #7 in Hard Prompts & Coding)
💠#9 in WebDev Arena
Congrats to @MistralAI on the… https://t.co/t9xZdw5n15
Bard will undoubtedly eat ChatGPT's lunch. The product is a lot nicer to use already and their teams are iterating super fast!
According to some people at OpenAI, the org is in the worst of both worlds, too ossified to move like a start-up, and too small for big co moves.
27K Followers 25K FollowingWriter, Independent Thinker, Geopolitical and History Analyst on―Europe Africa, the USA, Russia and the ex-USSR; Author of Non-fiction and Fiction titles.
8K Followers 949 FollowingML, systems, and everything in between. Building @guardrails_ai. Previously founding eng @predibase, @Apple SPG, @driveai_, @IllinoisCS, @iitdelhi.
223 Followers 257 FollowingTaming LLMs @ Google Bard
Previously: NLU @ Google Assistant, Recommendations @ Youtube, Computer Vision for cancer research @ Brown
165K Followers 0 FollowingInvented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
20K Followers 100 FollowingMember of Technical Staff at Anthropic AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof Gemini RL Prev Principal Research Engineer at DeepMind
64K Followers 932 FollowingI like writing silly Tweets, but that doesn't pay so I also type at @googledeepmind. Principal Engineer. ex-@googlechrome. volunteer @2ndharvest. 🇺🇸🇨🇷
26K Followers 229 Followinggetting us to singularity with friends
computers can be understood: https://t.co/doHE1Qv2Sj
x @GoogleDeepMind @Microsoft
tensor core maximalist
14K Followers 648 FollowingStanford Professor of Linguistics and, by courtesy, of Computer Science. Member of technical staff @stanfordnlp and @StanfordAILab. Co-founder @ Bigspin AI.
20K Followers 3K FollowingMostly posting about robots.
currently AI @agilityrobotics
prev embodied AI @AIatMeta, @NVIDIAAI. All views my own.
writing: https://t.co/iNLA4djfZo
2K Followers 27 FollowingI post about my DIY robots hardware hobby. Robotics research lead at Mistral AI. Ex-Meta/FAIR, core contributor to Llama 3. ENS PhD. Repeat founder.
22K Followers 540 FollowingFounded the Reasoning Team in Google Brain (now in the Gemini Core team of Google DeepMind). Build LLMs to reason. Opinions my own.
8K Followers 949 FollowingML, systems, and everything in between. Building @guardrails_ai. Previously founding eng @predibase, @Apple SPG, @driveai_, @IllinoisCS, @iitdelhi.