This is probably one of THE most important papers of the last few months.
Small language models are sufficiently powerful, operationally suitable, and economical for agentic tasks.
- Phi-2 matches 30-billion-parameter models while running 15x faster.
- Serving a 7 billion parameter small language…
Say hello to DINOv3 🦖🦖🦖
A major release that raises the bar of self-supervised vision foundation models.
With stunning high-resolution dense features, it’s a game-changer for vision tasks!
We scaled model size and training data, but here's what makes it special 👇
3D Object Tracking without Training Data? In our @Nature Machine Intelligence paper (nature.com/articles/s4225…), we recast 3D tracking as an inverse neural rendering task where we fit a scene graph to an image that best explains this image. The method generalizes to completely…
Very excited to share our interview with @DrYangSong. This is Part 2 of our history of diffusion series — score matching, the SDE/ODE interpretation, consistency models, and more. Enjoy!
The biggest update in the 3D reconstruction world: VGGT (new weights) has been released for commercial use as well.
Kudos to @jianyuan_wang for making this happen!
github.com/facebookresear…
Free must-read: A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
This comprehensive guide breaks down:
- What, when and how to evolve
- Evolutionary mechanisms and adaptation
- Use cases
- Challenges
and more
Check it out here arxiv.org/abs/2507.21046
Gold! A free MIT course on Efficient ML, or how to turn compute-heavy models (LLMs, diffusion models, etc.) into production-ready models.
Covers model pruning, compression, quantization, distributed training, on-device fine-tuning…
Hands-on project: deploying LLaMA on a laptop
Absolutely fantastic resource by @rasbt
Qwen3 From Scratch contains a from-scratch implementation of Qwen3 0.6B, 1.7B, 4B, 8B, and 32B.
This walkthrough shows how to get started with the Qwen3 language models in PyTorch. It walks through setting up the 0.6B model, grabbing…
NVIDIA just dropped a paper exposing a $57 billion AI industry mistake.
While Big Tech keeps pushing expensive LLMs like ChatGPT & Claude...
Small language models handle 70% of AI agent work at 1/30th the cost.
Here's why this changes everything:
(hint: less is more)
I've compiled links to the technical reports for everyone:
First technical report: Kimi K2: Open Agentic Intelligence
github.com/MoonshotAI/Kim…
Second technical report and interview: Introducing ChatGPT agent: bridging research and action
openai.com/zh-Hans-CN/ind…
Sequoia's interview with OpenAI: OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent…
Viser is an incredibly powerful and easy-to-use 3D visualization tool for robotics and 3D vision research. You can visualize 3D videos, interact with IsaacGym and MuJoCo robots, and much more — all with an intuitive and customizable interface. This is a game changer for anyone…
163K Followers · 166 Following · Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
2K Followers · 1K Following · Senior research scientist (@NVIDIAAI, prior @CMU_Robotics, @TU_Muenchen, @RWTH), working on learning to understand the world from video.
14K Followers · 519 Following · Your guide to radiance fields | Host of the podcast @ViewDependent | DM open for business inquiries | https://t.co/llYGWliKUv | discord: https://t.co/lrl64WGvlD
163K Followers · 0 Following · Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.
712K Followers · 288 Following · Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.
4.3M Followers · 3 Following · OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
2.4M Followers · 7K Following · The Official How Things Work page including Tech, AI & loads more. Also all the best News & Viral content from around the globe.
355K Followers · 1K Following · ML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
13K Followers · 361 Following · Computer Vision research group @UniofOxford led by Andrew Zisserman, Andrea Vedaldi, João Henriques, Christian Rupprecht, and Iro Laina