🚀 ARE: scaling up agent environments and evaluations
Everyone talks about RL envs so we built one we actually use. In the second half of AI, evals & envs are the bottleneck.
Today we OSS it all: Meta Agent Research Environment + GAIA-2 (code, demo, evals).
🔗Links👇
That's right, we released our first iteration of JEPAs for LLMs: arxiv.org/abs/2509.14252…
And yes, the code is public! Learning by latent space prediction has revolutionized vision models, and it will revolutionize LLMs!
🌜One small step for JEPAs, one giant leap for LLMs🌛
!! Important news for anyone using `transformers` from main!!
We just cut the v4 branch: we will cherry-pick relevant commits from main (like new models) daily.
But main of `transformers` will start getting the v5 commits! The first and biggest one yet: github.com/huggingface/tr…
I got to try the @RealityLabs @ray_ban Display AI glasses and they were truly impressive. Definitely the iPod moment for AI wearables. The wristband worked like magic, and zooming in on the camera felt like a superpower.
Serving a model at scale is hard. Serving it across three hardware platforms (AWS Trainium, NVIDIA GPUs, Google TPUs) while maintaining strict equivalence is a whole other level.
Makes you wonder if the hardware flexibility is truly worth the hit to development speed and…
Brush just put out its first release since the beginning of the year and the fidelity/training times are incredible! Brush is a free to use gaussian splatting platform. It's one of the few ways to train state of the art 3DGS across a variety of hardware locally.
There's also an…
When I started LLMs-from-scratch I just hoped it might help a few people learn.
Just saw that the GitHub repo has now been forked 10k times!
More than the stars, the best part is seeing thousands of people actually use and build on the code ☺️
The dream of strapping into your own personal flying machine and soaring through the sky just took a giant leap from science fiction to reality. Business Insider reported that Swedish startup Jetson has officially delivered its first-ever Jetson One eVT...
dronexl.co/2025/09/13/jet…
A new open reasoning model, K2-Think, was recently released boasting scores comparable to GPT-OSS 120B and getting a lot of media attention.
However, their performance relies on flawed evaluation marked by contamination, unfair comparisons, and misrepresentation of results. 🧵
GenAI is making video as easy to remix as turntables did for audio in the 80’s. I’m excited for crazy remixing of bad films that fix awful stories and dialog with better writing like a remix of the bayformers into something watchable
🚨3I/ATLAS Has Just Done Something Strange... It Just Turned Green.
Deep images from Sept 7 show the interstellar visitor’s glow shifting from reddish to green-blue. Credit to astrophotographers Michael Jäger & Gerald Rhemann for the capture highlighted today.
Why has MR/Spatial Computing been stagnant?
If you ask me, a few years ago we lost both of our visionaries @akipman and @rabovitz (alphabetical).
Without visionaries to lead the way things naturally take much longer.
I do still believe this tech is the future, and continue…
Chinese Academy of Sciences: Decoding Spectrum of Cosmic "Lighthouse": Researchers Unravel Polarization Mystery of Millisecond Pulsar english.cas.cn/newsroom/resea…
63 Followers 1K Following
Not a trader, I swear. Just obsessed with capturing lowest cost basis on falling knives, which entails lots of buying and selling. NOT INVESTMENT ADVICE.
258K Followers 216K Following
German #geographer and #demographer in #Melbourne. I curate #maps and #data that explain how the #world works. Obviously all opinions are my own...
10K Followers 285 Following
Not a trader, I swear. Just obsessed with capturing lowest cost basis on falling knives, which entails lots of buying and selling. NOT INVESTMENT ADVICE.
62K Followers 477 Following
Speculator & Investor since 2013 | Former Engineer & VC | Future Optimist | Here to help you make better investment decisions
43K Followers 2K Following
Covering Congress and the defense industry for @breakingdefense. Military plane meme aficionado. Mother of @ImpalerCat. She/her
2K Followers 2K Following
Director, Government & International Affairs, @Canadensys1 // Former Canadian Space Agency DG Space Exploration // Still exploring
439 Followers 563 Following
Client Support at hightechgadgets
Level 2 Seller at https://t.co/iproGcpC04…,
PhD Fellow, Good at using #Ansys, #Excel, #Matlab, for #Research
5K Followers 6K Following
Cur. - Security Cooperation Professional for ▇▇▇▇
"dude in his mom's basement talking to my son over video games" -@coleman_di92842
46K Followers 1K Following
AI Developer Experience @GoogleDeepMind | prev: Tech Lead at @huggingface, AWS ML Hero 🤗 Sharing my own views and AI News 🧑🏻💻 https://t.co/7IosdlNz22
501 Followers 96 Following
As a #Space, #Science, and #Tech enthusiast, I’m lucky to live in an era nearing the singularity, with #AI breakthroughs and #MultiPlanetary possibilities.
24K Followers 3K Following
In the beginning Bill Clinton gave him a green card. This has made a lot of people very angry and been widely regarded as a bad move • @twocentinc
3K Followers 433 Following
“Those who seek for truth behind the phenomena are condemned to an expedition in search of nothingness—the phenomena themselves are the living!”
5K Followers 4K Following
Theatre Director & Designer w/ a bkgrd in film, photo & tech. Live streaming geek & video producer for @NASASpaceflight. Coffee addict. BFA/MA/MFA. He/Him.
57K Followers 855 Following
Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner
645K Followers 35 Following
We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.