At @MedARC_AI we are building a comprehensive suite of medical LLM evals, and we already have tons of volunteers and lots of great progress!
The project started less than a week ago!
Are there other medical LLM evals we should include?
Announcement!! 📢
We are re-launching MedARC, now supported by @SophontAI 🚀🔥
MedARC is our medical AI research collective that I founded in 2023. We've accomplished so much already, publishing in top venues like NeurIPS and Nature Biomedical Engineering, collaborated from…
Announcement!! 📢
We are re-launching MedARC, now supported by @SophontAI 🚀🔥
MedARC is our medical AI research collective that I founded in 2023. We've accomplished so much already, publishing in top venues like NeurIPS and Nature Biomedical Engineering, collaborated from…
Nous Research presents Hermes 4, our latest line of hybrid reasoning models.
hermes4.nousresearch.com
Hermes 4 builds on our legacy of user-aligned models with expanded test-time compute capabilities.
Special attention was given to making the models creative and interesting to…
very exciting to see what Prime Intellect is doing to grow the open-source RL ecosystem.
We hope to do a similar strategy to grow the open-source medical AI ecosystem as well (part of that includes developing medical RL envs!)
More info about how to contribute coming soon!
very exciting to see what Prime Intellect is doing to grow the open-source RL ecosystem.
We hope to do a similar strategy to grow the open-source medical AI ecosystem as well (part of that includes developing medical RL envs!)
More info about how to contribute coming soon!
LLMs just beat humans at GeoGuessr in a new benchmark 🤯
With O1 outperforming all other LLMs like GPT-4.1, Gemini-2.5-pro, and even O3! Performing on par with RainBolt! 🌎
deepguessr.com
Got my domain finally and went ahead and did runs for the new models. Interestingly enough with o3 is the scores seem to be regressing. Went over some of the outputs and my best theory is that o3 overanalyzes far too often.
Go play against the models!
100 Followers 737 FollowingBorn too late to find peace in grasslands, Born too early to rebuild civilization in ruins, Born just in time to be prophet of end times 🌻
233 Followers 2K FollowingDirector Desarrollo de Negocio ByEvolution Creative Factory, entregado a su trabajo y constante en la persecución de unos objetivos claros.
6K Followers 559 Followinge/λ Currently: Doing some stuff with AI.
Prev founding team of both: @NousResearch and @TTSLabsAI
DM for interesting conversations.
16K Followers 585 FollowingDirector of Product at Google Labs. Code AI. Dive in ➡ @googlelabs, @stitchbygoogle, and @julesagent Previously @vercel, @github and @heroku
2K Followers 1K FollowingFounder and CEO of https://t.co/1jppiQKGGn. Love building things and investing in builders. Also co-founded Kettle & Wayfinder. Investor w/ @djrosent @Kindergartenvc.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
739 Followers 373 FollowingResearch Engineer @GoogleDeepMind; Building AI for climate change mitigation & adaptation; @WMO Young Scientist of the Year 2022; he/him
475 Followers 70 FollowingCreating weather certainty. We fuse unparalleled data from our constellation of smart, long-duration sensing balloons with state-of-the-art AI forecasts.
652 Followers 1K Followingsenior researcher @MSFTResearch AI for Science, PhD @DeptofPhysics. Opinions my own. Slowly moving to @megstanley.bsky.social
13K Followers 323 FollowingI make cool stuff / Tropics, Aurora, and extreme Snow lover / Graphic Designer for Max Velocity, Team Dominator, and others / Content Manager for @AtmosWX
5K Followers 7K Followinggeek, entrepreneur, 'I strictly color outside the lines!', opinions r my own indeed. @ayirpelle , universal handle at this time
167K Followers 167 FollowingCo-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
3K Followers 35 FollowingTornado Archive is a website dedicated to preserving and visualizing worldwide tornado history, climatology, “archaeology”, and media.
857K Followers 25K FollowingExtreme meteorologist, inventor and storm chaser intercepting the most powerful storms on the planet. I'm driven to push the science and its education forward.