Gemini Model Product Manager @ DeepMind. Vision lead (image/video/live visual agents) for the Gemini model and Project Astra. Opinions are my own.
linkedin.com/in/rkdoshi/
Joined April 2020
Gemini has unlocked a new capability: conversational image segmentation 🖼️
This enables new use cases that were previously not possible, furthering Gemini’s SOTA image understanding capabilities! 🧵
🚀 Excited to launch "Conversational Image Segmentation" for Gemini 2.5. Now you can segment any image with natural language. Think complex queries ("people throwing frisbees"), conditional logic ("workers not wearing hard hats"), and even abstract concepts ("areas with weather…
i wish there were a programmable humanoid robot i could grab off the shelf to run RL experiments with—no reinventing motors or sensors, just pure agi brain work
I’m thinking of an “arduino for robots”: affordable and hackable—so you can build your own r2-d2. who else feels the…
📽️ The most asymmetric bet in AI is video.
🤖 Gemini accepts video inputs, turning raw video into structured data, insights, & actions
🌏 Gemini isn't just processing pixels; it's understanding context, intent, & physics
🚀 100s of startups in consumer, sports, security,…
🚀 Just joined @GoogleDeepMind as the Vision PM for the Gemini Model!
🤖 We’re leveling up image, video & spatial understanding - powering Google’s core products and unlocking new use cases for devs/enterprises. The future of AGI is visual.
🔍 Try Gemini at…
💡 LLM tokens-per-second is the bottleneck for making the user experience around most AI agents viable. Why?
Every agentic design pattern (reflection, tool use, planning, multi-agent collaboration) requires iterative LLM calls. Agents are really slow and users are impatient.
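To make the arithmetic behind that claim concrete, here is a rough back-of-the-envelope sketch. The function name and every number in it (8 sequential calls, 500 output tokens per call, 50 tokens per second) are illustrative assumptions, not measurements, and the estimate ignores network and prompt-processing time.

```python
def agent_latency_seconds(num_calls: int, tokens_per_call: int,
                          tokens_per_second: float) -> float:
    """Estimate generation time for sequential LLM calls.

    Hypothetical helper: assumes calls run one after another and that
    decoding speed is the only cost.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_calls * tokens_per_call / tokens_per_second


# Assumed figures for illustration only.
single_call = agent_latency_seconds(1, 500, 50.0)    # one chat-style reply
agent_loop = agent_latency_seconds(8, 500, 50.0)     # plan, act, reflect, repeat

print(f"single call: {single_call:.0f}s, agent loop: {agent_loop:.0f}s")
```

Under these assumptions the wait grows linearly with the number of iterations, so doubling tokens-per-second halves what the user sits through.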