[1/N] 🎥 We've made available a powerful spatial AI tool named ViPE: Video Pose Engine, to recover camera motion, intrinsics, and dense metric depth from casual videos!
Running at 3–5 FPS, ViPE handles cinematic shots, dashcams, and even 360° panoramas.
🔗 research.nvidia.com/labs/toronto-a…
We are excited to share Cosmos-Drive-Dreams 🚀
A bold new synthetic data generation (SDG) pipeline powered by world foundation models—designed to synthesize rich, challenging driving scenarios at scale.
Models, Code, Dataset, Tookit are released.
Website:…
🚀Excited to introduce GEN3C #CVPR2025, a generative video model with an explicit 3D cache for precise camera control.
🎥It applies to multiple use cases, including single-view and sparse-view NVS🖼️ and challenging settings like monocular dynamic NVS and driving simulation🚗.…
Reward models that help real robots learn new tasks—no new demos needed!
ReWiND uses language-guided rewards to train bimanual arms on OOD tasks in 1 hour!
Offline-to-online, lang-conditioned, visual RL on action-chunked transformers.
🧵
Check our Physgen3D which extends Physgen () to 3D. Try the deflate demo below 👇👇👇 Achieved by our amazing intern @boyuanchen21 and collaborators @jiang_hanxiao, Saurabh, @YunzhuLiYZ Prof. Zhao and @ShenlongWang
Check our Physgen3D which extends Physgen () to 3D. Try the deflate demo below 👇👇👇 Achieved by our amazing intern @boyuanchen21 and collaborators @jiang_hanxiao, Saurabh, @YunzhuLiYZ Prof. Zhao and @ShenlongWang https://t.co/pxxWqmcpvW
Stop by our poster #217 tmr 10:30 if you are at #ECCV2024, Prof @ShenlongWang and Prof @_saurabhg will present tmr. This is how Shenlong did toy experiments at home🤣
Stop by our poster #217 tmr 10:30 if you are at #ECCV2024, Prof @ShenlongWang and Prof @_saurabhg will present tmr. This is how Shenlong did toy experiments at home🤣 https://t.co/Ld4Caat2f4
@_akhaliq The paper presents a novel image-to-video generation method called PhysGen that can convert a single image into a realistic, physically plausible, and temporally consistent video. The key idea is to integrate a model-based physical simulation with a data-driven video generation…
Thank you AK @_akhaliq for featuring our work. Come and visit our stevenlsw.github.io/physgen/ to play the interactive demos! Don't miss our Wednesday morning poster session at #217 if you are at #ECCV2024
Thank you AK @_akhaliq for featuring our work. Come and visit our stevenlsw.github.io/physgen/ to play the interactive demos! Don't miss our Wednesday morning poster session at #217 if you are at #ECCV2024
Introducing: Opening Cabinets and Drawers in the Real World using a Commodity Mobile Manipulator
We develop a system to open unseen cabinets and drawers *zero-shot* from novel environments using the Stretch RE2: arjung128.github.io/opening-cabine…
327K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
22 Followers 253 FollowingI'm a first-year Ph.D. student at Tsinghua University, focusing on building safe and reliable LLMs.
My personal website: https://t.co/s5qVMF1nBK
272 Followers 2K FollowingMA European Interdisciplinary Translator in Humanities
Medical Science Researcher
Fostering understanding with empathy
Writer, Video Editor & Digital Journalist
372 Followers 3K FollowingIdeas, thoughts, bits of wisdom, fun and sarcasm from a worst-selling author of no book at all. Yet. One day, maybe there’ll be enough for one.
726 Followers 460 FollowingResearch scientist in robotics @ GEAR Nvidia. I obtained my PhD from University of Toronto @UofT, Vector Institute @VectorInst 😃
327K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
955K Followers 765 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
85K Followers 706 FollowingDirector, Max Planck Institute for Intelligent Systems (@MPI_IS). Chief Scientist @meshcapade. Building 3D digital humans using vision, graphics, and learning.
20K Followers 467 FollowingAssociate Professor @UTCompSci | Director @NVIDIAAI Co-Leading GEAR | CS PhD @Stanford | Building generalist robot autonomy in the wild | Opinions are my own
8K Followers 880 FollowingAssistant Professor @Cambridge_Eng, working on 3D computer vision and inverse graphics, previously postdoc @StanfordSVL, PhD @Oxford_VGG
29K Followers 1K FollowingFoundation Models for Generalizable Autonomy in Robotics. Reinforcement Learning. Assistant Professor in AI Robotics @GeorgiaTech. Prev @nvidia
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
2K Followers 2K FollowingPh.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois. I used to work on computer vision, but it's not all I do.
1K Followers 735 FollowingProduct Operations Engineer at AIMonk Labs || Optimizing AI Systems & Driving Operational Excellence ||Sharing Insights on AI and Robotics
14K Followers 521 FollowingYour guide to radiance fields | Host of the podcast @ViewDependent | Founder and CEO of https://t.co/5MjtfpwEU3 | discord: https://t.co/lrl64WGvlD
11K Followers 63 FollowingOfficial account for the IEEE/CVF International Conference on Computer Vision. #ICCV2025 Honolulu 🇺🇸 Hosted by @natanielruizg @anfurnari @YVinker @CSProfKGD
1K Followers 708 FollowingResearch Scientist at @NVIDIA | PhD from SJTU @sjtu1896 | Interested in 3D Computer Vision, Human Digitization | Views are my own