Sparsh Garg @_sparshgarg_

MLE Perception@ Lucid Motors | 3D Perception Researcher @ Bosch Center for AI | CMU Robotics sparsh913.github.io/sparshgarg/ Newark, CA Joined October 2023

Tweets

83
Followers

156
Following

1K
Likes

1K

Songming Liu @songming_liu

4 days ago

😠💢😵‍💫Tired of endless data collection & fine-tuning every time you try out VLA? Meet RDT2, the first foundation model that zero-shot deploys on any robot arms with unseen scenes, objects & instructions. No collection. No tuning. Just plug and play🚀 Witness a clear sign of…

23 86 524 77K 404

Download Video

Skild AI @SkildAI

6 days ago

We built a robot brain that nothing can stop. Shattered limbs? Jammed motors? If the bot can move, the Brain will move it— even if it’s an entirely new robot body. Meet the omni-bodied Skild Brain:

513 918 7K 2.1M 2K

Download Video

Lukas Ziegler @lukas_m_ziegler

2 weeks ago

A robotic ballet! 🩰 Coordinating multiple robot arms on a busy factory floor is notoriously complex. Each arm needs to move without colliding with its neighbors or the surrounding equipment, and today that planning is still mostly done by hand, a process that takes specialists…

4 71 453 25K 201

Download Video

Jason Liu @JasonJZLiu

3 weeks ago

Ever wish a robot could just move to any goal in any environment—avoiding all collisions and reacting in real time? 🚀Excited to share our #CoRL2025 paper, Deep Reactive Policy (DRP), a learning-based motion planner that navigates complex scenes with moving obstacles—directly…

21 163 894 67K 410

Download Video

Lucid Motors @LucidMotors

2 months ago

Rugged by design. Elevated by nature. The #LucidGravityX concept redefines what a trail-ready adventure vehicle could be. Read more about our new bold concept: bit.ly/46Yu886

83 134 989 140K 52

Download Image

Skild AI @SkildAI

2 months ago

We’ve all seen humanoid robots doing backflips and dance routines for years. But if you ask them to climb a few stairs in the real world, they stumble! We took our robot on a walk around town to environments that it hadn’t seen before. Here’s how it works🧵⬇️

40 145 848 255K 148

Download Video

Deepak Pathak @pathak2206

2 months ago

AI that truly understands the physical world should not be limited by robot type or tasks. We tackle robotics in its full generality @SkildAI. The goal is to build a continually improving, omni-bodied brain that can control any hardware for any task.

5 8 65 7K 4

Download Video

Russ Tedrake @RussTedrake

3 months ago

TRI's latest Large Behavior Model (LBM) paper landed on arxiv last night! Check out our project website: toyotaresearchinstitute.github.io/lbm1/ One of our main goals for this paper was to put out a very careful and thorough study on the topic to help people understand the state of the…

8 108 490 81K 194

Shalev Lifshitz @Shalev_lif

3 months ago

The neural network objective function is a very complicated objective function. It's very non convex, and there are no mathematical guarantees whatsoever about its success. And so if you were to speak to somebody who studies optimization from a theoretical point of view, they…

32 133 1K 203K 510

Download Image

Inbar Mosseri @inbar_mosseri

4 months ago

Excited to share that TokenVerse won Best Paper Award at SIGGRAPH 2025! 🎉 TokenVerse enables personalization of complex visual concepts, from objects and materials to poses and lighting, each can be extracted from a single image and be recomposed into a coherent result. 👇

9 22 211 16K 63

Download Video

TuringPost @TheTuringPost

4 months ago

Log-linear attention — a new type of attention proposed by @MIT which is: - fast and efficient as linear attention - expressive as softmax It uses a small but growing number of memory slots that increases logarithmically with the sequence length. Here's how it works:

12 225 1K 103K 1K

Download Image

Yuliang Guo @33yuliangguo

4 months ago

[CVPR2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any ... youtu.be/U1qGXx0QBwE?si… via @YouTube

0 1 3 202 0

Yuan Liu @YuanLiu41955461

9 months ago

I'm excited to share our new work Diffusion as Shader (DaS), a versatile controllable video generation method for various tasks: object manipulation, camera control, mesh-to-video, and motion transfer. Project page: igl-hkust.github.io/das/ Github: github.com/IGL-HKUST/Diff…

4 80 323 32K 177

Download Video

Ayush Jain @ayushjain1144

4 months ago

We move our eyes actively—driven by survival and efficiency—but we still don’t fully understand how. That makes supervised learning hard. In our new work, we explore how to train VLMs to reason visually using RL. ViGoRL offers a glimpse into how models like o3 might be trained.

Gabriel Sarch @GabrielSarch

4 months ago

12 61 443 62K 425

Download Video

0 1 11 549 3

Ville🤖 @VilleKuosmanen

4 months ago

Do AI robots see the world like we do? I dove head first into latent space to uncover the attention maps that show how my robot sees and understands the world.

9 42 278 44K 110

Download Video

Google DeepMind @GoogleDeepMind

4 months ago

Video, meet audio. 🎥🤝🔊 With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make. Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles. 🧵

657 1K 8K 1.5M 3K

Download Video

Inbar Mosseri @inbar_mosseri

4 months ago

Excited to introduce our new Veo 2 capabilities! Now with reference powered video generation (including style!), camera controls, outpainting, object add/removal & many more: deepmind.google/models/veo/#ca… Also presenting Flow, our new AI filmmaking tool. labs.google/flow

1 9 34 2K 3

Download Video

Hubert Thieblot @hthieblot

4 months ago

I'm investing up to 250k first checks in teams building: - robotics, drones, space - crypto - applied ai/ml - ar/vr - manufacturing, logistics DMs always open. Tell me what you're building!

288 104 2K 366K 2K

Yu Xiang @YuXiang_IRVL

5 months ago

🤖Why is robot manipulation still an open challenge? This video shows a kitting task -- packing multiple items into a single product. No robot today can do this autonomously. Big challenges = big opportunities for research and industry. #robotics #manipulation #automation

8 26 186 21K 72

Download Video

Shubham Tulsiani @shubhtuls

5 months ago

[1/6] Our #CVPR2025 paper “DiffusionSfM” extends our RayDiffusion framework — inferring both geometry and cameras via diffusing pixelwise ray origins and endpoints.