Thanks @_akhaliq for sharing our work!
Aim and Grasp! AimBot introduces a new design to leverage visual cues for robots - similar to scope reticles in shooting games.
Let's equip your VLA models with low-cost visual augmentation for better manipulation!
aimbot-reticle.github.io
Thanks @_akhaliq for sharing our work!
Aim and Grasp! AimBot introduces a new design to leverage visual cues for robots - similar to scope reticles in shooting games.
Let's equip your VLA models with low-cost visual augmentation for better manipulation!
aimbot-reticle.github.io
Introducing Eigent — the first multi-agent workforce on your desktop.
Eigent is a team of AI agents collaborating to complete complex tasks in parallel. It is your long-term working partner with fullly customizable workers and MCPs.
Public beta available to download for MacOS,…
Excited to share our #ICML2025 paper, Hierarchical Equivariant Policy via Frame Transfer. Our Frame Transfer interface imposes high-level decision as a coordinate frame change in the low-level, boosting sim performance by 20%+ and enabling complex manipulation with 30 demos.
Owen will be presenting our poster for the paper Hierarchical Equivariant Policy via Frame Transfer at ICML Today (see lnkd.in/e-7p9Viq for details). If you are interested in equivariance and/or robotic manipulation please stop by!
🔥 We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation.
🚀 Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46%
🌐 Website: multiverse4fm.github.io
🧵 1/n
📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks
arxiv.org/abs/2505.16381
🎉 We’re excited to host two challenges at LOVE: Multimodal Video Agent Workshop at CVPR 2025, advancing the frontier of video-language understanding! @CVPR#CVPR2025
📌 Track 1A: [VDC] Video Detailed Captioning Challenge
Generate rich and structured captions that cover multiple…
🤖 How do AI agents actually work together?
I made 2 short videos on Google’s Agent2Agent (A2A) protocol:
📘 Ep1: What is A2A?
📙 Ep2: Why it matters
No backend needed—just curiosity.
🎥 Watch here: youtube.com/playlist?list=…
Just posted a 21-min tutorial on Model Context Protocol (MCP) — no jargon, just real-life analogies.
🍜 Restaurant menus
🧳 Travel guides
🦸♂️ Superpowers
📝 Memory notes
I wanted to make it clear enough for anyone, even without a tech background.
🎥👇
youtu.be/0EtVAzIYbys?si…
Just realized my paper is being used as a baseline—such a strange feeling! Seeing my model tested across different settings without me doing anything is fascinating. 🤯
Added the papers using ThinkGrasp as a baseline to its GitHub—check them out!🥳
Introducing TraceVLA: a fully open-source Vision-Language-Action model reimagining spatial-temporal awareness: tracevla.github.io
✨ 3.5x gains on real robots, SOTA in simulation
💡 Fine-tunes on just 150K trajectories
⚡ Compact 4B model = 7B performance
749 Followers 958 FollowingRobotics Scientist at Frontier AI and Robotics @Amazon. PhD CS from @CSatUSC. RTs are my own paper reading list. Previously at @MSFTResearch and @GoogleDeepMind
419 Followers 619 FollowingFinal Year Undergrad at @Tsinghua_Uni; Previously @CMU_Robotics; Robot Learning and Embodied Agents; Applying for PhD (also job opportunities) at 2026 Fall!
1K Followers 1K FollowingPh.D. @CarnegieMellon. Working on agentic foundation model systems. Founder of the FM-Wild workshop series and the ASAP seminar series. They/Them
446 Followers 6K FollowingGiving meaning to mine share of star dust. Visiting fellow @WinshipAtEmory. Prev at @oracle, @maddox_ai, @KITKarlsruhe, @_nference, @val_iisc, @iitdelhi.
18 Followers 41 FollowingResearch Assistant at NUS. Robot learning and dexterous manipuation. creating true robotic life, pushing the boundaries of what’s possible with machines.
354 Followers 59 FollowingOrby is fundamentally transforming the way enterprise teams perform, giving you the power to delegate tedious tasks to automation.
2K Followers 27 FollowingI post about my DIY robots hardware hobby. Robotics research lead at Mistral AI. Ex-Meta/FAIR, core contributor to Llama 3. ENS PhD. Repeat founder.
5K Followers 151 FollowingRerun is an open-source SDK for visualizing streams of multimodal data.
⭐ GitHub https://t.co/yf1KZN7DBI
👾 Discord https://t.co/7PIlvsZO9n
6K Followers 165 FollowingGazebo is a leader in robot simulation. Maintained by @OpenRoboticsOrg and good friends with @rosorg!
Support: https://t.co/7sIsIXS07i