Yaoyao(Freax) Qian @RubyFreax

Tweets

79
Followers

381
Following

1K
Likes

191

Yinpei Dai @YinpeiD

2 months ago

Thanks @_akhaliq for sharing our work! Aim and Grasp! AimBot introduces a new design to leverage visual cues for robots - similar to scope reticles in shooting games. Let's equip your VLA models with low-cost visual augmentation for better manipulation! aimbot-reticle.github.io

AK @_akhaliq

2 months ago

5 22 175 61K 81

Download Video

1 7 21 3K 6

Guohao Li 🐫 @guohao_li

2 months ago

Introducing Eigent — the first multi-agent workforce on your desktop. Eigent is a team of AI agents collaborating to complete complex tasks in parallel. It is your long-term working partner with fullly customizable workers and MCPs. Public beta available to download for MacOS,…

137 138 675 198K 807

Download Video

Haibo Zhao @ZhaoHaibo47588

2 months ago

Excited to share our #ICML2025 paper, Hierarchical Equivariant Policy via Frame Transfer. Our Frame Transfer interface imposes high-level decision as a coordinate frame change in the low-level, boosting sim performance by 20%+ and enabling complex manipulation with 30 demos.

2 12 45 4K 10

Download Video

Yaoyao(Freax) Qian @RubyFreax

3 months ago

Owen will be presenting our poster for the paper Hierarchical Equivariant Policy via Frame Transfer at ICML Today (see lnkd.in/e-7p9Viq for details). If you are interested in equivariance and/or robotic manipulation please stop by!

0 0 2 240 0

Download Image

Yaoyao(Freax) Qian @RubyFreax

3 months ago

inspired talk! #RSS2025

1 1 10 796 2

Download Image

Yaoyao(Freax) Qian @RubyFreax

3 months ago

🤣First time at RSS! Happy to meet up and chat!

0 0 2 214 0

Download Image

Yaoyao(Freax) Qian @RubyFreax

3 months ago

🥳Visual Tree Search of Web Agent has been accepted!

Danqing Zhang @Danqing_Z

4 months ago

🥳Visual Tree Search of Web Agent has been accepted!

1 4 10 772 1

0 0 4 393 0

Infini-AI-Lab @InfiniAILab

4 months ago

🔥 We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation. 🚀 Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46% 🌐 Website: multiverse4fm.github.io 🧵 1/n

6 83 221 88K 114

Download Gif

Songlin Yang @SonglinYang4

4 months ago

📢 (1/16) Introducing PaTH 🛣️ — a RoPE-free contextualized position encoding scheme, built for stronger state tracking, better extrapolation, and hardware-efficient training. PaTH outperforms RoPE across short and long language modeling benchmarks arxiv.org/abs/2505.16381

9 93 542 75K 331

Wenhao Chai @wenhaocha1

5 months ago

🎉 We’re excited to host two challenges at LOVE: Multimodal Video Agent Workshop at CVPR 2025, advancing the frontier of video-language understanding! @CVPR #CVPR2025 📌 Track 1A: [VDC] Video Detailed Captioning Challenge Generate rich and structured captions that cover multiple…

2 14 43 8K 9

Yaoyao(Freax) Qian @RubyFreax

6 months ago

🤖 How do AI agents actually work together? I made 2 short videos on Google’s Agent2Agent (A2A) protocol: 📘 Ep1: What is A2A? 📙 Ep2: Why it matters No backend needed—just curiosity. 🎥 Watch here: youtube.com/playlist?list=…

0 1 4 740 1

Yaoyao(Freax) Qian @RubyFreax

6 months ago

Just posted a 21-min tutorial on Model Context Protocol (MCP) — no jargon, just real-life analogies. 🍜 Restaurant menus 🧳 Travel guides 🦸‍♂️ Superpowers 📝 Memory notes I wanted to make it clear enough for anyone, even without a tech background. 🎥👇 youtu.be/0EtVAzIYbys?si…

0 0 1 429 2

Yaoyao(Freax) Qian @RubyFreax

7 months ago

Just realized my paper is being used as a baseline—such a strange feeling! Seeing my model tested across different settings without me doing anything is fascinating. 🤯 Added the papers using ThinkGrasp as a baseline to its GitHub—check them out!🥳

0 1 18 889 7

Download Image

Yongyuan Liang @cheryyun_l

9 months ago

Introducing TraceVLA: a fully open-source Vision-Language-Action model reimagining spatial-temporal awareness: tracevla.github.io ✨ 3.5x gains on real robots, SOTA in simulation 💡 Fine-tunes on just 150K trajectories ⚡ Compact 4B model = 7B performance