Our team is *hiring* interns & researchers! We’re a small team of hardcore researchers & engineers working on foundation models, agentic methods, and embodiment. If you have strong publications and related experience, plz fill out application form.
forms.gle/4bUeFfksUhCLap…
Our Excel Agent, Shortcut, is generally available now!
Greatly improved trust-worthiness & accuracy. ~90% win rate against top first-year analysts
26 days since early access, 28 versions shipped
So proud of the team, and really appreciate all the feedback from our users!
Our Excel Agent, Shortcut, is generally available now!
Greatly improved trust-worthiness & accuracy. ~90% win rate against top first-year analysts
26 days since early access, 28 versions shipped
So proud of the team, and really appreciate all the feedback from our users!
Shortcut – the first superhuman excel agent – is live.
While not perfect, Shortcut beats first year analysts from McKinsey/Goldman head-to-head 89.1% (220:27) when blindly judged by their managers.
We even gave humans 10x more time.
Try Shortcut now (before your boss does).
Many of you have known us as Altera. Today, I'm happy to share that we are now officially @Fundamental Research Labs!
We will be unveiling our next big step today, so it felt perfect to reintroduce ourselves:
digitalhumanity.substack.com/p/introducing-…
Can we make LLMs reason effectively without a huge inference time cost?
We show a powerful approach through learning and forgetting!
Our recipe:
1️⃣ Aggregate reasoning paths from diverse sources: Chain-of-Thought, inference-time search (Tree-of-Thought, Reasoning-via-Planning),…
Excited to announce that our web agent paper, AgentOccam, has been accepted to ICLR 2025! 🏂🏂🏂 Huge thanks to all collaborators! 😊
Special thanks to my brilliant and considerate mentor, Yao @yaoliucs, for your constant guidance and encouragement! Sapana @Sapana_007 and Rasool…
Excited to announce that our web agent paper, AgentOccam, has been accepted to ICLR 2025! 🏂🏂🏂 Huge thanks to all collaborators! 😊
Special thanks to my brilliant and considerate mentor, Yao @yaoliucs, for your constant guidance and encouragement! Sapana @Sapana_007 and Rasool… https://t.co/uAGvwKiAkr
👾 Introducing AgentOccam: Automating Web Tasks with LLMs! 🌐 AgentOccam showcases the impressive power of Large Language Models (LLMs) on web tasks, without any in-context examples, new agent roles, online feedback, or search strategies. 🏄🏄🏄
🧙 Link: arxiv.org/abs/2410.13825…
How can robots efficiently learn **new tasks/in new settings**?
Introducing EXTRACT: a reinforcement learning (RL) framework that extracts a discrete + continuously parameterized skill library from offline data for efficient RL on new tasks!
Accepted to CoRL 2024: 🧵👇
Proud to release the first LLM from @boson_ai. Higgs-Llama-3-70B, built for characters and gameplay, trained on Boson-3 base. With great MMLU-Pro performance. boson.ai/higgs-opensour…
Our team at AWS is *hiring* interns and full-time researchers!
@yaoliucs, @pratikac, I, and others work on RL, alignment, large models, and ML in general.
If you have a strong relevant publications in those areas, please fill out this form.
forms.gle/5KsNZ1zyKArLF4…
Offline RL is much harder than online RL or imitation learning as it needs to solve a sequence of counterfactual reasoning problems. That often gives an error of (1+\delta)^H, where delta is the one-step divergence of policy or extrapolation of Q and H is the horizon. 1/N
One common misconception about (deep) RL is that is was done by first defining some empirical loss as objective and then deriving model updating rules from GD, just like supervised learning. This is NOT the case for popular RL algorithms like policy gradient or TD-based. 1/N
168 Followers 482 FollowingPulling out all the FLOPS at @FLAIR_Ox 🚀 DPhil in Machine Learning @UniofOxford | ex-RS Intern @Spotify | ex-RS @convergence_ai_ (acq. @Salesforce)
956K Followers 765 FollowingProfessor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.
11K Followers 722 Following"If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠
1.2M Followers 279 FollowingWe’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
359K Followers 1K FollowingML/AI researcher & former stats professor turned LLM research engineer. Author of "Build a Large Language Model From Scratch" (https://t.co/O8LAAMRzzW).
328K Followers 3K FollowingNVIDIA Director of Robotics & Distinguished Scientist. Co-Lead of GEAR lab. Solving Physical AGI, one motor at a time. Stanford Ph.D. OpenAI's 1st intern.
29K Followers 1K FollowingFoundation Models for Generalizable Autonomy in Robotics. Reinforcement Learning. Assistant Professor in AI Robotics @GeorgiaTech. Prev @nvidia
53K Followers 64 FollowingStudent of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
13K Followers 184 Followingpost training co-lead at Google DeepMind, focusing on safety, alignment, post training capabilities • associate professor at UC Berkeley EECS
10K Followers 4K Followingsth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
2K Followers 1K FollowingResearch at @OpenAI; Reinforcement Learning; PhD from UT Austin. Previously FAIR Paris @AIatMeta, @CMU_Robotics @NVIDIAAI @UberATG.
34K Followers 36 FollowingWorld Labs is a spatial intelligence company building Large World Models to perceive, generate, and interact with the 3D world.
400 Followers 596 FollowingApplied Scientist @awscloud. Ph.D. from @UTAustin.
Interpretable and Robust Models #NLProc.
I have a super powerful language model in my brain.
1K Followers 220 FollowingCorso is a Professor at U Michigan and Co-Founder of Voxel51 who makes the category-defining data+model codevelopment ML Tool: FiftyOne