Third year CS PhD candidate at Princeton University (@princeton_nlp @PrincetonPLI), previously CS undergrad at IIT Bombayadithyabh.github.io Princeton, NJJoined June 2023
Are AI scientists already better than human researchers?
We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts.
Main finding: LLM ideas result in worse projects than human ideas.
🤔 Recent mech interp work showed that retrieval heads can explain some long-context behavior. But can we use this insight for retrieval?
📣 Introducing QRHeads (query-focused retrieval heads) that enhance retrieval
Main contributions:
🔍 Better head detection: we find a…
Introducing COMPACT: COMPositional Atomic-to-complex Visual Capability Tuning, a data-efficient approach to improve multimodal models on complex visual tasks without scaling data volume. 📦
arxiv.org/abs/2504.21850
1/10
543K Followers 23K FollowingThe best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
2K Followers 2K FollowingPh.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois. I used to work on computer vision, but it's not all I do.
9K Followers 2K FollowingMachine learning scientist and engineer speaking πtorch & C++. Past @LTIatCMU, @awscloud. Opinions sampled from MY OWN 100T param LM.
7K Followers 6K FollowingCenter for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw
@[email protected]
85 Followers 231 FollowingResearch @ MATS, CS @ Princeton, working on multi-agent safety, long-context language models, and efficient inference techniques.
103 Followers 478 Followingresearch scientist at AIML @Apple ;Ex AI Researcher @SFResearch; Ph.D alumni UT Austin @UTCompSci . Reinforcement learning, diffusion model and LLMs.
266K Followers 680 FollowingBuilding with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
5K Followers 2K FollowingDirector and Research Scientist, FAIR @ Meta. Former Professor at UCSD. Researcher in AI privacy, security, and generalization.
2K Followers 8K Following3D Geospatial Analyst at Maxar Space Operations.
Opinions expressed on this site are my own and do not necessarily represent the views of Maxar.
9K Followers 2K FollowingMachine learning scientist and engineer speaking πtorch & C++. Past @LTIatCMU, @awscloud. Opinions sampled from MY OWN 100T param LM.
10K Followers 2K FollowingCS PhD candidate @PrincetonCITP. I tweet about AI agents, AI evals, AI for science.
AI as Normal Technology: https://t.co/5amOkqKDf2
Book: https://t.co/DabpkhNrcM
50K Followers 3K FollowingAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
15K Followers 528 FollowingAsst. Prof. of CS at Stanford, Google DeepMind. Prev: Anthropic, Google Brain. Co-Creator of MoEs, AlphaChip, Test Time Scaling Laws.