One of my most exciting results lately! We identify experts in MoE models for properties like safety and faithfulness, and steer them to improve/hurt model faithfulness and safety. Most shockingly, with stearMoE, we can jailbreak 100% safety guardrails for open models. Details 👇
One of my most exciting results lately! We identify experts in MoE models for properties like safety and faithfulness, and steer them to improve/hurt model faithfulness and safety. Most shockingly, with stearMoE, we can jailbreak 100% safety guardrails for open models. Details 👇
@mohsen_fayyaz's recent work showed several critical issues of dense retrievers favoring spurious correlations over knowledge, which makes RAG particularly vulnerable to adversarial examples. Check out more details 👇
@mohsen_fayyaz's recent work showed several critical issues of dense retrievers favoring spurious correlations over knowledge, which makes RAG particularly vulnerable to adversarial examples. Check out more details 👇
Dense retrieval models in Retrieval Augmented Generation systems often prioritize superficial document features, overlooking actual answer relevance.
This inefficiency arises from biases in retrievers.
This paper addresses this by using controlled experiments based on Re-DocRED…
Excited to share MRAG-Bench is accepted at #ICLR2025 🇸🇬.
The image corpus is a rich source of information, and extracting knowledge from it can often be more advantageous than from a text corpus.
We study how MLLMs can utilize vision-centric multimodal knowledge. More in our…
Excited to share MRAG-Bench is accepted at #ICLR2025 🇸🇬.
The image corpus is a rich source of information, and extracting knowledge from it can often be more advantageous than from a text corpus.
We study how MLLMs can utilize vision-centric multimodal knowledge. More in our…
🚀Introducing MRAG-Bench: How do Large Vision-Language Models utilize vision-centric multimodal knowledge? 🤔Previous multimodal knowledge QA benchmarks can mainly be solved by retrieving text knowledge.💥We focus on scenarios where retrieving knowledge from image corpus is more…
Spent a fantastic weekend at Lake Arrowhead with the @uclanlp group! ❄️🏔️⬆️ Enjoyed scenic drives, delicious meals, engaging conversations, and brainstorming sessions. Truly inspiring! 🚗🥘😋💬 🖼️🧠💡
227 Followers 884 FollowingCurator of the LLMpedia (📚 The Illustrated Large Language Model Encyclopedia)
Sharing insights from the most interesting AI papers ·˖✶ ⋆.✧̣̇˚
1 Followers 15 FollowingI’m a Ph.D. student at the University of Southern California, advised by Prof. Jonathan May. My research focuses on grounding of Language Model based systems
5K Followers 311 FollowingAI, iOS & Android dev. Worked at Meta/IG, Uber, Amazon, Apple, and Microsoft building apps, developer platforms, and hardware. Tweeting about LLM psychotherapy.
629 Followers 1K FollowingApplied AI @OpenAI | Physicist | Autonomous systems | ex-@PalantirTech; ex-@AppliedInt | @uniheidelberg | Personal Views Only
349 Followers 773 FollowingPhD student @HKUST in NLP supervised by @yqsong. My research interests are Theory of Mind and Social Intelligence. @HKUSTKnowComp #NLProc
227 Followers 884 FollowingCurator of the LLMpedia (📚 The Illustrated Large Language Model Encyclopedia)
Sharing insights from the most interesting AI papers ·˖✶ ⋆.✧̣̇˚
22K Followers 540 FollowingFounded the Reasoning Team in Google Brain (now in the Gemini Core team of Google DeepMind). Build LLMs to reason. Opinions my own.
5K Followers 311 FollowingAI, iOS & Android dev. Worked at Meta/IG, Uber, Amazon, Apple, and Microsoft building apps, developer platforms, and hardware. Tweeting about LLM psychotherapy.
629 Followers 1K FollowingApplied AI @OpenAI | Physicist | Autonomous systems | ex-@PalantirTech; ex-@AppliedInt | @uniheidelberg | Personal Views Only
349 Followers 773 FollowingPhD student @HKUST in NLP supervised by @yqsong. My research interests are Theory of Mind and Social Intelligence. @HKUSTKnowComp #NLProc
10K Followers 4K Followingsth new // ex Gemini RL+Inference @GoogleDeepMind // Chat AI @Meta // RL Agents @EA // ML+Information Theory @MIT+@Harvard+@GeorgiaTech // زن زندگی آزادی
543K Followers 24K FollowingThe best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
16 Followers 121 FollowingPhD Student @TU_Muenchen. Working on the Theoretical Foundation of ML Systems.
MSc and BSc @ Sharif University. (@SharifSocial)
19K Followers 20 FollowingA high-throughput and memory-efficient inference and serving engine for LLMs. Join https://t.co/lxJ0SfX5pJ to discuss together with the community!
269K Followers 0 FollowingThe Internet's Observatory: Tracking cybersecurity and digital governance • connectivity and democracy • tools and policy for change
No recent Favorites. New Favorites will appear here.