Thank you, @rohanpaul_ai, for the great summary of Red Queen! 🚀 We’ve just released our codebase at GitHub. Feel free to explore it and create your own Red Queen attacks!
Git link: github.com/kriti-hippo/re…
Thank you, @rohanpaul_ai, for the great summary of Red Queen! 🚀 We’ve just released our codebase at GitHub. Feel free to explore it and create your own Red Queen attacks!
Git link: github.com/kriti-hippo/re…
RED QUEEN ATTACK construct a multi-turn scenario to conceal malicious intent and mislead models! To our surprise, larger models are more susceptible under our attack😮😮😮
RED QUEEN ATTACK construct a multi-turn scenario to conceal malicious intent and mislead models! To our surprise, larger models are more susceptible under our attack😮😮😮
How good are MLLM at solving IQ (abstract visual reasoning) problems? Check our new benchmark paper! MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning
Paper: arxiv.org/pdf/2404.13591…
Website: marvel770.github.io
Great pressure to present our cool work at EMNLP 2023 !!!
BrainTeaser: a lateral thinking task in a multiple-choice QA format on which large language models struggle to identify puzzle solutions, distracted by surface commonsense associations.
🔊 📝 Announcing our new paper at the EMNLP 2023 main conference!
💡 TLDR: Can LLMs solve complex brain teasers? We introduce BRAINTEASER, a multiple-choice framework aimed at evaluating lateral thinking in language models. (arxiv.org/abs/2310.05057)
2K Followers 2K FollowingDirector @ Salesforce Research. Research Interest: Large Language Model, Action Agent, Reinforcement Learning, Time Series Analytics, Learning Theory.
2K Followers 2K FollowingNLP postdoc at @SheffieldNLP
Ex @Imperial_NLP PhD, @Apple AI/ML Scholar, @UCL MSc
Model robustness and now uncertainty quantification
776 Followers 1K Following[email protected], Postdoc@tsinghua, working with Prof. Jie Tang. PhD advised by Prof. Yue Zhang. Prev: Interned @AWScloud. LLM Evaluation, Posttraining
97 Followers 127 FollowingDirector of Artificial Intelligence and Staff Research Engineer, Founding Team @Hyperbots_Inc IIT Bombay 4th Year CS PhD candidate @cfiltnlp @iitbombay
475 Followers 3K FollowingWe spotlight ML researchers & practitioners. High (S) fact: ~50% code contributors to ML paper implementations are practitioners collaborating with researchers
2K Followers 2K FollowingDirector @ Salesforce Research. Research Interest: Large Language Model, Action Agent, Reinforcement Learning, Time Series Analytics, Learning Theory.
1K Followers 416 Following100K+ on LinkedIn | Founder @ JUTEQ | Building agents for businesses worldwide | Follow to learn latest industry insights about AI Agents
97K Followers 8K FollowingCompiling in real-time, the race towards AGI.
The Largest Show on X for AI.
🗞️ Get my daily AI analysis newsletter to your email 👉 https://t.co/6LBxO8215l
2K Followers 2K FollowingNLP postdoc at @SheffieldNLP
Ex @Imperial_NLP PhD, @Apple AI/ML Scholar, @UCL MSc
Model robustness and now uncertainty quantification
148K Followers 2 FollowingMakers of Devin, the first AI software engineer. We are an applied AI lab building end-to-end software agents. Join us: https://t.co/JZDd4Vik4P
776 Followers 1K Following[email protected], Postdoc@tsinghua, working with Prof. Jie Tang. PhD advised by Prof. Yue Zhang. Prev: Interned @AWScloud. LLM Evaluation, Posttraining
1K Followers 3 FollowingThe leader in AI/ML acceleration software and creator of Colossal-AI, the open source platform for deep learning training and inference optimization.
15K Followers 51 FollowingEMNLP 2025 - The 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Hashtag: #EMNLP2025
Dates: November 5-9
Submission Deadline: May 19th
97 Followers 127 FollowingDirector of Artificial Intelligence and Staff Research Engineer, Founding Team @Hyperbots_Inc IIT Bombay 4th Year CS PhD candidate @cfiltnlp @iitbombay
475 Followers 3K FollowingWe spotlight ML researchers & practitioners. High (S) fact: ~50% code contributors to ML paper implementations are practitioners collaborating with researchers
141K Followers 39 FollowingSan Diego Dec 2-7, 25 and Mexico City Nov 30-Dec 5, 25. Tweets to this account are not monitored. Please send feedback to [email protected].