Is basic image understanding solved in today’s SOTA VLMs? Not quite.
We present VisualOverload, a VQA benchmark testing simple vision skills (like counting & OCR) in dense scenes. Even the best model (o3) only scores 19.8% on our hardest split.
קבלנו פידבקים מעולים על האג'נדה אבל לצערנו עדיין חסרות לנו דוברות בכנס.
אנחנו מנסים במספר ערוצים אבל אשמח לחיבורים לחוקרות/מפתחות/CTO שמתעסקות בתחום הדיפנס ותרצנה להציג בכנס שלנו.
הנה הכנס: machinelearning.co.il/lp-events/defe…
קבלנו פידבקים מעולים על האג'נדה אבל לצערנו עדיין חסרות לנו דוברות בכנס.
אנחנו מנסים במספר ערוצים אבל אשמח לחיבורים לחוקרות/מפתחות/CTO שמתעסקות בתחום הדיפנס ותרצנה להציג בכנס שלנו.
הנה הכנס: machinelearning.co.il/lp-events/defe…
Like old wine 🙃 Glov finally got into TMLR.
Not enough people talk about the rejections, rewrites & reviewer chaos it takes to get there.
Shoutout to @jmie_mirza for the honest post. More of this, please. Normalize the messy middle.
🔔 Last 24 hours!! 🔔
Don’t shelve that great idea!
Submit your paper to the LongVid-Foundations Workshop @ICCVConference and make part of the discussion!
📌 Proceedings track deadline: July 1st 11:59PM UTC-0
👉 openreview.net/group?id=thecv…#ICCV2025
Working on videos that are longer than 8 seconds? Want to visit Hawaii? Consider submitting to this workshop 😁
LongVid-Foundations @ICCVConference!
Proceedings: July 1, 2025
No Proceedings: Aug 30, 2025
Link: ramoscsv.github.io/longvid_founda…
#ICCV2025
⚠️ NEW DATES + NEW TRACK for LongVid-Foundations @ICCVConference!
Submit work & learn from leading experts: Katerina Fragkiadaki, Ishan Misra, Sayak Paul and Jiajun Wu!
Proceedings: July 1, 2025
No Proceedings: Aug 30, 2025
Link: ramoscsv.github.io/longvid_founda…#ICCV2025
Nimrod will present liveXiv - an evolving dataset that tackles contamination - in Two days at @iclr_conf - super cool work and a super cool presenter 🤩
Nimrod will present liveXiv - an evolving dataset that tackles contamination - in Two days at @iclr_conf - super cool work and a super cool presenter 🤩
LiveXiv accepted to ICLR :)
It dynamically generates evolving benchmark from ArXiv to mitigate data contamination, ensuring ML models are evaluated on truly unseen data.
LiveXiv accepted to ICLR :)
It dynamically generates evolving benchmark from ArXiv to mitigate data contamination, ensuring ML models are evaluated on truly unseen data.
Excited to share that our 3rd Multimodal Workshop has been accepted to CVPR 2025 in Nashville! 🎉 Looking forward to advancing discussions on vision-language models, compositional reasoning, and contextual understanding. See you there!
@CVPR
Just back from NeurIPS where we presented 'ConMe', exploring how VLMs handle compositional reasoning. Loved catching up with old friends and making new connections. A perfect reminder that I should start planning for the May deadline! 😊
4K Followers 767 Followingעכשיו מתכנתת, בעבר מנהלת מוצר ושיווק, אה וגם צלמת. בונה כמה מוצרים ומשתפת את המסע 🙆♀️
Product manager turned developer. Building software in public ^_^
543K Followers 24K FollowingThe best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
107 Followers 120 FollowingPhD, Max Planck Institute for Informatics, supervised by Prof. Bernt Schiele.
Computer Vision @ IIT Delhi, Mathematics and Statistics @ IISER Kolkata.
4K Followers 2K FollowingHead of Volumetric 3D Video at Meta
Prev Projects: Hyperscape, MapAnything, Dynamic 3D Gaussian Splatting, SplaTAM, HOTA +more
Prev PhD at RWTH + CMU + Oxford
1K Followers 8K FollowingAI inference, speculative decoding, open source. Built novel decoding algorithms – default in Hugging Face Transformers (150+ ⭐). Making AI faster + cheaper
772 Followers 4K FollowingPhysics. AI. Space eXploration. Astrophysics. Traveller. Observer. Pronounces: Protein implementation of intelligence in the universe.
45 Followers 714 FollowingMSc @CMU_Africa/@CMUEngineering | Exploring Vision x Language x Security | Focused on Computer Vision, HealthTech, and Cybersecurity
4K Followers 767 Followingעכשיו מתכנתת, בעבר מנהלת מוצר ושיווק, אה וגם צלמת. בונה כמה מוצרים ומשתפת את המסע 🙆♀️
Product manager turned developer. Building software in public ^_^
4K Followers 466 FollowingResearch scientist @AIatMeta (FAIR), prev/visiting @WeizmannScience. Interested in generative models and deep learning of irregular/geometric data.🎗️
128K Followers 984 FollowingPartner @a16z AI 🤖 and twin to @omooretweets | Investor in @elevenlabsio, @krea_ai, @bfl_ml, @hedra_labs, @WaveFormsAI, @ViggleAI, & more
42K Followers 1K FollowingSecular Bayesian.
Professor of Machine Learning @Cambridge_CL. Talent aficionado at https://t.co/RbJkoLguey
Alum of @Twitter, Magic Pony and @Balderton
496 Followers 350 FollowingIndependent Researcher | Deep Learning, Computer Vision, Model Interpretability | Former Postdoc @uni_tue, PhD @ Uni Mannheim
125K Followers 683 FollowingAuthor of the book The Curse of God - Why I Left Islam. Get your copy from https://t.co/1M1kCNEMK7. https://t.co/euzRzfnrHu
495K Followers 152 FollowingNobel Laureate. Co-Founder & CEO @GoogleDeepMind - working on AGI. Solving disease @IsomorphicLabs. Trying to understand the fundamental nature of reality.