Visit our #CVPR2026 poster #179 at 11:50-12:30 to learn about issues and solutions for negation in CLIP. Work led by Fawaz Sammani and Tzoulio Chamiti.
Submit and learn more here: cv4edu.github.io
Do Vision-Language Models (VLMs) actually "see" everything in a crowded room?
Today at #CVPR2026, we are presenting VisualOverload, our work exploring the critical visual perception bottlenecks of VLMs in dense scenes.
Today (Poster Session 6), 5:30 PM - 7:30 PM, Poster 431 (ExHall A)
This is the first time I fully vibecoded a tool, and it was impressive how far I got in the little time I invested. Claude (Antigravity) did not "one-shot" this, but the few bugs I found were smaller details. Give it a try!
github.com/paulgavrikov...
paulgavrikov.github.io/visualoverload
Joint work with Wei Lin, M. Jehanzeb Mirza, Soumya Jahagirdar, Muhammad Huzaifa, Sivan Doveh, Serena Yeung-Levy, James Glass, Hilde Kuehne.
Meet Slurm Manager: a self-hosted web dashboard for Slurm clusters.
Connect via SSH, monitor nodes & jobs in real time, submit scripts, view fairshare quotas, all from your browser. Basically, a handy wrapper over Slurm commands via SSH.
Deadline Approaching!
The archival deadline has officially closed, but there is still time to share your research! The Non-Archival submission deadline for CV4Edu at #CVPR2026 is coming up on April 9th, 2026 (AOE).
We expose critical flaws in existing negation benchmarks, introduce a new MLLM-as-a-judge evaluation, and show that simple task vector steering can massively boost negation performance!
Great work jointly led by Fawaz Sammani and Tzoulio Chamiti in collaboration with Nikos Deligiannis.
Paper: arxiv.org/abs/2603.20554
Code: github.com/fawazsammani...
CLIP models excel at image retrieval, but they famously struggle with negation. Do we really need fine-tuning to fix this? In our #CVPR2026 MAR workshop paper, "When Negation Is a Geometry Problem in Vision-Language Models," we rethink the problem.