Visit our #CVPR2026 poster #179 at 11:50-12:30 to learn about issues and solutions for negation in CLIP. Work led by Fawaz Sammani and Tzoulio Chamiti.

🌐 Submit and learn more here: cv4edu.github.io

Do Vision-Language Models (VLMs) actually "see" everything in a crowded room? 🔍 Today at #CVPR2026, we are presenting VisualOverload, our work exploring the critical visual perception bottlenecks of VLMs in dense scenes. 📍 Today (Poster Session 6), 5:30 PM - 7:30 PM, Poster 431 (ExHall A)

This is the first time I fully vibecoded a tool, and I was impressed by how far I got in the little time I invested. Claude (Antigravity) did not "one-shot" this, but the few bugs I found were minor details. Give it a try! github.com/paulgavrikov...

🌎 paulgavrikov.github.io/visualoverload Joint work with Wei Lin, M. Jehanzeb Mirza, Soumya Jahagirdar, Muhammad Huzaifa, Sivan Doveh, Serena Yeung-Levy, James Glass, Hilde Kuehne.

Meet Slurm Manager: a self-hosted web dashboard for Slurm clusters. Connect via SSH, monitor nodes & jobs in real time, submit scripts, view fairshare quotas — all from your browser. Basically, a handy wrapper over Slurm commands via SSH.

🚨 Deadline Approaching! 🚨 The archival deadline has officially closed, but there is still time to share your research! The Non-Archival submission deadline for CV4Edu at #CVPR2026 is coming up on April 9th, 2026 (AOE).

We expose critical flaws in existing negation benchmarks, introduce a new MLLM-as-a-judge evaluation, and show that simple task vector steering can massively boost negation performance! 🚀

Great work jointly led by Fawaz Sammani and Tzoulio Chamiti in collaboration with Nikos Deligiannis. 📖 Paper: arxiv.org/abs/2603.20554 🧑‍💻 Code: github.com/fawazsammani...

CLIP models excel at image retrieval, but they famously struggle with negation. Do we really need fine-tuning to fix this? 🤔 In our #CVPR2026 MAR workshop paper, "When Negation Is a Geometry Problem in Vision-Language Models," we rethink the problem.