Inlay

//

ProfilePosts

Loading...

Excited about this new work from @haoyuhe.bsky.social. TLDR: Diffusion language models treat learning and inference differently which lowers performance. RL can be used to overcome this issue for certain problems.

9mo

Andreas Geiger

🚨 New Paper: "Solving Spatial Supersensing Without Spatial Supersensing" Huge credit to the Cambrian-S team for tackling one of the hardest open problems in video understanding: spatial supersensing. In our paper, we take a closer look at their benchmarks & methods 👇

6mo

For VSI-Super-Counting (VSC), we run a sanity check: 🔁 VSC-Repeat: we concatenate each video with itself 1-5× ✅ Unique object count stays the same ❌ Cambrian-S accuracy drops from 42% → 0% A genuine supersensing system should be robust here.

Presenting A Sober Look at Progress in LM Reasoning at @colmweb.org today 🇨🇦 #COLM2025 📅 Today 🕔 11:00 AM – 1:00 PM 📍 Room 710 - Poster #31 We find that many “reasoning” gains fall within variance and show how to make evaluation reproducible again. 📘 bethgelab.github.io/sober-reasoning

6mo

8mo