Question on the squish/pad
RoMa v1, v2, LoMa squish and happy.
On the other hand, VGGT, *3R family use complex pipeline: N possible sizes, resize-to-nearest+0 pad. Have you tried the opposite approach?
@jianyuanwang.bsky.social @parskatt.bsky.social @davnords.bsky.social @vincentleroy.bsky.social