From the V-JEPA 2 presentation:
Action anticipation in EPIC-KITCHENS is "a very challenging task and still there’s headroom for improvement on this task".
V-JEPA 2 establishes a new SOTA but is still <65% on both verb and noun accuracy.
Mido Assran at the @iclr-conf.bsky.social workshop on World Models