What if a world model could imagine the future from a completely different perspective? Introducing XVWM: given one view and an action, predict the future from another camera. A building block for theory of mind.
Collaboration with aimlabs.com
š arxiv.org/abs/2602.07277