[5/8] ESWM also supports efficient exploration by acting on uncertainty to collect experiences and navigate between states.
[6/8] When environments change (e.g., new obstacles), ESWM adapts by updating its temporally and spatially independent memories. No retraining is needed.
Video abstract [2/2]
[1/8] Existing world models rely on a sequence of observations to predict future states. This leads to: 1) redundancy due to temporal overlap (contexts grow for large envs), 2) limited adaptability when environments change due to temporal dependency.
[7/8] Beyond Grid World, ESWM is scalable to the more complex MiniGrid (high-dimensional observation) and 3D indoor scenes ProcThor (realistic pixel observations).
New paper 🚨 #ICLR26
Most world models predict the future from a past trajectory. But neuroscience suggests that such inference can instead be made from temporally independent experiences.
We built the Episodic Spatial World Model (ESWM), a model that does exactly this:
Video abstract [1/2]