What's more, (iii) straightening encourages the latent Euclidean distance to better align with the geodesic distance; (iv) near-perfect reconstruction can be attained with a very low feature dimensionality (we can reduce the embedding dimension from 384 to 8!)
2mo
More details can be found in our paper: arxiv.org/abs/2603.12231. Many thanks to my collaborators @oumayma @gaoyuezhou.bsky.social @randall @timrudner.bsky.social and amazing advisors @yann-lecun.bsky.social @mengyer.bsky.social for their guidance and support 💜💜💜
What is a good latent space for world modeling and planning? 🤔 Inspired by the perceptual straightening hypothesis in human vision, we introduce temporal straightening to improve representation learning for latent planning. 📝: agenticlearning.ai/temporal-str...
Inspired by the perceptual straightening hypothesis (human visual systems transform natural videos into straighter internal representations), we introduce a simple fix: jointly learn an encoder & a predictor (JEPA-style) with a regularizer on the curvature of latent trajectories.
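To make "curvature of latent trajectories" concrete, here is a minimal sketch of one common way to measure it: one minus the cosine similarity between consecutive latent velocity vectors. The function name `mean_curvature` and the toy trajectories are assumptions for illustration; the exact regularizer used in the paper may differ.

```python
import numpy as np

def mean_curvature(z, eps=1e-8):
    """Mean of (1 - cosine similarity) between consecutive latent velocities.

    z: array of shape (T, D), one latent trajectory with T >= 3 steps.
    Returns ~0 for a straight trajectory and up to 2 for one that reverses.
    NOTE: an illustrative definition, not necessarily the paper's regularizer.
    """
    v = np.diff(z, axis=0)                    # velocities, shape (T-1, D)
    norms = np.linalg.norm(v, axis=1) + eps   # eps avoids division by zero
    cos = np.sum(v[:-1] * v[1:], axis=1) / (norms[:-1] * norms[1:])
    return float(np.mean(1.0 - cos))

# Hypothetical toy trajectories in a 2-D latent space.
straight = np.outer(np.arange(5), np.array([1.0, 2.0]))            # constant velocity
zigzag = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [2.0, 1.0]])  # 90-degree turns

curv_straight = mean_curvature(straight)
curv_zigzag = mean_curvature(zigzag)
```

In a training loop, a term like this would be added to the predictor loss with some weight, so the encoder is pushed toward trajectories whose consecutive steps point in similar directions.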
Large-scale visual pretraining is useful but NOT enough! It's not tailored to the dynamics of the environment and retains many planning-irrelevant low-level details. For example, in DINOv2 feature space, latent trajectories are curved & L2 distances don't reflect geodesic distances.
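A quick way to see the gap between L2 and geodesic distance is to compare the straight-line distance between a trajectory's endpoints with the length of the path actually travelled. The sketch below uses path length along the trajectory as a stand-in for geodesic distance, which is an assumption; `straightness_ratio` and the toy trajectories are hypothetical names and data, not from the paper.

```python
import numpy as np

def straightness_ratio(z):
    """Endpoint L2 distance divided by path length along the trajectory.

    z: array of shape (T, D) with T >= 2 and non-zero path length.
    Close to 1.0 means L2 distance tracks the travelled distance;
    smaller values mean the trajectory is curved.
    """
    chord = np.linalg.norm(z[-1] - z[0])
    path = np.sum(np.linalg.norm(np.diff(z, axis=0), axis=1))
    return float(chord / path)

# Toy curved trajectory: 50 points along a unit semicircle.
theta = np.linspace(0.0, np.pi, 50)
curved = np.stack([np.cos(theta), np.sin(theta)], axis=1)

# Toy straight trajectory: 10 evenly spaced points along a line.
straight = np.linspace(0.0, 1.0, 10)[:, None] * np.array([3.0, 4.0])

ratio_curved = straightness_ratio(curved)      # roughly 2 / pi
ratio_straight = straightness_ratio(straight)  # roughly 1
```

On the semicircle the endpoints look much closer in L2 than the distance needed to travel between them, which is the kind of mismatch that can mislead a planner using L2 distance to the goal as its cost.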
The resulting embedding space has many good properties! We find that (i) implicit straightening can happen when training the encoder using the predictor loss alone; (ii) adding straightening regularization further decreases curvature of the resulting embeddings;
Learning good representations is essential for latent planning with world models. While pretrained visual encoders produce strong semantic visual features, they are not tailored to planning and contai...
arxiv.org
Temporal Straightening for Latent Planning
Straightening also makes the loss landscape closer to convex and better conditioned, improving gradient-based planning. We test on four goal-reaching tasks and observe a significant boost in open-loop and MPC success rates using gradient descent.
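As a rough illustration of why a straight latent space helps gradient-based planning, here is a toy sketch that plans actions by gradient descent on the squared distance to a goal latent. It assumes an idealized, perfectly straight dynamics model `z_{t+1} = z_t + a_t`, which makes the planning loss convex; this dynamics model, the function `plan_actions`, and all hyperparameters are assumptions for illustration, not the paper's setup.

```python
import numpy as np

def plan_actions(z0, z_goal, horizon=5, lr=0.05, steps=50):
    """Open-loop planning by gradient descent under toy additive dynamics.

    Loss: ||z_T - z_goal||^2, where z_T = z0 + sum_t a_t (assumed dynamics).
    """
    actions = np.zeros((horizon, z0.shape[0]))
    for _ in range(steps):
        z_final = z0 + actions.sum(axis=0)  # roll out z_{t+1} = z_t + a_t
        grad = 2.0 * (z_final - z_goal)     # gradient w.r.t. each a_t (identical for all t)
        actions -= lr * grad                # broadcast the same update to every step
    return actions

# Hypothetical start and goal latents in an 8-dimensional space.
z0 = np.zeros(8)
z_goal = np.arange(8, dtype=float)

actions = plan_actions(z0, z_goal)
z_reached = z0 + actions.sum(axis=0)
```

With these settings each gradient step halves the remaining distance to the goal, so plain gradient descent converges quickly. With a learned, curved latent space the same loss can be non-convex and poorly conditioned, which is the failure mode straightening is meant to reduce.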
yingwww.bsky.social