//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
ProfilePosts




Loading...
We experimented with different backbones, camera pose representations, scalability, and attention mechanisms. Our evaluation spans hundreds of full-length videos across various metrics, without aligning the predicted trajectory to the ground truth, to simulate a real-world application
šŸ“½ļø Check out Visual Odometry Transformer! VoT is an end-to-end model for getting accurate metric camera poses from monocular videos. vladimiryugay.github.io/vot/
Thanks to the team, Kien Nguyen, Theo Gevers, @cgmsnoek.bsky.social, and @martin-r-oswald.bsky.social from the University of Amsterdam!
ā© GaME code release! github.com/VladimirYuga... Grab components for your 3D reconstruction pipeline: šŸ”¹ Purely geometric out-of-view scene change detection šŸ”¹ Outdated observations filtering šŸ”¹ Evaluation videos of changing scenes Contributions welcome šŸš€
8mo
8mo
2mo
VoT does not require calibration or post-optimization and operates in real-time, capable of processing thousands of frames. It is trained on a vast amount of real-world indoor data, but can work just fine in outdoor scenarios. It uses only camera poses as supervision, making it broadly accessible
Video
8mo
8mo
Vladimir Yugay
Vladimir Yugay
Vladimir Yugay
Vladimir Yugay
Vladimir Yugay