We extended MUSt3R with semantic awareness and multi-view panoptic segmentation capabilities in PanSt3R, accepted at #ICCV2025
www.arxiv.org/abs/2506.21348
Our work on "Reasoning in visual navigation..." was presented as a "Highlight" by Boris Chidlovskii and Francesco Giuliari at #cvpr2025!
Interactive site, play around with dynamical models:
europe.naverlabs.com/research/pub...
Thanks @weinzaepfelp.bsky.social for the photo.
@steevenj7.bsky.social
I gave two talks @cvprconference.bsky.social #cvpr2025
Slides are now available for those interested
1- Catch me if you can: Manoeuvre the Competition
with Your Unique Abilities
tinyurl.com/StandOut-DD2...
2- Beyond Long Video Understanding
tinyurl.com/BeyondLong-D...
During today's #CVPR2025 workshops, I will present:
- What matters in ImageNav: architecture, pre-training, sim settings, pose (poster & highlight at the Embodied AI workshop)
- CondiMen: Conditional Multi-person Human Mesh Recovery (Poster at the Rhobin workshop and at the 3D Humans workshop)
Excited to share Maxime's latest work on Privacy Preserving Visual Localization. If interested, Maxime will present his work tomorrow at #CVPR2025, Poster Session 1, Poster Number 85 (ExHall D).
Excited to share our latest work in the *St3R family. PanSt3R, accepted at #ICCV25, proposes a unified and integrated approach for panoptic 3D scene reconstruction and panoptic segmentation in a single forward pass.
www.arxiv.org/abs/2506.21348
I have created a starter pack with researchers from Naver Labs Europe @naverlabseurope.bsky.social: we are in Grenoble, France, and we do research in AI for robotics, computer vision, NLP, machine learning, HRI.
go.bsky.app/JdTFu4Q
Want the outstanding performance of MASt3R while using a ViT-B or ViT-S encoder instead of its ViT-L one? Don't miss how we build DUNE, a single encoder for diverse 2D & 3D tasks, at this afternoon's #CVPR2025 poster session (poster #376).
paper: arxiv.org/abs/2503.14405
code: github.com/naver/dune
HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images
Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel ... Grégory Rogez
arxiv.org/abs/2508.16465
Trending on www.scholar-inbox.com
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
Maxime Pietrantoni, @gabrielacsurka.bsky.social, @sattlertorsten.bsky.social
arxiv.org/abs/2507.23569
Panoptic segmentation of 3D scenes, involving the segmentation and classification of object instances in a dense 3D reconstruction of a scene, is a challenging problem, especially when relying solely ...
1/ Paper 1:
"DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers"
We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.
*3R posts are back! 🧵
Interested in SfM, RGB-SLAM or... both at the same time???
Come see MUSt3R @CVPR25 Friday morning, ExHall D Poster #82.
Jerome and Boris will be there to present how we can adapt DUSt3R to multiple views via a memory mechanism.
If you missed it earlier [...]
DUSt3R introduced a novel paradigm in geometric computer vision by proposing a model that can provide dense and unconstrained Stereo 3D Reconstruction of arbitrary image collections with no prior info...