Principal research scientist at Naver Labs Europe, I am interested in most aspects of computer vision, including 3D scene reconstruction and understanding, visual localization, image-text joint representation, embodied AI, ...
HOSt3R: Keypoint-free Hand-Object 3D Reconstruction from RGB images
Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel ... Grégory Rogez
arxiv.org/abs/2508.16465
Trending on www.scholar-inbox.com
We extended MUSt3R with semantic awareness and multi-view panoptic segmentation capabilities in PanSt3R, accepted at #ICCV2025
www.arxiv.org/abs/2506.21348
Excited to share our latest work in the *St3R family. PanSt3R, accepted at #ICCV25, proposes a unified and integrated approach for panoptic 3D scene reconstruction and panoptic segmentation in a single forward pass.
www.arxiv.org/abs/2506.21348
During today's #CVPR2025 workshops, I will present:
- What matters in ImageNav: architecture, pre-training, sim settings, pose (poster & highlight at the Embodied AI workshop)
- CondiMen: Conditional Multi-person Human Mesh Recovery (Poster at the Rhobin workshop and at the 3D Humans workshop)
Want the outstanding performance of MASt3R while using a ViT-B or ViT-S encoder instead of its ViT-L one? Don't miss how we built DUNE, a single encoder for diverse 2D & 3D tasks, at this afternoon's #CVPR2025 poster session (poster #376).
paper: arxiv.org/abs/2503.14405
code: github.com/naver/dune
Gabriela Csurka
Panoptic segmentation of 3D scenes, involving the segmentation and classification of object instances in a dense 3D reconstruction of a scene, is a challenging problem, especially when relying solely ...
I have created a starter pack with researchers from Naver Labs Europe @naverlabseurope.bsky.social: we are in Grenoble, France, and we do research in AI for robotics, computer vision, NLP, machine learning, HRI.
go.bsky.app/JdTFu4Q
Excited to share Maxime's latest work on Privacy Preserving Visual Localization. If interested, Maxime will present his work tomorrow at #CVPR2025, Poster Session 1, Poster Number 85 (ExHall D).
Our work on "Reasoning in visual navigation..." presented as a "Highlight" by Boris Chidlovskii and Francesco Giuliari at #cvpr2025!
Interactive site, play around with dynamical models:
europe.naverlabs.com/research/pub...
Thanks @weinzaepfelp.bsky.social for the photo.
@steevenj7.bsky.social
*3R posts are back! 🧵
Interested in SfM, RGB-SLAM or... both at the same time???
Come see MUSt3R @CVPR25 Friday morning, ExHall D Poster #82.
Jerome and Boris will be there to present how we can adapt DUSt3R to multiple views via a memory mechanism.
If you missed it earlier [...]
I gave two talks @cvprconference.bsky.social #cvpr2025
Slides are now available for those interested
1- Catch me if you can: Manoeuvre the Competition with Your Unique Abilities
tinyurl.com/StandOut-DD2...
2- Beyond Long Video Understanding
tinyurl.com/BeyondLong-D...
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
Maxime Pietrantoni, @gabrielacsurka.bsky.social, @sattlertorsten.bsky.social
arxiv.org/abs/2507.23569
DUSt3R introduced a novel paradigm in geometric computer vision by proposing a model that can provide dense and unconstrained Stereo 3D Reconstruction of arbitrary image collections with no prior info...
1/ Paper 1:
"DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers"
We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.