Inlay

//

Profile

Loading...

We extended MUSt3R with semantic awareness and multi-view panoptic segmentation capabilities in PanSt3R, accepted at #ICCV2025 www.arxiv.org/abs/2506.21348

Our work on "Reasoning in visual navigation..." presented as a "Highlight" by Boris Chidlovskii and Francesco Giuliari at #cvpr2025! Interactive site, play around with dynamical models: europe.naverlabs.com/research/pub... Thanks @weinzaepfelp.bsky.social for the photo. @steevenj7.bsky.social

I gave two talks @cvprconference.bsky.social #cvpr2025 Slides are now available for those interested 1- Catch me if you can: Manoeuvre the Competition with Your Unique Abilities tinyurl.com/StandOut-DD2... 2- Beyond Long Video Understanding tinyurl.com/BeyondLong-D...

During today's #CVPR2025 workshops, I will present: - What matters in ImageNav: architecture, pre-training, sim settings, pose (poster & highlight at the Embodied AI workshop) - CondiMen: Conditional Multi-person Human Mesh Recovery (Poster at the Rhobin workshop and at the 3D Humans workshop)

Excited to share Maxime’s latest work on Privacy Preserving Visual Localization. If interested Maxime will present his work tomorrow at #CVPR2025, Poster Session 1, Poster Number 85 (ExHall D).

Excited to share our latest work in the *St3R family. PanSt3R, accepted at #ICCV25 proposes a unified and integrated approach for panoptic 3D scene reconstruction and panoptic segmentation in a single forward pass. www.arxiv.org/abs/2506.21348

I have created a starter pack with researchers from Naver Labs Europe @naverlabseurope.bsky.social: we are in Grenoble, France, and we do research in AI for robotics, computer vision, NLP, machine learning, HRI. go.bsky.app/JdTFu4Q

Wanna the outstanding performance of MASt3R while using a ViT-B or ViT-S encoder instead of its ViT-L one? Don't miss how we build DUNE, a single encoder for diverse 2D & 3D tasks, at this afternoon #CVPR2025 poster session (poster #376). paper: arxiv.org/abs/2503.14405 code: github.com/naver/dune

𝗛𝗢𝗦𝘁𝟯𝗥: 𝗞𝗲𝘆𝗽𝗼𝗶𝗻𝘁-𝗳𝗿𝗲𝗲 𝗛𝗮𝗻𝗱-𝗢𝗯𝗷𝗲𝗰𝘁 𝟯𝗗 𝗥𝗲𝗰𝗼𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻 𝗳𝗿𝗼𝗺 𝗥𝗚𝗕 𝗶𝗺𝗮𝗴𝗲𝘀 Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel ... Grégory Rogez arxiv.org/abs/2508.16465 Trending on www.scholar-inbox.com

Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization Maxime Pietrantoni, @gabrielacsurka.bsky.social, @sattlertorsten.bsky.social arxiv.org/abs/2507.23569

11mo

Jun 14, 2025

11mo

Jun 12, 2025

11mo

Jun 14, 2025

Jun 15, 2025

9mo

at://did:plc:ghwikxokwu7h7punwyvsqxxc/app.bsky.graph.starterpack/3lrlncemlxy2r

10mo

Panoptic segmentation of 3D scenes, involving the segmentation and classification of object instances in a dense 3D reconstruction of a scene, is a challenging problem, especially when relying solely ...

www.arxiv.org

PanSt3R: Multi-view Consistent Panoptic Segmentation

Gabriela Csurka

Christian Wolf

Dima Damen

Philippe Weinzaepfel

Gabriela Csurka

Christian Wolf

Philippe Weinzaepfel

Vision and Graphics Trends

Zhenjun Zhao

1/ 📄 Paper 1: "DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers" We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.

Jun 9, 2025

*3R posts are back! 🧵 Interested in SfM, RGB-SLAM or... both at the same time??? Come see MUSt3R @CVPR25 Friday morning, ExHall D Poster #82. Jerome and Boris will be there to present how we can adapt DUSt3R to multiple views via a memory mechanism. If you missed it earlier [...]

Jun 12, 2025

DUSt3R introduced a novel paradigm in geometric computer vision by proposing a model that can provide dense and unconstrained Stereo 3D Reconstruction of arbitrary image collections with no prior info...

arxiv.org

MUSt3R: Multi-view Network for Stereo 3D Reconstruction

Vincent Leroy

Yannis Kalantidis