Marc Pollefeys

162
Papers
469
Total Citations

Papers (162)

NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild

CVPR 2024
64
citations

LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry

CVPR 2024
44
citations

GLACE: Global Local Accelerated Coordinate Encoding

CVPR 2024
39
citations

VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation

CVPR 2025
29
citations

WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments

CVPR 2025
29
citations

Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion

CVPR 2024
26
citations

EgoGen: An Egocentric Synthetic Data Generator

CVPR 2024
24
citations

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces

CVPR 2025
23
citations

GeoCalib: Learning Single-image Calibration with Geometric Optimization

ECCV 2024
23
citations

Infrastructure-based Multi-Camera Calibration using Radial Projections

ECCV 2020
20
citations

Multi-Level Neural Scene Graphs for Dynamic Urban Environments

CVPR 2024
20
citations

Diffusion Bridges for 3D Point Cloud Denoising

ECCV 2024arXiv
15
citations

F3Loc: Fusion and Filtering for Floorplan Localization

CVPR 2024
13
citations

Where am I? Scene Retrieval with Language

ECCV 2024arXiv
13
citations

3D Neural Edge Reconstruction

CVPR 2024
13
citations

Matching neural paths: transfer from recognition to correspondence search

NeurIPS 2017arXiv
11
citations

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

CVPR 2024
10
citations

Learning to Make Keypoints Sub-Pixel Accurate

ECCV 2024
9
citations

MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion

CVPR 2025
9
citations

CrossOver: 3D Scene Cross-Modal Alignment

CVPR 2025
7
citations

FlowR: Flowing from Sparse to Dense 3D Reconstructions

ICCV 2025arXiv
7
citations

Video Perception Models for 3D Scene Synthesis

NeurIPS 2025
5
citations

ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding

CVPR 2025
5
citations

EgoM2P: Egocentric Multimodal Multitask Pretraining

ICCV 2025
4
citations

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

ICCV 2025
3
citations

Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information

ECCV 2024arXiv
2
citations

CroCoDL: Cross-device Collaborative Dataset for Localization

CVPR 2025
1
citations

Multi-View 3D Point Tracking

ICCV 2025
1
citations

Sparse to Dense 3D Reconstruction From Rolling Shutter Images

CVPR 2016
0
citations

Semantic 3D Reconstruction With Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint

CVPR 2016
0
citations

Designing Effective Inter-Pixel Information Flow for Natural Image Matting

CVPR 2017arXiv
0
citations

SGM-Nets: Semi-Global Matching With Neural Networks

CVPR 2017
0
citations

Comparative Evaluation of Hand-Crafted and Learned Local Features

CVPR 2017
0
citations

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

CVPR 2017
0
citations

Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection

CVPR 2017
0
citations

A Multi-View Stereo Benchmark With High-Resolution Images and Multi-Camera Videos

CVPR 2017
0
citations

Toroidal Constraints for Two-Point Localization Under High Outlier Ratios

CVPR 2017
0
citations

Consensus Maximization With Linear Matrix Inequality Constraints

CVPR 2017
0
citations

Fast 3D Reconstruction of Faces With Glasses

CVPR 2017
0
citations

Hybrid Camera Pose Estimation

CVPR 2018
0
citations

Augmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections

CVPR 2018
0
citations

Semantic Visual Localization

CVPR 2018arXiv
0
citations

InLoc: Indoor Visual Localization With Dense Matching and View Synthesis

CVPR 2018arXiv
0
citations

Consensus Maximization for Semantic Region Correspondences

CVPR 2018
0
citations

Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions

CVPR 2018arXiv
0
citations

BAD SLAM: Bundle Adjusted Direct RGB-D SLAM

CVPR 2019
0
citations

Understanding the Limitations of CNN-Based Absolute Camera Pose Regression

CVPR 2019
0
citations

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene From Sparse LiDAR Data and Single Color Image

CVPR 2019
0
citations

H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions

CVPR 2019
0
citations

Privacy Preserving Image-Based Localization

CVPR 2019
0
citations

Hybrid Scene Compression for Visual Localization

CVPR 2019
0
citations

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

CVPR 2019
0
citations

A Cross-Season Correspondence Dataset for Robust Semantic Segmentation

CVPR 2019
0
citations

3D Appearance Super-Resolution With Deep Learning

CVPR 2019
0
citations

Why Having 10,000 Parameters in Your Camera Model Is Better Than Twelve

CVPR 2020
0
citations

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

CVPR 2020arXiv
0
citations

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

CVPR 2020arXiv
0
citations

Geometry-Aware Satellite-to-Ground Image Synthesis for Urban Areas

CVPR 2020
0
citations

Self-Supervised Human Depth Estimation From Monocular Videos

CVPR 2020arXiv
0
citations

Deep Shutter Unrolling Network

CVPR 2020
0
citations

RoutedFusion: Learning Real-Time Depth Map Fusion

CVPR 2020arXiv
0
citations

Privacy Preserving Localization and Mapping From Uncalibrated Cameras

CVPR 2021
0
citations

Holistic 3D Scene Understanding From a Single Image With Implicit Representation

CVPR 2021arXiv
0
citations

Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings

CVPR 2021arXiv
0
citations

DeepVideoMVS: Multi-View Stereo on Video With Recurrent Spatio-Temporal Fusion

CVPR 2021arXiv
0
citations

Back to the Feature: Learning Robust Camera Localization From Pixels To Pose

CVPR 2021arXiv
0
citations

NeuralFusion: Online Depth Fusion in Latent Space

CVPR 2021arXiv
0
citations

DeFMO: Deblurring and Shape Recovery of Fast Moving Objects

CVPR 2021arXiv
0
citations

SOLD2: Self-Supervised Occlusion-Aware Line Description and Detection

CVPR 2021arXiv
0
citations

PatchmatchNet: Learned Multi-View Patchmatch Stereo

CVPR 2021arXiv
0
citations

DeepSurfels: Learning Online Appearance Fusion

CVPR 2021arXiv
0
citations

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

CVPR 2022
0
citations

IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo

CVPR 2022arXiv
0
citations

Motion-From-Blur: 3D Shape and Motion Estimation of Motion-Blurred Objects in Videos

CVPR 2022
0
citations

Context-Aware Sequence Alignment Using 4D Skeletal Augmentation

CVPR 2022arXiv
0
citations

Camera Pose Estimation Using Implicit Distortion Models

CVPR 2022
0
citations

Privacy Preserving Partial Localization

CVPR 2022
0
citations

Learning To Align Sequential Actions in the Wild

CVPR 2022arXiv
0
citations

Learning To Find Good Models in RANSAC

CVPR 2022
0
citations

DeepLSD: Line Segment Detection and Refinement With Deep Image Gradients

CVPR 2023arXiv
0
citations

Removing Objects From Neural Radiance Fields

CVPR 2023arXiv
0
citations

VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction

CVPR 2023arXiv
0
citations

OpenScene: 3D Scene Understanding With Open Vocabularies

CVPR 2023arXiv
0
citations

3D Line Mapping Revisited

CVPR 2023arXiv
0
citations

Four-View Geometry With Unknown Radial Distortion

CVPR 2023
0
citations

Optimizing the Viewing Graph for Structure-From-Motion

ICCV 2015
0
citations

Entropy Minimization for Convex Relaxation Approaches

ICCV 2015
0
citations

Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition

ICCV 2015
0
citations

Merging the Unmatchable: Stitching Visually Disconnected SfM Models

ICCV 2015
0
citations

Non-Parametric Structure-Based Calibration of Radially Symmetric Cameras

ICCV 2015
0
citations

Camera Pose Voting for Large-Scale Image-Based Localization

ICCV 2015
0
citations

Semantically Informed Multiview Surface Refinement

ICCV 2017arXiv
0
citations

From Point Clouds to Mesh Using Regression

ICCV 2017
0
citations

Revisiting Radial Distortion Absolute Pose

ICCV 2019
0
citations

Privacy Preserving Image Queries for Camera Localization

ICCV 2019
0
citations

Polarimetric Relative Pose Estimation

ICCV 2019
0
citations

MBA-VO: Motion Blur Aware Visual Odometry

ICCV 2021
0
citations

FMODetect: Robust Detection of Fast Moving Objects

ICCV 2021
0
citations

Orthographic-Perspective Epipolar Geometry

ICCV 2021
0
citations

H2O: Two Hands Manipulating Objects for First Person Interaction Recognition

ICCV 2021
0
citations

Pixel-Perfect Structure-From-Motion With Featuremetric Refinement

ICCV 2021arXiv
0
citations

Sat2Vid: Street-View Panoramic Video Synthesis From a Single Satellite Image

ICCV 2021arXiv
0
citations

Cross-Descriptor Visual Localization and Mapping

ICCV 2021
0
citations

Towards Efficient Graph Convolutional Networks for Point Cloud Handling

ICCV 2021arXiv
0
citations

Learning Motion Priors for 4D Human Body Capture in 3D Scenes

ICCV 2021arXiv
0
citations

Tracking by 3D Model Estimation of Unknown Objects in Videos

ICCV 2023arXiv
0
citations

LightGlue: Local Feature Matching at Light Speed

ICCV 2023arXiv
0
citations

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

ICCV 2023arXiv
0
citations

R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras

ICCV 2023arXiv
0
citations

SGAligner: 3D Scene Alignment with Scene Graphs

ICCV 2023arXiv
0
citations

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

ICCV 2023arXiv
0
citations

GlueStick: Robust Image Matching by Sticking Points and Lines Together

ICCV 2023
0
citations

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

ICCV 2023arXiv
0
citations

RLSAC: Reinforcement Learning Enhanced Sample Consensus for End-to-End Robust Estimation

ICCV 2023arXiv
0
citations

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

ICCV 2023
0
citations

Privacy Preserving Localization via Coordinate Permutations

ICCV 2023
0
citations

Guiding Local Feature Matching with Surface Curvature

ICCV 2023
0
citations

Human from Blur: Human Pose Tracking from Blurry Images

ICCV 2023arXiv
0
citations

Privacy Preserving Structure-from-Motion

ECCV 2020
0
citations

Multi-View Optimization of Local Feature Geometry

ECCV 2020
0
citations

Online Invariance Selection for Local Feature Descriptors

ECCV 2020
0
citations

Convolutional Occupancy Networks

ECCV 2020
0
citations

Calibration-free Structure-from-Motion with Calibrated Radial Trifocal Tensors

ECCV 2020
0
citations

Handcrafted Outlier Detection Revisited

ECCV 2020
0
citations

CompNVS: Novel View Synthesis with Scene Completion

ECCV 2022
0
citations

EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

ECCV 2022
0
citations

LaMAR: Benchmarking Localization and Mapping for Augmented Reality

ECCV 2022
0
citations

NeFSAC: Neurally Filtered Minimal Samples

ECCV 2022
0
citations

3D Instance Segmentation via Multi-Task Metric Learning

ICCV 2019
0
citations

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors

CVPR 2025
0
citations

EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision

CVPR 2025
0
citations

DepthSplat: Connecting Gaussian Splatting and Depth

CVPR 2025
0
citations

Learning to Filter Outlier Edges in Global SfM

CVPR 2025
0
citations

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

CVPR 2025
0
citations

R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization

CVPR 2025
0
citations

Structure-from-Motion with a Non-Parametric Camera Model

CVPR 2025
0
citations

CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization

ICCV 2025
0
citations

HouseTour: A Virtual Real Estate A(I)gent

ICCV 2025
0
citations

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

ICCV 2025
0
citations

Planar Affine Rectification from Local Change of Scale and Orientation

ICCV 2025
0
citations

SuperDec: 3D Scene Decomposition with Superquadrics Primitives

ICCV 2025
0
citations

Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras

ICCV 2025
0
citations

Object-X: Learning to Reconstruct Multi-Modal 3D Object Representations

NeurIPS 2025
0
citations

SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes

CVPR 2024
0
citations

MuRF: Multi-Baseline Radiance Fields

CVPR 2024
0
citations

SNI-SLAM: Semantic Neural Implicit SLAM

CVPR 2024
0
citations

Efficient Solution of Point-Line Absolute Pose

CVPR 2024
0
citations

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

CVPR 2024
0
citations

Multiway Point Cloud Mosaicking with Diffusion and Global Optimization

CVPR 2024
0
citations

Direction Matters: Depth Estimation With a Surface Normal Classifier

CVPR 2015
0
citations

Segment Based 3D Object Shape Priors

CVPR 2015
0
citations

Scalable Structure From Motion for Densely Sampled Videos

CVPR 2015
0
citations

Discrete Optimization of Ray Potentials for Semantic 3D Reconstruction

CVPR 2015
0
citations

TI-Pooling: Transformation-Invariant Pooling for Feature Learning in Convolutional Neural Networks

CVPR 2016
0
citations

Large-Scale Location Recognition and the Geometric Burstiness Problem

CVPR 2016
0
citations

Do It Yourself Hyperspectral Imaging With Everyday Digital Cameras

CVPR 2016
0
citations

Reflection Separation using a Pair of Unpolarized and Polarized Images

NeurIPS 2019
0
citations

Shape As Points: A Differentiable Poisson Solver

NeurIPS 2021
0
citations

Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

NeurIPS 2021
0
citations

SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding

NeurIPS 2023
0
citations

The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes

NeurIPS 2023
0
citations

OpenMask3D: Open-Vocabulary 3D Instance Segmentation

NeurIPS 2023
0
citations