Xiaowei Zhou

81
Papers
555
Total Citations

Papers (81)

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed

CVPR 2024
142
citations

SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation

ECCV 2020
102
citations

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

CVPR 2025
92
citations

IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination

ECCV 2024 (arXiv)
54
citations

Generating Human Motion in 3D Scenes from Text Descriptions

CVPR 2024
46
citations

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

CVPR 2025
44
citations

SAM-guided Graph Cut for 3D Instance Segmentation

ECCV 2024
32
citations

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

CVPR 2025
16
citations

FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction

CVPR 2025
15
citations

Multi-view Reconstruction via SfM-guided Monocular Depth Estimation

CVPR 2025
11
citations

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

ICCV 2025
1
citation

EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds

ICCV 2025
0
citations

Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction

ICCV 2025
0
citations

Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations

ICCV 2025
0
citations

EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors

CVPR 2024
0
citations

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

CVPR 2024
0
citations

Relightable and Animatable Neural Avatar from Sparse-View Video

CVPR 2024
0
citations

SpatialTracker: Tracking Any 2D Pixels in 3D Space

CVPR 2024
0
citations

4K4D: Real-Time 4D View Synthesis at 4K Resolution

CVPR 2024
0
citations

Detector-Free Structure from Motion

CVPR 2024
0
citations

3D Shape Estimation From 2D Landmarks: A Convex Relaxation Approach

CVPR 2015
0
citations

Sparseness Meets Deepness: 3D Human Pose Estimation From Monocular Video

CVPR 2016
0
citations

Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations

CVPR 2017 (arXiv)
0
citations

Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose

CVPR 2017 (arXiv)
0
citations

Learning to Estimate 3D Human Pose and Shape From a Single Color Image

CVPR 2018 (arXiv)
0
citations

Multi-Image Semantic Matching by Mining Consistent Features

CVPR 2018 (arXiv)
0
citations

Ordinal Depth Supervision for 3D Human Pose Estimation

CVPR 2018 (arXiv)
0
citations

Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion

CVPR 2019
0
citations

PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation

CVPR 2019
0
citations

Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views

CVPR 2019
0
citations

Learning Transformation Synchronization

CVPR 2019
0
citations

Path-Invariant Map Networks

CVPR 2019
0
citations

Coherent Reconstruction of Multiple Humans From a Single Image

CVPR 2020 (arXiv)
0
citations

Deep Snake for Real-Time Instance Segmentation

CVPR 2020 (arXiv)
0
citations

Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation

CVPR 2020
0
citations

Reconstructing 3D Human Pose by Watching Humans in the Mirror

CVPR 2021 (arXiv)
0
citations

Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans

CVPR 2021 (arXiv)
0
citations

VS-Net: Voting With Segmentation for Visual Localization

CVPR 2021
0
citations

LoFTR: Detector-Free Local Feature Matching With Transformers

CVPR 2021 (arXiv)
0
citations

NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video

CVPR 2021 (arXiv)
0
citations

Neural Rays for Occlusion-Aware Image-Based Rendering

CVPR 2022 (arXiv)
0
citations

Neural 3D Scene Reconstruction With the Manhattan-World Assumption

CVPR 2022 (arXiv)
0
citations

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

CVPR 2022 (arXiv)
0
citations

Modeling Indirect Illumination for Inverse Rendering

CVPR 2022 (arXiv)
0
citations

OnePose: One-Shot Object Pose Estimation Without CAD Models

CVPR 2022 (arXiv)
0
citations

PlanarRecon: Real-Time 3D Plane Detection and Reconstruction From Posed Monocular Videos

CVPR 2022
0
citations

Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation

CVPR 2022 (arXiv)
0
citations

Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask

CVPR 2023 (arXiv)
0
citations

Neural Scene Chronology

CVPR 2023
0
citations

Learning Human Mesh Recovery in 3D Scenes

CVPR 2023
0
citations

Reconstructing Humans with a Biomechanically Accurate Skeleton

CVPR 2025
0
citations

Long-Term Visual Localization With Mobile Sensors

CVPR 2023 (arXiv)
0
citations

TensoIR: Tensorial Inverse Rendering

CVPR 2023 (arXiv)
0
citations

Learning Neural Volumetric Representations of Dynamic Humans in Minutes

CVPR 2023 (arXiv)
0
citations

AutoRecon: Automated 3D Object Discovery and Reconstruction

CVPR 2023 (arXiv)
0
citations

Single Image Pop-Up From Discriminatively Learned Parts

ICCV 2015
0
citations

Multi-Image Matching via Fast Alternating Minimization

ICCV 2015
0
citations

Fast Multi-Image Matching via Density-Based Clustering

ICCV 2017
0
citations

Prior Guided Dropout for Robust Visual Localization in Dynamic Environments

ICCV 2019
0
citations

Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies

ICCV 2021 (arXiv)
0
citations

You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking

ICCV 2021
0
citations

Ponder: Point Cloud Pre-training via Neural Rendering

ICCV 2023 (arXiv)
0
citations

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models

ICCV 2023
0
citations

Deep Active Contours for Real-time 6-DoF Object Tracking

ICCV 2023
0
citations

Learning Feature Descriptors using Camera Pose Supervision

ECCV 2020
0
citations

Motion Capture from Internet Videos

ECCV 2020
0
citations

Representing Volumetric Videos As Dynamic MLP Maps

CVPR 2023 (arXiv)
0
citations

LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation

CVPR 2025
0
citations

Glossy Object Reconstruction with Cost-effective Polarized Acquisition

CVPR 2025
0
citations

StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models

CVPR 2025
0
citations

ReTracker: Exploring Image Matching for Robust Online Any Point Tracking

ICCV 2025
0
citations

SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion

ICCV 2025
0
citations

MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space

ICCV 2025
0
citations

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

ICCV 2025
0
citations

Precise Action-to-Video Generation Through Visual Action Prompts

ICCV 2025
0
citations

UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction

ICCV 2025
0
citations

ERNet: Efficient Non-Rigid Registration Network for Point Sequences

ICCV 2025
0
citations

GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs

NeurIPS 2019
0
citations

TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies

NeurIPS 2022
0
citations

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

NeurIPS 2022
0
citations

Compact Neural Volumetric Video Representations with Dynamic Codebooks

NeurIPS 2023
0
citations