Xiaowei Zhou
81
Papers
555
Total Citations
Papers (81)
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed
CVPR 2024
142
citations
SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
ECCV 2020
102
citations
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
CVPR 2025
92
citations
IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination
ECCV 2024arXiv
54
citations
Generating Human Motion in 3D Scenes from Text Descriptions
CVPR 2024
46
citations
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
CVPR 2025
44
citations
SAM-guided Graph Cut for 3D Instance Segmentation
ECCV 2024
32
citations
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
16
citations
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction
CVPR 2025
15
citations
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
CVPR 2025
11
citations
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation
ICCV 2025
1
citations
EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds
ICCV 2025
0
citations
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction
ICCV 2025
0
citations
Motion-2-to-3: Leveraging 2D Motion Data for 3D Motion Generations
ICCV 2025
0
citations
EfficientDreamer: High-Fidelity and Robust 3D Creation via Orthogonal-view Diffusion Priors
CVPR 2024
0
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
Relightable and Animatable Neural Avatar from Sparse-View Video
CVPR 2024
0
citations
SpatialTracker: Tracking Any 2D Pixels in 3D Space
CVPR 2024
0
citations
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
0
citations
Detector-Free Structure from Motion
CVPR 2024
0
citations
3D Shape Estimation From 2D Landmarks: A Convex Relaxation Approach
CVPR 2015
0
citations
Sparseness Meets Deepness: 3D Human Pose Estimation From Monocular Video
CVPR 2016
0
citations
Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations
CVPR 2017arXiv
0
citations
Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose
CVPR 2017arXiv
0
citations
Learning to Estimate 3D Human Pose and Shape From a Single Color Image
CVPR 2018arXiv
0
citations
Multi-Image Semantic Matching by Mining Consistent Features
CVPR 2018arXiv
0
citations
Ordinal Depth Supervision for 3D Human Pose Estimation
CVPR 2018arXiv
0
citations
Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion
CVPR 2019
0
citations
PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation
CVPR 2019
0
citations
Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views
CVPR 2019
0
citations
Learning Transformation Synchronization
CVPR 2019
0
citations
Path-Invariant Map Networks
CVPR 2019
0
citations
Coherent Reconstruction of Multiple Humans From a Single Image
CVPR 2020arXiv
0
citations
Deep Snake for Real-Time Instance Segmentation
CVPR 2020arXiv
0
citations
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
CVPR 2020
0
citations
Reconstructing 3D Human Pose by Watching Humans in the Mirror
CVPR 2021arXiv
0
citations
Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans
CVPR 2021arXiv
0
citations
VS-Net: Voting With Segmentation for Visual Localization
CVPR 2021
0
citations
LoFTR: Detector-Free Local Feature Matching With Transformers
CVPR 2021arXiv
0
citations
NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video
CVPR 2021arXiv
0
citations
Neural Rays for Occlusion-Aware Image-Based Rendering
CVPR 2022arXiv
0
citations
Neural 3D Scene Reconstruction With the Manhattan-World Assumption
CVPR 2022arXiv
0
citations
Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
CVPR 2022arXiv
0
citations
Modeling Indirect Illumination for Inverse Rendering
CVPR 2022arXiv
0
citations
OnePose: One-Shot Object Pose Estimation Without CAD Models
CVPR 2022arXiv
0
citations
PlanarRecon: Real-Time 3D Plane Detection and Reconstruction From Posed Monocular Videos
CVPR 2022
0
citations
Ray Priors Through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation
CVPR 2022arXiv
0
citations
Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask
CVPR 2023arXiv
0
citations
Neural Scene Chronology
CVPR 2023
0
citations
Learning Human Mesh Recovery in 3D Scenes
CVPR 2023
0
citations
Reconstructing Humans with a Biomechanically Accurate Skeleton
CVPR 2025
0
citations
Long-Term Visual Localization With Mobile Sensors
CVPR 2023arXiv
0
citations
TensoIR: Tensorial Inverse Rendering
CVPR 2023arXiv
0
citations
Learning Neural Volumetric Representations of Dynamic Humans in Minutes
CVPR 2023arXiv
0
citations
AutoRecon: Automated 3D Object Discovery and Reconstruction
CVPR 2023arXiv
0
citations
Single Image Pop-Up From Discriminatively Learned Parts
ICCV 2015
0
citations
Multi-Image Matching via Fast Alternating Minimization
ICCV 2015
0
citations
Fast Multi-Image Matching via Density-Based Clustering
ICCV 2017
0
citations
Prior Guided Dropout for Robust Visual Localization in Dynamic Environments
ICCV 2019
0
citations
Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies
ICCV 2021arXiv
0
citations
You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking
ICCV 2021
0
citations
Ponder: Point Cloud Pre-training via Neural Rendering
ICCV 2023arXiv
0
citations
Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
ICCV 2023
0
citations
Deep Active Contours for Real-time 6-DoF Object Tracking
ICCV 2023
0
citations
Learning Feature Descriptors using Camera Pose Supervision
ECCV 2020
0
citations
Motion Capture from Internet Videos
ECCV 2020
0
citations
Representing Volumetric Videos As Dynamic MLP Maps
CVPR 2023arXiv
0
citations
LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation
CVPR 2025
0
citations
Glossy Object Reconstruction with Cost-effective Polarized Acquisition
CVPR 2025
0
citations
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
CVPR 2025
0
citations
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking
ICCV 2025
0
citations
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion
ICCV 2025
0
citations
MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
ICCV 2025
0
citations
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
ICCV 2025
0
citations
Precise Action-to-Video Generation Through Visual Action Prompts
ICCV 2025
0
citations
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction
ICCV 2025
0
citations
ERNet: Efficient Non-Rigid Registration Network for Point Sequences
ICCV 2025
0
citations
GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs
NeurIPS 2019
0
citations
TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies
NeurIPS 2022
0
citations
OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models
NeurIPS 2022
0
citations
Compact Neural Volumetric Video Representations with Dynamic Codebooks
NeurIPS 2023
0
citations