Zhiguo Cao
50
Papers
282
Total Citations
Papers (50)
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
ICLR 2024
118
citations
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
ECCV 2020
80
citations
Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting
AAAI 2024
28
citations
Unifying Automatic and Interactive Matting with Pretrained ViTs
CVPR 2024
14
citations
3D Multi-frame Fusion for Video Stabilization
CVPR 2024
13
citations
Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix
AAAI 2024
10
citations
In-Context Matting
CVPR 2024
6
citations
Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields
CVPR 2024
6
citations
Training Matting Models Without Alpha Labels
AAAI 2025
4
citations
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
CVPR 2025
3
citations
Monocular Relative Depth Perception With Web Stereo Data Supervision
CVPR 2018
0
citations
NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences
CVPR 2019
0
citations
Structure-Guided Ranking Loss for Single Image Depth Prediction
CVPR 2020
0
citations
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video
CVPR 2020
0
citations
P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
CVPR 2020
0
citations
Composing Photos Like a Photographer
CVPR 2021
0
citations
Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting
CVPR 2022
0
citations
BokehMe: When Neural Rendering Meets Classical Rendering
CVPR 2022
0
citations
3D Cinemagraphy From a Single Image
CVPR 2023
0
citations
Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation
CVPR 2023
0
citations
Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video
CVPR 2023
0
citations
A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image
CVPR 2023
0
citations
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017
0
citations
A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image
ICCV 2019
0
citations
From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer
ICCV 2019
0
citations
TransView: Inside, Outside, and Across the Cropping View Boundaries
ICCV 2021
0
citations
Neural Video Depth Stabilizer
ICCV 2023
0
citations
Point-Query Quadtree for Crowd Counting, Localization, and More
ICCV 2023
0
citations
CH3Depth: Efficient and Flexible Depth Foundation Model with Flow Matching
CVPR 2025
0
citations
Fast Full-frame Video Stabilization with Iterative Optimization
ICCV 2023
0
citations
When Epipolar Constraint Meets Non-Local Operators in Multi-View Stereo
ICCV 2023
0
citations
Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells
ICCV 2023
0
citations
Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction
ECCV 2020
0
citations
Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction
ECCV 2020
0
citations
C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation
ECCV 2022
0
citations
MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects
ECCV 2022
0
citations
Robust Object Detection with Inaccurate Bounding Boxes
ECCV 2022
0
citations
FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling
ECCV 2022
0
citations
3D Instances as 1D Kernels
ECCV 2022
0
citations
Learning to Upsample by Learning to Sample
ICCV 2023
0
citations
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion
CVPR 2025
0
citations
WildAvatar: Learning In-the-wild 3D Avatars from the Web
CVPR 2025
0
citations
Exploring Contextual Attribute Density in Referring Expression Counting
CVPR 2025
0
citations
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
ICCV 2025
0
citations
MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction
ICCV 2025
0
citations
SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement
ICCV 2025
0
citations
S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes
CVPR 2024
0
citations
Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations
CVPR 2024
0
citations
DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video
CVPR 2024
0
citations
SAPA: Similarity-Aware Point Affiliation for Feature Upsampling
NeurIPS 2022
0
citations