Zhiguo Cao

50
Papers
282
Total Citations

Papers (50)

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

ICLR 2024
118
citations

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

ECCV 2020
80
citations

Vision Transformer Off-the-Shelf: A Surprising Baseline for Few-Shot Class-Agnostic Counting

AAAI 2024 (arXiv)
28
citations

Unifying Automatic and Interactive Matting with Pretrained ViTs

CVPR 2024
14
citations

3D Multi-frame Fusion for Video Stabilization

CVPR 2024
13
citations

Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

AAAI 2024 (arXiv)
10
citations

In-Context Matting

CVPR 2024
6
citations

Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

CVPR 2024
6
citations

Training Matting Models Without Alpha Labels

AAAI 2025
4
citations

DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting

CVPR 2025
3
citations

Monocular Relative Depth Perception With Web Stereo Data Supervision

CVPR 2018
0
citations

NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences

CVPR 2019
0
citations

Structure-Guided Ranking Loss for Single Image Depth Prediction

CVPR 2020
0
citations

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

CVPR 2020 (arXiv)
0
citations

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

CVPR 2020 (arXiv)
0
citations

Composing Photos Like a Photographer

CVPR 2021
0
citations

Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting

CVPR 2022 (arXiv)
0
citations

BokehMe: When Neural Rendering Meets Classical Rendering

CVPR 2022
0
citations

3D Cinemagraphy From a Single Image

CVPR 2023 (arXiv)
0
citations

Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation

CVPR 2023
0
citations

Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video

CVPR 2023 (arXiv)
0
citations

A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image

CVPR 2023
0
citations

When Unsupervised Domain Adaptation Meets Tensor Representations

ICCV 2017 (arXiv)
0
citations

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image

ICCV 2019
0
citations

From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer

ICCV 2019
0
citations

TransView: Inside, Outside, and Across the Cropping View Boundaries

ICCV 2021
0
citations

Neural Video Depth Stabilizer

ICCV 2023 (arXiv)
0
citations

Point-Query Quadtree for Crowd Counting, Localization, and More

ICCV 2023 (arXiv)
0
citations

CH3Depth: Efficient and Flexible Depth Foundation Model with Flow Matching

CVPR 2025
0
citations

Fast Full-frame Video Stabilization with Iterative Optimization

ICCV 2023 (arXiv)
0
citations

When Epipolar Constraint Meets Non-Local Operators in Multi-View Stereo

ICCV 2023
0
citations

Constraining Depth Map Geometry for Multi-View Stereo: A Dual-Depth Approach with Saddle-shaped Depth Cells

ICCV 2023 (arXiv)
0
citations

Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction

ECCV 2020
0
citations

Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

ECCV 2020
0
citations

C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation

ECCV 2022
0
citations

MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects

ECCV 2022
0
citations

Robust Object Detection with Inaccurate Bounding Boxes

ECCV 2022
0
citations

FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling

ECCV 2022
0
citations

3D Instances as 1D Kernels

ECCV 2022
0
citations

Learning to Upsample by Learning to Sample

ICCV 2023 (arXiv)
0
citations

TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion

CVPR 2025
0
citations

WildAvatar: Learning In-the-wild 3D Avatars from the Web

CVPR 2025
0
citations

Exploring Contextual Attribute Density in Referring Expression Counting

CVPR 2025
0
citations

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

ICCV 2025
0
citations

MuGS: Multi-Baseline Generalizable Gaussian Splatting Reconstruction

ICCV 2025
0
citations

SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement

ICCV 2025
0
citations

S-DyRF: Reference-Based Stylized Radiance Fields for Dynamic Scenes

CVPR 2024
0
citations

Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations

CVPR 2024
0
citations

DyBluRF: Dynamic Neural Radiance Fields from Blurry Monocular Video

CVPR 2024
0
citations

SAPA: Similarity-Aware Point Affiliation for Feature Upsampling

NeurIPS 2022
0
citations