Chen Zhao

25
Papers
248
Total Citations

Papers (25)

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

CVPR 2025
70
citations

End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

CVPR 2024
51
citations

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

AAAI 2025
38
citations

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

ICCV 2025
22
citations

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

CVPR 2024
19
citations

TexOct: Generating Textures of 3D Models with Octree-based Diffusion

CVPR 2024
12
citations

Towards Automated Movie Trailer Generation

CVPR 2024
10
citations

Splatter-360: Generalizable 360 Gaussian Splatting for Wide-baseline Panoramic Images

CVPR 2025
10
citations

UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

NeurIPS 2025
7
citations

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

CVPR 2025
3
citations

Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

ICCV 2025
3
citations

Auto-Regressively Generating Multi-View Consistent Images

ICCV 2025
1
citation

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

ICCV 2025
1
citation

SolverLLM: Leveraging Test-Time Scaling for Optimization Problem via LLM-Guided Search

NeurIPS 2025
1
citation

BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding

CVPR 2025
0
citations

Dr2Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning

CVPR 2024
0
citations

From Zero to Detail: Deconstructing Ultra-High-Definition Image Restoration from Progressive Spectral Perspective

CVPR 2025
0
citations

DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses

CVPR 2024
0
citations

TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting

CVPR 2025
0
citations

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

CVPR 2024
0
citations

Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations

ICML 2024
0
citations

TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer

CVPR 2025
0
citations

OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction

CVPR 2025
0
citations

Metric-Agnostic Continual Learning for Sustainable Group Fairness

AAAI 2025
0
citations

Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration

CVPR 2024
0
citations