Fan Zhang

22
Papers
1,483
Total Citations

Papers (22)

VBench: Comprehensive Benchmark Suite for Video Generative Models

CVPR 2024
996
citations

Generative Multimodal Models are In-Context Learners

CVPR 2024
422
citations

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

CVPR 2024
31
citations

HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution

CVPR 2025arXiv
10
citations

UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

CVPR 2025
8
citations

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

NeurIPS 2025
7
citations

Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval

CVPR 2024
4
citations

HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly

ICCV 2025arXiv
3
citations

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering

ICCV 2025
1
citations

AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes

ICCV 2025arXiv
1
citations

PNVC: Towards Practical INR-based Video Compression

AAAI 2025
0
citations

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

AAAI 2025
0
citations

DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval

AAAI 2025
0
citations

LDMVFI: Video Frame Interpolation with Latent Diffusion Models

AAAI 2024arXiv
0
citations

GIViC: Generative Implicit Video Compression

ICCV 2025
0
citations

LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

CVPR 2024
0
citations

CapsFusion: Rethinking Image-Text Data at Scale

CVPR 2024
0
citations

GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination

ICCV 2025
0
citations

Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning

CVPR 2025
0
citations

OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars

ICCV 2025
0
citations

Blind Video Super-Resolution based on Implicit Kernels

ICCV 2025
0
citations

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos

CVPR 2025
0
citations