Fan Zhang
22
Papers
1,483
Total Citations
Papers (22)
VBench: Comprehensive Benchmark Suite for Video Generative Models
CVPR 2024
996
citations
Generative Multimodal Models are In-Context Learners
CVPR 2024
422
citations
Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion
CVPR 2024
31
citations
HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution
CVPR 2025arXiv
10
citations
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
CVPR 2025
8
citations
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models
NeurIPS 2025
7
citations
Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval
CVPR 2024
4
citations
HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly
ICCV 2025arXiv
3
citations
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering
ICCV 2025
1
citations
AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes
ICCV 2025arXiv
1
citations
PNVC: Towards Practical INR-based Video Compression
AAAI 2025
0
citations
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
AAAI 2025
0
citations
DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval
AAAI 2025
0
citations
LDMVFI: Video Frame Interpolation with Latent Diffusion Models
AAAI 2024arXiv
0
citations
GIViC: Generative Implicit Video Compression
ICCV 2025
0
citations
LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content
CVPR 2024
0
citations
CapsFusion: Rethinking Image-Text Data at Scale
CVPR 2024
0
citations
GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination
ICCV 2025
0
citations
Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
CVPR 2025
0
citations
OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars
ICCV 2025
0
citations
Blind Video Super-Resolution based on Implicit Kernels
ICCV 2025
0
citations
HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos
CVPR 2025
0
citations