Fan Zhang

35
Papers
1,482
Total Citations

Papers (35)

VBench: Comprehensive Benchmark Suite for Video Generative Models

CVPR 2024
996
citations

Generative Multimodal Models are In-Context Learners

CVPR 2024
422
citations

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

CVPR 2024
31
citations

HIIF: Hierarchical Encoding based Implicit Image Function for Continuous Super-resolution

CVPR 2025
10
citations

UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

CVPR 2025
8
citations

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

NeurIPS 2025
7
citations

Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval

CVPR 2024
4
citations

HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly

ICCV 2025
3
citations

CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering

ICCV 2025
1
citations

DREAM: Decoupled Discriminative Learning with Bigraph-aware Alignment for Semi-supervised 2D-3D Cross-modal Retrieval

AAAI 2025
0
citations

LDMVFI: Video Frame Interpolation with Latent Diffusion Models

AAAI 2024
0
citations

LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content

CVPR 2024
0
citations

CapsFusion: Rethinking Image-Text Data at Scale

CVPR 2024
0
citations

Casual Stereoscopic Panorama Stitching

CVPR 2015
0
citations

Fusing Subcategory Probabilities for Texture Classification

CVPR 2015
0
citations

High-Speed Tracking With Multi-Kernel Correlation Filters

CVPR 2018
0
citations

Noise-Tolerant Paradigm for Training Face Recognition CNNs

CVPR 2019
0
citations

Unsupervised Instance Segmentation in Microscopy Images via Panoptic Domain Adaptation and Task Re-Weighting

CVPR 2020
0
citations

Learning Temporal Consistency for Low Light Video Enhancement From Single Images

CVPR 2021
0
citations

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

CVPR 2022
0
citations

Locally-Transferred Fisher Vectors for Texture Classification

ICCV 2017
0
citations

ACFNet: Attentional Class Feature Network for Semantic Segmentation

ICCV 2019
0
citations

MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition

ICCV 2023
0
citations

Learning Rain Location Prior for Nighttime Deraining

ICCV 2023
0
citations

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos

CVPR 2025
0
citations

Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning

CVPR 2025
0
citations

GauUpdate: New Object Insertion in 3D Gaussian Fields with Consistent Global Illumination

ICCV 2025
0
citations

GIViC: Generative Implicit Video Compression

ICCV 2025
0
citations

AdaptiveAE: An Adaptive Exposure Strategy for HDR Capturing in Dynamic Scenes

ICCV 2025
0
citations

OneGT: One-Shot Geometry-Texture Neural Rendering for Head Avatars

ICCV 2025
0
citations

Blind Video Super-Resolution based on Implicit Kernels

ICCV 2025
0
citations

PNVC: Towards Practical INR-based Video Compression

AAAI 2025
0
citations

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

AAAI 2025
0
citations

Distributionally Robust Local Non-parametric Conditional Estimation

NeurIPS 2020
0
citations

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

NeurIPS 2023
0
citations