Hehe Fan
22
Papers
81
Total Citations
Papers (22)
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
41
citations
BVINet: Unlocking Blind Video Inpainting with Zero Annotations
ICCV 2025
12
citations
EnergyMoGen: Compositional Human Motion Generation with Energy-Based Diffusion Model in Latent Space
CVPR 2025
10
citations
Clustering for Protein Representation Learning
CVPR 2024
8
citations
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
AAAI 2024arXiv
7
citations
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning
AAAI 2025
2
citations
Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
CVPR 2025
1
citations
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation With Reliable Voted Pseudo Labels
CVPR 2022
0
citations
PointListNet: Deep Learning on 3D Point Lists
CVPR 2023
0
citations
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
ICCV 2017
0
citations
Attract or Distract: Exploit the Margin of Open Set
ICCV 2019
0
citations
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition
ICCV 2023arXiv
0
citations
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
ICCV 2023arXiv
0
citations
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos
ICCV 2023arXiv
0
citations
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
CVPR 2025
0
citations
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction
ECCV 2022
0
citations
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
ICCV 2025
0
citations
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
0
citations
DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding
AAAI 2024
0
citations
Uncovering What Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
CVPR 2024
0
citations
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
ICML 2024
0
citations
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos
CVPR 2021
0
citations