Fan Ma
15
Papers
200
Total Citations
Papers (15)
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
CVPR 2024
109
citations
Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval
CVPR 2024
45
citations
LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels
CVPR 2024
19
citations
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion
AAAI 2025
8
citations
Clustering for Protein Representation Learning
CVPR 2024
8
citations
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
CVPR 2025
8
citations
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
AAAI 2025
3
citations
Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
CVPR 2025
0
citations
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
0
citations
From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment
ICCV 2025
0
citations
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation
ICCV 2025
0
citations
BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
AAAI 2025
0
citations
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
AAAI 2024
0
citations
CapHuman: Capture Your Moments in Parallel Universes
CVPR 2024
0
citations
Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity
CVPR 2024
0
citations