Yifei Fan
3
Papers
10
Total Citations
Papers (3)
VIXEN: Visual Text Comparison Network for Image Difference Captioning
AAAI 2024arXiv
9
citations
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
CVPR 2025
1
citations
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
ICCV 2025
0
citations