Zongyang Ma
Papers: 4
Total Citations: 11
Papers (4)
EA-VTR: Event-Aware Video-Text Retrieval
ECCV 2024, arXiv
7 citations
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
NeurIPS 2025, arXiv
4 citations
VisionMath: Vision-Form Mathematical Problem-Solving
ICCV 2025
0 citations
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
CVPR 2024
0 citations