Zongyang Ma
6
Papers
4
Total Citations
Papers (6)
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
NeurIPS 2025arXiv
4
citations
VisionMath: Vision-Form Mathematical Problem-Solving
ICCV 2025
0
citations
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
CVPR 2024
0
citations
Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation
CVPR 2022arXiv
0
citations
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval
CVPR 2023
0
citations
Order-Prompted Tag Sequence Generation for Video Tagging
ICCV 2023
0
citations