Xingyi Zhou
5
Papers
27
Total Citations
Papers (5)
Distilling Vision-Language Models on Millions of Videos
CVPR 2024
20
citations
Dense Video Object Captioning from Disjoint Supervision
ICLR 2025, arXiv
7
citations
Visual Lexicon: Rich Image Features in Language Space
CVPR 2025
0
citations
Streaming Dense Video Captioning
CVPR 2024
0
citations
Pixel-Aligned Language Model
CVPR 2024
0
citations