Zhengyang Liang
5
Papers
268
Total Citations
Papers (5)
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
CVPR 2025
142
citations
MLVU: Benchmarking Multi-task Long Video Understanding
CVPR 2025
89
citations
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
CVPR 2025
19
citations
Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search
AAAI 2024
16
citations
MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval
NeurIPS 2025arXiv
2
citations