Bo Zhao
8
Papers
288
Total Citations
Papers (8)
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
CVPR 2025
142
citations
MLVU: Benchmarking Multi-task Long Video Understanding
CVPR 2025
89
citations
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
ICCV 2025
36
citations
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
CVPR 2025
19
citations
BOOD: Boundary-based Out-Of-Distribution Data Generation
ICML 2025
2
citations
SEGA: A Stepwise Evolution Paradigm for Content-Aware Layout Generation with Design Prior
ICCV 2025
0
citations
MMCR: Benchmarking Cross-Source Reasoning in Scientific Papers
ICCV 2025
0
citations
Towards Universal Dataset Distillation via Task-Driven Diffusion
CVPR 2025
0
citations