ICCV 2025 "multi-image understanding" Papers
2 papers found
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
Han Wang, Yuxiang Nie, Yongjie Ye et al.
ICCV 2025posterarXiv:2412.09530
15
citations
VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models
JIACHENG RUAN, Wenzhen Yuan, Xian Gao et al.
ICCV 2025posterarXiv:2503.07478
15
citations