2025 "video language models" Papers
3 papers found
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Pritam Sarkar, Ali Etemad
NEURIPS 2025oralarXiv:2504.12083
2
citations
Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
Ye Wang, Ziheng Wang, Boshen Xu et al.
NEURIPS 2025oralarXiv:2503.13377
42
citations
Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding
Xiaoqian Shen, Wenxuan Zhang, Jun Chen et al.
NEURIPS 2025oralarXiv:2510.14032
6
citations