2025 "video question-answering" Papers
3 papers found
Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding
Weiyu Guo, Ziyang Chen, Shaoguang WANG et al.
NeurIPS 2025oralarXiv:2503.13139
18
citations
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Pritam Sarkar, Ali Etemad
NeurIPS 2025oralarXiv:2504.12083
2
citations
Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames
Anurag Arnab, Ahmet Iscen, Mathilde Caron et al.
NeurIPS 2025oralarXiv:2507.02001
8
citations