CVPR 2025 "long-form video understanding" Papers
2 papers found
BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
Shuming Liu, Chen Zhao, Tianqi Xu et al.
CVPR 2025posterarXiv:2503.21483
26
citations
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Zeyi Huang, Yuyang Ji, Xiaofang Wang et al.
CVPR 2025posterarXiv:2501.04336
7
citations