"long-form video processing" Papers
2 papers found
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
Yudong Liu, Jingwei Sun, Yueqian Lin et al.
ICCV 2025posterarXiv:2503.10742
6
citations
Text-Conditioned Resampler For Long Form Video Understanding
Bruno Korbar, Yongqin Xian, Alessio Tonioni et al.
ECCV 2024posterarXiv:2312.11897
24
citations