Oral "video temporal grounding" Papers
2 papers found
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
Zeqian Li, Shangzhe Di, Zhonghua Zhai et al.
NeurIPS 2025oralarXiv:2506.18883
9
citations
When One Moment Isn't Enough: Multi-Moment Retrieval with Cross-Moment Interactions
Zhuo Cao, Heming Du, Bingqing Zhang et al.
NeurIPS 2025oralarXiv:2510.17218