2025 Oral "temporal grounding" Papers
2 papers found
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
Sanjoy Chowdhury, Mohamed Elmoghany, Yohan Abeysinghe et al.
NEURIPS 2025 | oral | arXiv:2506.07016
5 citations
Tracking and Understanding Object Transformations
Yihong Sun, Xinyu Yang, Jennifer Sun et al.
NEURIPS 2025 | oral | arXiv:2511.04678