by Ethan Chen Papers
2 papers found
MUVR: A Multi-Modal Untrimmed Video Retrieval Benchmark with Multi-Level Visual Correspondence
Yue Feng, Jinwei Hu, Qijia Lu et al.
NEURIPS 2025posterarXiv:2510.21406
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
Cheol Jun Cho, Nicholas Lee, Akshat Gupta et al.
ICLR 2025posterarXiv:2410.07168
15
citations