NEURIPS 2025 "video captioning" Papers
2 papers found
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Jang Hyun Cho, Andrea Madotto, Effrosyni Mavroudi et al.
NEURIPS 2025oralarXiv:2504.13180
40
citations
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation
Wenhao Wang, Yi Yang
NEURIPS 2025posterarXiv:2503.01739
10
citations