2025 "audio description generation" Papers
3 papers found
Contextual AD Narration with Interleaved Multimodal Sequence
Hanlin Wang, Zhan Tong, Kecheng Zheng et al.
CVPR 2025posterarXiv:2403.12922
7
citations
DistinctAD: Distinctive Audio Description Generation in Contexts
Bo Fang, Wenhao Wu, Qiangqiang Wu et al.
CVPR 2025highlightarXiv:2411.18180
4
citations
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation
Junyu Xie, Tengda Han, Max Bain et al.
ICCV 2025posterarXiv:2504.01020
3
citations