2025 "audio-language models" Papers
4 papers found
ADIFF: Explaining audio difference using natural language
Soham Deshmukh, Shuo Han, Rita Singh et al.
ICLR 2025 | poster | arXiv:2502.04476
9 citations
ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models
Weifei Jin, Yuxin Cao, Junjie Su et al.
NeurIPS 2025 | poster | arXiv:2510.26096
Mellow: a small audio language model for reasoning
Soham Deshmukh, Satvik Dixit, Rita Singh et al.
NeurIPS 2025 | poster | arXiv:2503.08540
17 citations
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Ziyang Ma, Yinghao Ma, Yanqiao Zhu et al.
NeurIPS 2025 | poster | arXiv:2505.13032
52 citations