2025 "multimodal agents" Papers
2 papers found
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
CHEN CHEN, Yuchen Hu, Siyin Wang et al.
ICLR 2025posterarXiv:2501.17202
22
citations
MIP against Agent: Malicious Image Patches Hijacking Multimodal OS Agents
Lukas Aichberger, Alasdair Paren, Guohao Li et al.
NeurIPS 2025posterarXiv:2503.10809
10
citations