ICLR Poster "multimodal agents" Papers
2 papers found
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
CHEN CHEN, Yuchen Hu, Siyin Wang et al.
ICLR 2025posterarXiv:2501.17202
22
citations
VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Lawrence Jang, Yinheng Li, Dan Zhao et al.
ICLR 2025posterarXiv:2410.19100
26
citations