NEURIPS 2025 "large multimodal models" Papers
7 papers found
Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind
Qingmei Li, Yang Zhang, Zurong Mai et al.
NEURIPS 2025posterarXiv:2505.12207
1
citations
ConViS-Bench: Estimating Video Similarity Through Semantic Concepts
Benedetta Liberatori, Alessandro Conti, Lorenzo Vaquero et al.
NEURIPS 2025posterarXiv:2509.19245
1
citations
FlowPrune: Accelerating Attention Flow Calculation by Pruning Flow Network
Shuo Xu, Yu Chen, Shuxia Lin et al.
NEURIPS 2025poster
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
Sanjoy Chowdhury, Mohamed Elmoghany, Yohan Abeysinghe et al.
NEURIPS 2025oralarXiv:2506.07016
5
citations
MS-Bench: Evaluating LMMs in Ancient Manuscript Study through a Dunhuang Case Study
Yuqing Zhang, Yue Han, Shuanghe Zhu et al.
NEURIPS 2025poster
Seeing the Arrow of Time in Large Multimodal Models
Zihui (Sherry) Xue, Romy Luo, Kristen Grauman
NEURIPS 2025oralarXiv:2506.03340
5
citations
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Sicong Leng, Yun Xing, Zesen Cheng et al.
NEURIPS 2025posterarXiv:2410.12787
27
citations