NeurIPS "multi-modal large language models" Papers
4 papers found
EndoBench: A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis
Shengyuan Liu, Boyun Zheng, Wenting Chen et al.
NeurIPS 2025, poster, arXiv:2505.23601
9 citations
HOComp: Interaction-Aware Human-Object Composition
Dong Liang, Jinyuan Jia, Yuhao LIU et al.
NeurIPS 2025, poster, arXiv:2507.16813
RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts
Xuming He, Zhiyuan You, Junchao Gong et al.
NeurIPS 2025, poster, arXiv:2508.12291
4 citations
To Think or Not To Think: A Study of Thinking in Rule-Based Visual Reinforcement Fine-Tuning
Ming Li, Jike Zhong, Shitian Zhao et al.
NeurIPS 2025, spotlight