NEURIPS "multimodal benchmark" Papers
2 papers found
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou, xu yin, Yingtao Zhu et al.
NEURIPS 2025posterarXiv:2505.24173
5
citations
MMCSBench: A Fine-Grained Benchmark for Large Vision-Language Models in Camouflage Scenes
Jin Zhang, Ruiheng Zhang, Zhe Cao et al.
NEURIPS 2025poster