"multimodal question answering" Papers
2 papers found
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
Wenbo Hu, Jia-Chen Gu, Zi-Yi Dou et al.
ICLR 2025posterarXiv:2410.08182
29
citations
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
Jiacheng Xie, Yang Yu, Ziyang Zhang et al.
NeurIPS 2025posterarXiv:2505.24063
2
citations