"medical vision-language models" Papers
2 papers found
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Yunfei Xie, Ce Zhou, Lang Gao et al.
ICLR 2025posterarXiv:2408.02900
70
citations
WSI-LLaVA: A Multimodal Large Language Model for Whole Slide Image
Yuci Liang, Xinheng Lyu, Meidan Ding et al.
ICCV 2025posterarXiv:2412.02141
10
citations