"cross-modal fusion" Papers
2 papers found
Learnable Retrieval Enhanced Visual-Text Alignment and Fusion for Radiology Report Generation
Qin Zhou, Guoyan Liang, Xindi Li et al.
ICCV 2025, poster, arXiv:2507.07568
RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis
Haolin Li, Tianjie Dai, Zhe Chen et al.
NeurIPS 2025, poster, arXiv:2509.19980