2024 "modality gap" Papers
6 papers found
DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval
Xiangpeng Yang, Linchao Zhu, Xiaohan Wang et al.
AAAI 2024, paper, arXiv:2401.10588
Improving Cross-Modal Alignment with Synthetic Pairs for Text-Only Image Captioning
Zhiyue Liu, Jinyuan Liu, Fanrong Ma
AAAI 2024, paper, arXiv:2312.08865
20 citations
Improving Medical Multi-modal Contrastive Learning with Expert Annotations
Yogesh Kumar, Pekka Marttinen
ECCV 2024, poster, arXiv:2403.10153
23 citations
Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition
Yicheng Liu, Jie Wen, Chengliang Liu et al.
ICML 2024, poster
Learning Modality Knowledge Alignment for Cross-Modality Transfer
Wenxuan Ma, Shuang Li, Lincan Cai et al.
ICML 2024, poster
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
Haimei Zhao, Qiming Zhang, Shanshan Zhao et al.
AAAI 2024, paper, arXiv:2303.16818
24 citations