"multimodal retrieval" Papers
4 papers found
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin, Chankyu Lee, Mohammad Shoeybi et al.
ICLR 2025 (poster), arXiv:2411.02571
78 citations
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing
Weihan Xu, Yimeng Ma, Jingyue Huang et al.
NeurIPS 2025 (poster), arXiv:2505.18880
1 citation
Vision-Language Models Do Not Understand Negation
Kumail Alhamoud, Shaden Alshammari, Yonglong Tian et al.
CVPR 2025 (poster), arXiv:2501.09425
36 citations
Context-I2W: Mapping Images to Context-Dependent Words for Accurate Zero-Shot Composed Image Retrieval
Yuanmin Tang, Jing Yu, Keke Gai et al.
AAAI 2024 (paper), arXiv:2309.16137
57 citations