2025 Poster "modality bias" Papers
2 papers found
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin, Chankyu Lee, Mohammad Shoeybi et al.
ICLR 2025, poster, arXiv:2411.02571
78 citations
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Saksham Singh Kushwaha, Yapeng Tian
CVPR 2025, poster, arXiv:2412.10768
12 citations