CVPR Poster "cross-modal alignment" Papers
2 papers found
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment
Yan Li, Yifei Xing, Xiangyuan Lan et al.
CVPR 2025posterarXiv:2412.00833
17
citations
It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data
Dominik Schnaus, Nikita Araslanov, Daniel Cremers
CVPR 2025posterarXiv:2503.24129
6
citations