Poster "cross-modal integration" Papers
4 papers found
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Fanding Huang, Jingyan Jiang, Qinting Jiang et al.
CVPR 2025posterarXiv:2503.23388
2
citations
Modality-Specialized Synergizers for Interleaved Vision-Language Generalists
Zhiyang Xu, Minqian Liu, Ying Shen et al.
ICLR 2025posterarXiv:2407.03604
8
citations
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu, Xicheng Gong, Yadong Mu
ICCV 2025posterarXiv:2510.16457
Universal Visuo-Tactile Video Understanding for Embodied Interaction
Yifan Xie, Mingyang Li, Shoujie Li et al.
NEURIPS 2025posterarXiv:2505.22566
2
citations