"modality alignment" Papers
11 papers found
Gramian Multimodal Representation Learning and Alignment
Giordano Cicchetti, Eleonora Grassucci, Luigi Sigillo et al.
ICLR 2025posterarXiv:2412.11959
29
citations
Learning Fine-Grained Representations through Textual Token Disentanglement in Composed Video Retrieval
Yue Wu, Zhaobo Qi, Yiling Wu et al.
ICLR 2025poster
7
citations
Learning Source-Free Domain Adaptation for Visible-Infrared Person Re-Identification
Yongxiang Li, Yanglin Feng, Yuan Sun et al.
NeurIPS 2025poster
Multi-modal Learning: A Look Back and the Road Ahead
Divyam Madaan, Sumit Chopra, Kyunghyun Cho
ICLR 2025poster
Multimodal Tabular Reasoning with Privileged Structured Information
Jun-Peng Jiang, Yu Xia, Hai-Long Sun et al.
NeurIPS 2025posterarXiv:2506.04088
6
citations
One Filters All: A Generalist Filter For State Estimation
Shiqi Liu, Wenhan Cao, Chang Liu et al.
NeurIPS 2025posterarXiv:2509.20051
2
citations
Vocabulary-Guided Gait Recognition
Panjian Huang, Saihui Hou, Chunshui Cao et al.
NeurIPS 2025poster
CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing
Faegheh Sardari, Armin Mustafa, Philip JB Jackson et al.
ECCV 2024posterarXiv:2405.10690
10
citations
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang, Ke Yu, Siqi Wu et al.
ECCV 2024posterarXiv:2407.02350
6
citations
Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images
Jun-Peng Jiang, Han-Jia Ye, Leye Wang et al.
ICML 2024spotlight
Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition
Qianrui Zhou, Hua Xu, Hao Li et al.
AAAI 2024paperarXiv:2312.14667
33
citations