"multimodal fusion" Papers
10 papers found
AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment
Yan Li, Yifei Xing, Xiangyuan Lan et al.
CVPR 2025posterarXiv:2412.00833
17
citations
Can We Talk Models Into Seeing the World Differently?
Paul Gavrikov, Jovita Lukasik, Steffen Jung et al.
ICLR 2025posterarXiv:2403.09193
15
citations
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale
ZeMing Gong, Austin Wang, Xiaoliang Huo et al.
ICLR 2025posterarXiv:2405.17537
18
citations
Multimodal LiDAR-Camera Novel View Synthesis with Unified Pose-free Neural Fields
Weiyi Xue, Fan Lu, Yunwei Zhu et al.
NeurIPS 2025poster
Debiasing Multimodal Sarcasm Detection with Contrastive Learning
Mengzhao Jia, Can Xie, Liqiang Jing
AAAI 2024paperarXiv:2312.10493
43
citations
Exploiting Polarized Material Cues for Robust Car Detection
Wen Dong, Haiyang Mei, Ziqi Wei et al.
AAAI 2024paperarXiv:2401.02606
7
citations
Frequency Spectrum Is More Effective for Multimodal Representation and Fusion: A Multimodal Spectrum Rumor Detector
An Lao, Qi Zhang, Chongyang Shi et al.
AAAI 2024paperarXiv:2312.11023
38
citations
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
Ding Jia, Jianyuan Guo, Kai Han et al.
ICML 2024poster
Multimodal Prototyping for cancer survival prediction
Andrew Song, Richard Chen, Guillaume Jaume et al.
ICML 2024poster
Predictive Dynamic Fusion
Bing Cao, Yinan Xia, Yi Ding et al.
ICML 2024poster