"multimodal pre-training" Papers
2 papers found
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation
Kun Su, Xiulong Liu, Eli Shlizerman
ICML 2024, poster
Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception
Xiao Wang, Wentao Wu, Chenglong Li et al.
AAAI 2024, paper, arXiv:2312.09812
7 citations