"cross-modal consistency" Papers
3 papers found
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
Xiangyu Guo, Zhanqian Wu, Kaixin Xiong et al.
NeurIPS 2025oralarXiv:2506.07497
8
citations
Concept-Guided Prompt Learning for Generalization in Vision-Language Models
Yi Zhang, Ce Zhang, Ke Yu et al.
AAAI 2024paperarXiv:2401.07457
33
citations
Unified Medical Image Pre-training in Language-Guided Common Semantic Space
Xiaoxuan He, Yifan Yang, Xinyang Jiang et al.
ECCV 2024posterarXiv:2311.14851
5
citations