2024 "vision-language model" Papers
5 papers found
Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing
SI-QI LIU, Qirui Wang, Pong Chi Yuen
ECCV 2024poster
8
citations
Dolphins: Multimodal Language Model for Driving
Yingzi Ma, Yulong Cao, Jiachen Sun et al.
ECCV 2024posterarXiv:2312.00438
126
citations
Image Fusion via Vision-Language Model
Zixiang Zhao, Lilun Deng, Haowen Bai et al.
ICML 2024posterarXiv:2402.02235
PALM: Predicting Actions through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian et al.
ECCV 2024posterarXiv:2311.17944
22
citations
Retrieval Across Any Domains via Large-scale Pre-trained Model
Jiexi Yan, Zhihui Yin, Chenghao Xu et al.
ICML 2024poster