Paper "vision-language pretrained models" Papers
2 papers found
A Multimodal, Multi-Task Adapting Framework for Video Action Recognition
Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.
AAAI 2024paperarXiv:2401.11649
8
citations
Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior
Youngjae Cho, HeeSun Bae, Seungjae Shin et al.
AAAI 2024paperarXiv:2401.06799
9
citations