"image-language pretraining" Papers
2 papers found
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker, Letian Jiang, Chen Zhao et al.
CVPR 2025posterarXiv:2504.00527
3
citations
Region-centric Image-Language Pretraining for Open-Vocabulary Detection
Dahun Kim, Anelia Angelova, Weicheng Kuo
ECCV 2024posterarXiv:2310.00161
6
citations