ICCV 2025 "vision-language pretraining" Papers
3 papers found
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Wenqi Zhang, Hang Zhang, Xin Li et al.
ICCV 2025 | highlight | arXiv:2501.00958
5 citations
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Ming Hu, Kun Yuan, Yaling Shen et al.
ICCV 2025 | poster | arXiv:2411.15421
24 citations
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Yue Li, Qi Ma, Runyi Yang et al.
ICCV 2025 | poster | arXiv:2503.18052
21 citations