2025 "vision-language pretraining" Papers
4 papers found
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Wenqi Zhang, Hang Zhang, Xin Li et al.
ICCV 2025highlightarXiv:2501.00958
5
citations
Filter Like You Test: Data-Driven Data Filtering for CLIP Pretraining
Mikey Shechter, Yair Carmon
NEURIPS 2025posterarXiv:2503.08805
1
citations
Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents
Zhizhen Zhang, Lei Zhu, Zhen Fang et al.
NEURIPS 2025oralarXiv:2502.01218
2
citations
SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
Yue Li, Qi Ma, Runyi Yang et al.
ICCV 2025posterarXiv:2503.18052
20
citations