"zero-shot retrieval" Papers
8 papers found
Assessing and Learning Alignment of Unimodal Vision and Language Models
Le Zhang, Qian Yang, Aishwarya Agrawal
CVPR 2025 | highlight | arXiv:2412.04616
14 citations
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization
Michael Green, Matan Levy, Issar Tzachor et al.
NeurIPS 2025 | poster | arXiv:2503.07038
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos
Animesh Gupta, Jay Parmar, Ishan Rajendrakumar Dave et al.
NeurIPS 2025 | oral | arXiv:2506.05274
1 citation
MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
Min Yang, Zihan Jia, Zhilin Dai et al.
ICCV 2025 | poster | arXiv:2508.07312
WildSAT: Learning Satellite Image Representations from Wildlife Observations
Rangel Daroya, Elijah Cole, Oisin Mac Aodha et al.
ICCV 2025 | poster | arXiv:2412.14428
10 citations
Data Roaming and Quality Assessment for Composed Image Retrieval
Matan Levy, Rami Ben-Ari, Nir Darshan et al.
AAAI 2024 | paper | arXiv:2303.09429
Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Samuel Lavoie, Polina Kirichenko, Mark Ibrahim et al.
ICML 2024 | poster
STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment
Jaewoo Lee, Jaehong Yoon, Wonjae Kim et al.
ICML 2024 | oral