2024 "vision-language alignment" Papers
4 papers found
CLIM: Contrastive Language-Image Mosaic for Region Representation
Size Wu, Wenwei Zhang, Lumin XU et al.
AAAI 2024paperarXiv:2312.11376
24
citations
Multi-Task Domain Adaptation for Language Grounding with 3D Objects
Penglei SUN, Yaoxian Song, Xinglin Pan et al.
ECCV 2024posterarXiv:2407.02846
2
citations
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Meng Chu, Zhedong Zheng, Wei Ji et al.
ECCV 2024posterarXiv:2311.12751
25
citations
Weakly Supervised Open-Vocabulary Object Detection
Jianghang Lin, Yunhang Shen, Bingquan Wang et al.
AAAI 2024paperarXiv:2312.12437
16
citations