"visual-language alignment" Papers
3 papers found
Logits DeConfusion with CLIP for Few-Shot Learning
Shuo Li, Fang Liu, Zehua Hao et al.
CVPR 2025posterarXiv:2504.12104
6
citations
MUSE-VL: Modeling Unified VLM through Semantic Discrete Encoding
Rongchang Xie, Chen Du, Ping Song et al.
ICCV 2025posterarXiv:2411.17762
25
citations
MESED: A Multi-Modal Entity Set Expansion Dataset with Fine-Grained Semantic Classes and Hard Negative Entities
Li Yangning, Tingwei Lu, Hai-Tao Zheng et al.
AAAI 2024paperarXiv:2307.14878