ICLR 2025 "vision language models" Papers
8 papers found
Are Large Vision Language Models Good Game Players?
Xinyu Wang, Bohan Zhuang, Qi Wu
ICLR 2025, poster, arXiv:2503.02358
13 citations
Can We Talk Models Into Seeing the World Differently?
Paul Gavrikov, Jovita Lukasik, Steffen Jung et al.
ICLR 2025, poster, arXiv:2403.09193
15 citations
ColPali: Efficient Document Retrieval with Vision Language Models
Manuel Faysse, Hugues Sibille, Tony Wu et al.
ICLR 2025, poster, arXiv:2407.01449
91 citations
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
Hongxin Zhang, Zeyuan Wang, Qiushi Lyu et al.
ICLR 2025, poster, arXiv:2404.10775
33 citations
ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time
Yi Ding, Bolian Li, Ruqi Zhang
ICLR 2025, poster, arXiv:2410.06625
42 citations
Mechanistic Interpretability Meets Vision Language Models: Insights and Limitations
Yiming Liu, Yuhui Zhang, Serena Yeung
ICLR 2025, poster
Vision Language Models are In-Context Value Learners
Yecheng Jason Ma, Joey Hejna, Chuyuan Fu et al.
ICLR 2025, oral, arXiv:2411.04549
43 citations
Weighted Multi-Prompt Learning with Description-free Large Language Model Distillation
Sua Lee, Kyubum Shin, Jung Ho Park
ICLR 2025, poster, arXiv:2507.07147
1 citation