ICML 2024 "vision language models" Papers
5 papers found
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang, Shichao Dong, Yapeng Zhu et al.
ICML 2024posterarXiv:2405.17201
LCA-on-the-Line: Benchmarking Out of Distribution Generalization with Class Taxonomies
Jia Shi, Gautam Rajendrakumar Gare, Jinjin Tian et al.
ICML 2024poster
Leveraging VLM-Based Pipelines to Annotate 3D Objects
Rishabh Kabra, Loic Matthey, Alexander Lerchner et al.
ICML 2024posterarXiv:2311.17851
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Soroush Nasiriany, Fei Xia, Wenhao Yu et al.
ICML 2024posterarXiv:2402.07872
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
Yufei Wang, Zhanyi Sun, Jesse Zhang et al.
ICML 2024posterarXiv:2402.03681