2024 "large vision-language models" Papers
2 papers found
HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
Fucai Ke, Zhixi Cai, Simindokht Jahangard et al.
ECCV 2024posterarXiv:2403.12884
25
citations
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models
Hao Cheng, Erjia Xiao, Jindong Gu et al.
ECCV 2024posterarXiv:2402.19150
15
citations