ICLR Poster "visual prompting" Papers
2 papers found
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An et al.
ICLR 2025, poster, arXiv:2403.20271
86 citations
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms
Zhangheng LI, Keen You, Haotian Zhang et al.
ICLR 2025, poster, arXiv:2410.18967
43 citations