2025 Poster "visual prompting" Papers
5 papers found
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model
Benlin Liu, Yuhao Dong, Yiqin Wang et al.
CVPR 2025posterarXiv:2408.00754
9
citations
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An et al.
ICLR 2025posterarXiv:2403.20271
86
citations
Enhancing Visual Prompting through Expanded Transformation Space and Overfitting Mitigation
Shohei Enomoto
NEURIPS 2025posterarXiv:2510.07823
Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms
Zhangheng LI, Keen You, Haotian Zhang et al.
ICLR 2025posterarXiv:2410.18967
43
citations
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
Jaeseok Jeong, Junho Kim, Youngjung Uh et al.
ICCV 2025posterarXiv:2510.06827
2
citations