2025 "multimodal interaction" Papers
3 papers found
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An et al.
ICLR 2025posterarXiv:2403.20271
86
citations
INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling
Xin Dong, Shichao Dong, Jin Wang et al.
ICCV 2025posterarXiv:2507.05056
3
citations
Lightweight Neural App Control
Filippos Christianos, Georgios Papoudakis, Thomas Coste et al.
ICLR 2025posterarXiv:2410.17883
10
citations