2025 "pixel-level understanding" Papers
3 papers found
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin, Xinyu Wei, Ruichuan An et al.
ICLR 2025posterarXiv:2403.20271
86
citations
LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models
Haiwen Huang, Anpei Chen, Volodymyr Havrylov et al.
ICCV 2025posterarXiv:2504.14032
10
citations
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning
Ye Liu, Zongyang Ma, Junfu Pu et al.
NEURIPS 2025posterarXiv:2509.18094
4
citations