2025 "visual question-answering" Papers
3 papers found
Mitigating Object Hallucination in MLLMs via Data-augmented Phrase-level Alignment
Pritam Sarkar, Sayna Ebrahimi, Ali Etemad et al.
ICLR 2025posterarXiv:2405.18654
19
citations
Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
Cheng Chen, Yunpeng Zhai, Yifan Zhao et al.
CVPR 2025posterarXiv:2506.09473
1
citations
SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning
Wufei Ma, Yu-Cheng Chou, Qihao Liu et al.
NEURIPS 2025posterarXiv:2504.20024
21
citations