ICLR 2025 "visual reasoning" Papers
2 papers found
CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
Ji Qi, Ming Ding, Weihan Wang et al.
ICLR 2025posterarXiv:2402.04236
33
citations
Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning
Oleh Kolner, Thomas Ortner, Stanisław Woźniak et al.
ICLR 2025posterarXiv:2409.20213