2025 "multimodal reasoning benchmarks" Papers
2 papers found
Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling
Jiahao Wang, Weiye Xu, Aijun Yang et al.
NeurIPS 2025 poster, arXiv:2511.10648
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
Guowei Xu, Peng Jin, Ziang Wu et al.
ICCV 2025 poster, arXiv:2411.10440
344 citations