2025 "rule-based reinforcement learning" Papers
2 papers found
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
Wenwen Yu, Zhibo Yang, Yuliang Liu et al.
ICCV 2025, poster, arXiv:2508.08589
4 citations
Video-R1: Reinforcing Video Reasoning in MLLMs
Kaituo Feng, Kaixiong Gong, Bohao Li et al.
NeurIPS 2025, oral, arXiv:2503.21776
236 citations