α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
ICCV 2025
/
R1-VL: Learning to Reason with Multimodal Large La...
ICCV 2025
poster
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
206
citations
206
Citations
7
Authors
1
Data Points
Authors
Jingyi Zhang
Jiaxing Huang
Huanjin Yao
Shunyu Liu
Xikun ZHANG
Shijian Lu
Dacheng Tao
Citation History
Jan 24, 2026
206