"group relative policy optimization" Papers
7 papers found
A2Seek: Towards Reasoning-Centric Benchmark for Aerial Anomaly Understanding
Mengjingcheng Mo, Xinyang Tong, Mingpi Tan et al.
NeurIPS 2025posterarXiv:2505.21962
2
citations
ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation
Lingfeng Wang, Hualing Lin, Senda Chen et al.
NeurIPS 2025posterarXiv:2505.16495
2
citations
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
Zhihang Lin, Mingbao Lin, Yuan Xie et al.
NeurIPS 2025posterarXiv:2503.22342
47
citations
Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO
Chengzhuo Tong, Ziyu Guo, Renrui Zhang et al.
NeurIPS 2025posterarXiv:2505.17017
25
citations
Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning
Fanrui Zhang, Dian Li, Qiang Zhang et al.
NeurIPS 2025posterarXiv:2505.16836
4
citations
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent
Yunlong Lin, Zixu Lin, Kunjie Lin et al.
NeurIPS 2025posterarXiv:2506.17612
9
citations
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank
Tianhe Wu, Jian Zou, Jie Liang et al.
NeurIPS 2025spotlightarXiv:2505.14460
30
citations