NeurIPS 2025 "reasoning models" Papers
4 papers found
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
Zhihang Lin, Mingbao Lin, Yuan Xie et al.
NeurIPS 2025, poster, arXiv:2503.22342
47 citations
Quantifying Elicitation of Latent Capabilities in Language Models
Elizabeth Donoway, Hailey Joren, Arushi Somani et al.
NeurIPS 2025, poster
S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models
Muzhi Dai, Chenxu Yang, Qingyi Si
NeurIPS 2025, oral, arXiv:2505.07686
46 citations
SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models
Emil Biju, Shayan Talaei, Zhemin Huang et al.
NeurIPS 2025, poster, arXiv:2506.05745
4 citations