NEURIPS 2025 "reasoning models" Papers
5 papers found
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
Zhihang Lin, Mingbao Lin, Yuan Xie et al.
NEURIPS 2025posterarXiv:2503.22342
47
citations
Quantifying Elicitation of Latent Capabilities in Language Models
Elizabeth Donoway, Hailey Joren, Arushi Somani et al.
NEURIPS 2025poster
S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models
Muzhi Dai, Chenxu Yang, Qingyi Si
NEURIPS 2025oralarXiv:2505.07686
46
citations
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Zhen Zhang, Xuehai He, Weixiang Yan et al.
NEURIPS 2025posterarXiv:2505.15778
40
citations
SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models
Emil Biju, Shayan Talaei, Zhemin Huang et al.
NEURIPS 2025posterarXiv:2506.05745
4
citations