"reasoning trajectories" Papers
6 papers found
Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning
Chen Qian, Dongrui Liu, Hao Wen et al.
NeurIPS 2025, arXiv:2506.02867
22 citations
Know What You Don't Know: Uncertainty Calibration of Process Reward Models
Young-Jin Park, Kristjan Greenewald, Kaveh Alimohammadi et al.
NeurIPS 2025 poster, arXiv:2506.09338
4 citations
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
Zhenting Qi, Mingyuan Ma, Jiahang Xu et al.
ICLR 2025 poster, arXiv:2408.06195
127 citations
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
Xiangyu Peng, Congying Xia, Xinyi Yang et al.
ICLR 2025 poster, arXiv:2410.02108
14 citations
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
Wanjia Zhao, Mert Yuksekgonul, Shirley Wu et al.
NeurIPS 2025 poster, arXiv:2502.04780
18 citations
SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models
Emil Biju, Shayan Talaei, Zhemin Huang et al.
NeurIPS 2025 poster, arXiv:2506.05745
4 citations