2025 "math reasoning" Papers
4 papers found
AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
Wei Fu, Jiaxuan Gao, Xujie Shen et al.
NeurIPS 2025posterarXiv:2505.24298
95
citations
Fine-tuning with Reserved Majority for Noise Reduction
Shuyang Jiang, Yusheng Liao, Ya Zhang et al.
ICLR 2025poster
2
citations
Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping
Pu Yang, Yunzhen Feng, Ziyuan Chen et al.
NeurIPS 2025spotlightarXiv:2501.18962
1
citations
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions
Siqi Kou, Qingyuan Tian, Hanwen Xu et al.
NeurIPS 2025posterarXiv:2505.19949
4
citations