NEURIPS Poster "training stability" Papers
3 papers found
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
Jingcheng Hu, Yinmin Zhang, Qi Han et al.
NEURIPS 2025posterarXiv:2503.24290
317
citations
Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks
Giyeong Oh, Woohyun Cho, Siyeol Kim et al.
NEURIPS 2025posterarXiv:2505.11881
Sinusoidal Initialization, Time for a New Start
Alberto Fernandez-Hernandez, Jose Mestre, Manuel F. Dolz et al.
NEURIPS 2025posterarXiv:2505.12909
1
citations