"learning rate schedules" Papers
3 papers found
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Kairong Luo, Haodong Wen, Shengding Hu et al.
ICLR 2025 poster, arXiv:2503.12811
13 citations
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
Minhak Song, Beomhan Baek, Kwangjun Ahn et al.
NeurIPS 2025 poster, arXiv:2507.09846
2 citations
No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths
Charles Guille-Escuret, Hiroki Naganuma, Kilian Fatras et al.
ICML 2024 poster, arXiv:2306.11922