ICLR "model performance prediction" Papers
2 papers found
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
Kairong Luo, Haodong Wen, Shengding Hu et al.
ICLR 2025, poster, arXiv:2503.12811
13 citations
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu, Melody Lo
ICLR 2025, poster, arXiv:2410.01692
5 citations