NEURIPS 2025 "parameter scaling" Papers
3 papers found
EvoLM: In Search of Lost Language Model Training Dynamics
Zhenting Qi, Fan Nie, Alexandre Alahi et al.
NEURIPS 2025 · oral · arXiv:2506.16029
3 citations
Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Sean McLeish, John Kirchenbauer, David Miller et al.
NEURIPS 2025 · poster · arXiv:2502.06857
10 citations
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Jonas Geiping, Sean McLeish, Neel Jain et al.
NEURIPS 2025 · spotlight · arXiv:2502.05171
138 citations