Oral "training dynamics analysis" Papers
3 papers found
EvoLM: In Search of Lost Language Model Training Dynamics
Zhenting Qi, Fan Nie, Alexandre Alahi et al.
NeurIPS 2025, oral, arXiv:2506.16029
3 citations
From Condensation to Rank Collapse: A Two-Stage Analysis of Transformer Training Dynamics
Zheng-An Chen, Tao Luo
NeurIPS 2025, oral, arXiv:2510.06954
1 citation
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
James Michaelov, Roger Levy, Benjamin Bergen
NeurIPS 2025, oral, arXiv:2510.24963