NeurIPS Oral "training dynamics" Papers
2 papers found
Flatness is Necessary, Neural Collapse is Not: Rethinking Generalization via Grokking
Ting Han, Linara Adilova, Henning Petzka et al.
NeurIPS 2025oralarXiv:2509.17738
3
citations
Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training
Tony Bonnaire, Raphaël Urfin, Giulio Biroli et al.
NeurIPS 2025oral