NeurIPS "mean squared error" Papers
2 papers found
Emergence and scaling laws in SGD learning of shallow neural networks
Yunwei Ren, Eshaan Nichani, Denny Wu et al.
NeurIPS 2025posterarXiv:2504.19983
13
citations
Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers
Peter Súkeník, Christoph Lampert, Marco Mondelli
NeurIPS 2025posterarXiv:2505.15239
4
citations