NeurIPS Poster "adaptive gradient methods" Papers
2 papers found
Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold
Xinghan Li, Haodong Wen, Kaifeng Lyu
NeurIPS 2025, poster, arXiv:2511.02773
Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks
Xuan Tang, Han Zhang, Yuan Cao et al.
NeurIPS 2025, poster, arXiv:2510.11354