"normalized gradient descent" Papers
2 papers found
Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping
Zijian Liu, Zhengyuan Zhou
ICLR 2025posterarXiv:2412.19529
23
citations
Improving Computational Complexity in Statistical Models with Local Curvature Information
Pedram Akbarian, Tongzheng Ren, Jiacheng Zhuo et al.
ICML 2024poster