Poster "adaptive optimizers" Papers
2 papers found
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Weronika Ormaniec, Felix Dangel, Sidak Pal Singh
ICLR 2025 poster, arXiv:2410.10986
10 citations
MADA: Meta-Adaptive Optimizers Through Hyper-Gradient Descent
Kaan Ozkara, Can Karakus, Parameswaran Raman et al.
ICML 2024 poster