"generalization gap" Papers
2 papers found
Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
Wu Lin, Felix Dangel, Runa Eschenhagen et al.
ICML 2024poster
PAC-Bayes Generalisation Bounds for Dynamical Systems including Stable RNNs
Deividas Eringis, John Leth, Zheng-Hua Tan et al.
AAAI 2024paperarXiv:2312.09793
3
citations