"generalization analysis" Papers
7 papers found
Impact of Layer Norm on Memorization and Generalization in Transformers
Rishi Singhal, Jung-Eun Kim
NeurIPS 2025posterarXiv:2511.10566
1
citations
From Generalization Analysis to Optimization Designs for State Space Models
Fusheng Liu, Qianxiao Li
ICML 2024oral
Generalization Analysis of Stochastic Weight Averaging with General Sampling
Wang Peng, Li Shen, Zerui Tao et al.
ICML 2024poster
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Hongkang Li, Meng Wang, Songtao Lu et al.
ICML 2024poster
Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
Feiran Li, Qianqian Xu, Shilong Bao et al.
ICML 2024spotlight
Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations
Xiaokang Pan, Xingyu Li, Jin Liu et al.
ICML 2024poster
Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms
Ming Yang, Xiyuan Wei, Tianbao Yang et al.
ICML 2024poster