"heavy-tailed data" Papers
2 papers found
Scaling Laws for Gradient Descent and Sign Descent for Linear Bigram Models under Zipf’s Law
Frederik Kunstner, Francis Bach
NeurIPS 2025posterarXiv:2505.19227
7
citations
Subsampled Ensemble Can Improve Generalization Tail Exponentially
Huajie Qian, Donghao Ying, Henry Lam et al.
NeurIPS 2025posterarXiv:2405.14741
1
citations