Poster "overparameterization benefits" Papers
2 papers found
On the Optimization and Generalization of Multi-head Attention
Christos Thrampoulidis, Rouzbeh Ghaderi, Hossein Taheri et al.
ICLR 2025, poster, arXiv:2310.12680
44 citations
Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors
Sungyoon Lee, Sokbae Lee
ICLR 2025, poster, arXiv:2305.12883