ICLR 2025 "gradient descent dynamics" Papers
2 papers found
How Two-Layer Neural Networks Learn, One (Giant) Step at a Time
Yatin Dandi, Florent Krzakala, Bruno Loureiro et al.
ICLR 2025posterarXiv:2305.18270
47
citations
Loss Landscape of Shallow ReLU-like Neural Networks: Stationary Points, Saddle Escape, and Network Embedding
Frank Zhengqing Wu, Berfin Simsek, François Ged
ICLR 2025posterarXiv:2402.05626
2
citations