"neural network training" Papers
12 papers found
Accelerating neural network training: An analysis of the AlgoPerf competition
Priya Kasimbeg, Frank Schneider, Runa Eschenhagen et al.
ICLR 2025 poster arXiv:2502.15015
17 citations
Block Coordinate Descent for Neural Networks Provably Finds Global Minima
Shunta Akiyama
NeurIPS 2025 poster arXiv:2510.22667
2 citations
Efficient Representativeness-Aware Coreset Selection
Zihao Cheng, Binrui Wu, Zhiwei Li et al.
NeurIPS 2025 poster
KOALA++: Efficient Kalman-Based Optimization with Gradient-Covariance Products
Zixuan Xia, Aram Davtyan, Paolo Favaro
NeurIPS 2025 poster arXiv:2506.04432
Learn2Mix: Training Neural Networks Using Adaptive Data Integration
Shyam Venkatasubramanian, Vahid Tarokh
NeurIPS 2025 poster arXiv:2412.16482
2 citations
Learning High-Degree Parities: The Crucial Role of the Initialization
Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.
ICLR 2025 poster arXiv:2412.04910
3 citations
Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner
Runa Eschenhagen, Aaron Defazio, Tsung-Hsien Lee et al.
NeurIPS 2025 spotlight arXiv:2506.03595
5 citations
RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs
Xi Xie, Yuebo Luo, Hongwu Peng et al.
ICLR 2025 poster arXiv:2409.00822
2 citations
Efficient Algorithms for Sum-Of-Minimum Optimization
Lisang Ding, Ziang Chen, Xinshang Wang et al.
ICML 2024 poster arXiv:2402.07070
Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity Tradeoffs
Luca Arnaboldi, Yatin Dandi, Florent Krzakala et al.
ICML 2024 poster
Random Scaling and Momentum for Non-smooth Non-convex Optimization
Qinzi Zhang, Ashok Cutkosky
ICML 2024 poster arXiv:2405.09742
Spectral Preconditioning for Gradient Methods on Graded Non-convex Functions
Nikita Doikov, Sebastian Stich, Martin Jaggi
ICML 2024 poster