"neural network training" Papers

12 papers found

Accelerating neural network training: An analysis of the AlgoPerf competition

Priya Kasimbeg, Frank Schneider, Runa Eschenhagen et al.

ICLR 2025 | poster | arXiv:2502.15015

Block Coordinate Descent for Neural Networks Provably Finds Global Minima

Shunta Akiyama

NeurIPS 2025 | poster | arXiv:2510.22667

Efficient Representativeness-Aware Coreset Selection

Zihao Cheng, Binrui Wu, Zhiwei Li et al.

NeurIPS 2025 | poster

KOALA++: Efficient Kalman-Based Optimization with Gradient-Covariance Products

Zixuan Xia, Aram Davtyan, Paolo Favaro

NeurIPS 2025 | poster | arXiv:2506.04432

Learn2Mix: Training Neural Networks Using Adaptive Data Integration

Shyam Venkatasubramanian, Vahid Tarokh

NeurIPS 2025 | poster | arXiv:2412.16482

Learning High-Degree Parities: The Crucial Role of the Initialization

Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.

ICLR 2025 | poster | arXiv:2412.04910

Purifying Shampoo: Investigating Shampoo's Heuristics by Decomposing its Preconditioner

Runa Eschenhagen, Aaron Defazio, Tsung-Hsien Lee et al.

NeurIPS 2025 | spotlight | arXiv:2506.03595

RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs

Xi Xie, Yuebo Luo, Hongwu Peng et al.

ICLR 2025 | poster | arXiv:2409.00822

Efficient Algorithms for Sum-Of-Minimum Optimization

Lisang Ding, Ziang Chen, Xinshang Wang et al.

ICML 2024 | poster | arXiv:2402.07070

Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity Tradeoffs

Luca Arnaboldi, Yatin Dandi, Florent Krzakala et al.

ICML 2024 | poster

Random Scaling and Momentum for Non-smooth Non-convex Optimization

Qinzi Zhang, Ashok Cutkosky

ICML 2024 | poster | arXiv:2405.09742

Spectral Preconditioning for Gradient Methods on Graded Non-convex Functions

Nikita Doikov, Sebastian Stich, Martin Jaggi

ICML 2024 | poster