"stochastic gradient descent" Papers

32 papers found

Emergence and scaling laws in SGD learning of shallow neural networks

Yunwei Ren, Eshaan Nichani, Denny Wu et al.

NeurIPS 2025posterarXiv:2504.19983
13
citations

Gaussian Approximation and Concentration of Constant Learning-Rate Stochastic Gradient Descent

Ziyang Wei, Jiaqi Li, Zhipeng Lou et al.

NeurIPS 2025poster

Nonasymptotic Analysis of Stochastic Gradient Descent with the Richardson–Romberg Extrapolation

Marina Sheshukova, Denis Belomestny, Alain Oliviero Durmus et al.

ICLR 2025posterarXiv:2410.05106
4
citations

Online robust locally differentially private learning for nonparametric regression

Chenfei Gu, Qiangqiang Zhang, Ting Li et al.

NeurIPS 2025poster

Online Statistical Inference in Decision Making with Matrix Context

Qiyu Han, Will Wei Sun, Yichen Zhang

NeurIPS 2025posterarXiv:2212.11385
2
citations

Optimal Rates in Continual Linear Regression via Increasing Regularization

Ran Levinstein, Amit Attia, Matan Schliserman et al.

NeurIPS 2025posterarXiv:2506.06501
2
citations

A Doubly Recursive Stochastic Compositional Gradient Descent Method for Federated Multi-Level Compositional Optimization

Hongchang Gao

ICML 2024poster

Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks

Amit Peleg, Matthias Hein

ICML 2024poster

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu, Chaoyue Liu, Adityanarayanan Radhakrishnan et al.

ICML 2024poster

Delving into the Convergence of Generalized Smooth Minimax Optimization

Wenhan Xian, Ziyi Chen, Heng Huang

ICML 2024poster

Demystifying SGD with Doubly Stochastic Gradients

Kyurae Kim, Joohwan Ko, Yian Ma et al.

ICML 2024poster

Double Stochasticity Gazes Faster: Snap-Shot Decentralized Stochastic Gradient Tracking Methods

Hao Di, Haishan Ye, Xiangyu Chang et al.

ICML 2024poster

Efficient Online Set-valued Classification with Bandit Feedback

Zhou Wang, Xingye Qiao

ICML 2024poster

Generalization Analysis of Stochastic Weight Averaging with General Sampling

Wang Peng, Li Shen, Zerui Tao et al.

ICML 2024poster

How Private are DP-SGD Implementations?

Lynn Chua, Badih Ghazi, Pritish Kamath et al.

ICML 2024poster

Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD

Yijun Wan, Melih Barsbey, Abdellatif Zaidi et al.

ICML 2024poster

Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm

Batiste Le Bars, Aurélien Bellet, Marc Tommasi et al.

ICML 2024poster

LPGD: A General Framework for Backpropagation through Embedded Optimization Layers

Anselm Paulus, Georg Martius, Vit Musil

ICML 2024poster

MoMo: Momentum Models for Adaptive Learning Rates

Fabian Schaipp, Ruben Ohana, Michael Eickenberg et al.

ICML 2024poster

On Convergence of Incremental Gradient for Non-convex Smooth Functions

Anastasiia Koloskova, Nikita Doikov, Sebastian Stich et al.

ICML 2024poster

Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity Tradeoffs

Luca Arnaboldi, Yatin Dandi, FLORENT KRZAKALA et al.

ICML 2024poster

On the Generalization of Stochastic Gradient Descent with Momentum

Ali Ramezani-Kebrya, Kimon Antonakopoulos, Volkan Cevher et al.

ICML 2024poster

Plug-and-Play image restoration with Stochastic deNOising REgularization

Marien Renaud, Jean Prost, Arthur Leclaire et al.

ICML 2024poster

Random features models: a way to study the success of naive imputation

Alexis Ayme, Claire Boyer, Aymeric Dieuleveut et al.

ICML 2024poster

Random Scaling and Momentum for Non-smooth Non-convex Optimization

Qinzi Zhang, Ashok Cutkosky

ICML 2024poster

Sliding Down the Stairs: How Correlated Latent Variables Accelerate Learning with Neural Networks

Lorenzo Bardone, Sebastian Goldt

ICML 2024poster

Sparse Variational Student-t Processes

Jian Xu, Delu Zeng

AAAI 2024paperarXiv:2312.05568
4
citations

Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms

Ming Yang, Xiyuan Wei, Tianbao Yang et al.

ICML 2024poster

The Role of Learning Algorithms in Collective Action

Omri Ben-Dov, Jake Fawkes, Samira Samadi et al.

ICML 2024poster

Tuning-Free Stochastic Optimization

Ahmed Khaled, Chi Jin

ICML 2024spotlight

Understanding Forgetting in Continual Learning with Linear Regression

Meng Ding, Kaiyi Ji, Di Wang et al.

ICML 2024poster

What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

ICML 2024poster