Difan Zou
16
Papers
88
Total Citations
Papers (16)
How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?
ICLR 2024
85
citations
Faster Sampling via Stochastic Gradient Proximal Sampler
ICML 2024
3
citations
Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data
ICML 2024
0
citations
What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks
ICML 2024
0
citations
Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference
ICML 2024
0
citations
Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks
NeurIPS 2025
0
citations
Parallelized Autoregressive Visual Generation
CVPR 2025
0
citations
Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning
NeurIPS 2025
0
citations
Stochastic Variance-Reduced Hamilton Monte Carlo Methods
ICML 2018
0
citations
Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization
NeurIPS 2018
0
citations
An Improved Analysis of Training Over-parameterized Deep Neural Networks
NeurIPS 2019
0
citations
Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks
NeurIPS 2019
0
citations
Stochastic Gradient Hamiltonian Monte Carlo Methods with Recursive Variance Reduction
NeurIPS 2019
0
citations
The Benefits of Implicit Regularization from SGD in Least Squares Problems
NeurIPS 2021
0
citations
Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime
NeurIPS 2022
0
citations
The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift
NeurIPS 2022
0
citations