Difan Zou

16
Papers
88
Total Citations

Papers (16)

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

ICLR 2024
85
citations

Faster Sampling via Stochastic Gradient Proximal Sampler

ICML 2024
3
citations

Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data

ICML 2024
0
citations

What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks

ICML 2024
0
citations

Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference

ICML 2024
0
citations

Understanding the Generalization of Stochastic Gradient Adam in Learning Neural Networks

NeurIPS 2025
0
citations

Parallelized Autoregressive Visual Generation

CVPR 2025
0
citations

Kernel Regression in Structured Non-IID Settings: Theory and Implications for Denoising Score Learning

NeurIPS 2025
0
citations

Stochastic Variance-Reduced Hamilton Monte Carlo Methods

ICML 2018
0
citations

Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization

NeurIPS 2018
0
citations

An Improved Analysis of Training Over-parameterized Deep Neural Networks

NeurIPS 2019
0
citations

Layer-Dependent Importance Sampling for Training Deep and Large Graph Convolutional Networks

NeurIPS 2019
0
citations

Stochastic Gradient Hamiltonian Monte Carlo Methods with Recursive Variance Reduction

NeurIPS 2019
0
citations

The Benefits of Implicit Regularization from SGD in Least Squares Problems

NeurIPS 2021
0
citations

Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime

NeurIPS 2022
0
citations

The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift

NeurIPS 2022
0
citations