Jingzhao Zhang
13
Papers
16
Total Citations
Papers (13)
Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?
ICML 2025
6
citations
Second-Order Min-Max Optimization with Lazy Hessians
ICLR 2025
4
citations
A Quadratic Synchronization Rule for Distributed Deep Learning
ICLR 2024
4
citations
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
NeurIPS 2025arXiv
2
citations
Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning
ICML 2024
0
citations
Efficient Sampling on Riemannian Manifolds via Langevin MCMC
NeurIPS 2022
0
citations
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective
NeurIPS 2023
0
citations
Fast Conditional Mixing of MCMC Algorithms for Non-log-concave Distributions
NeurIPS 2023
0
citations
Iteratively Learn Diverse Strategies with State Distance Information
NeurIPS 2023
0
citations
Direct Runge-Kutta Discretization Achieves Acceleration
NeurIPS 2018
0
citations
Why are Adaptive Methods Good for Attention Models?
NeurIPS 2020
0
citations
Complexity Lower Bounds for Nonconvex-Strongly-Concave Min-Max Optimization
NeurIPS 2021
0
citations
Fast Federated Learning in the Presence of Arbitrary Device Unavailability
NeurIPS 2021
0
citations