Jingzhao Zhang
5
Papers
16
Total Citations
Papers (5)
Task Generalization with Autoregressive Compositional Structure: Can Learning from $D$ Tasks Generalize to $D^T$ Tasks?
ICML 2025
6
citations
Second-Order Min-Max Optimization with Lazy Hessians
ICLR 2025
4
citations
A Quadratic Synchronization Rule for Distributed Deep Learning
ICLR 2024
4
citations
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
NeurIPS 2025
2
citations
Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning
ICML 2024
0
citations