Zhiyuan Li
8
Papers
235
Total Citations
1
Affiliations
Affiliations
Toyota Technological Institute at Chicago
Papers (8)
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
ICLR 2024
222
citations
PENCIL: Long Thoughts with Short Memory
ICML 2025
9
citations
AgentMixer: Multi-Agent Correlated Policy Factorization
AAAI 2025
4
citations
Implicit Bias of AdamW: $\ell_\infty$-Norm Constrained Optimization
ICML 2024
0
citations
Find a Scapegoat: Poisoning Membership Inference Attack and Defense to Federated Learning
ICCV 2025
0
citations
Optimistic Multi-Agent Policy Gradient
ICML 2024
0
citations
Simplicity Bias via Global Convergence of Sharpness Minimization
ICML 2024
0
citations
Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition
ICML 2024
0
citations