Tuo Zhao
13
Papers
195
Total Citations
Papers (13)
LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models
ICLR 2024 | arXiv
194
citations
A Minimalist Example of Edge-of-Stability and Progressive Sharpening
NeurIPS 2025 | arXiv
1
citation
Ask a Strong LLM Judge when Your Reward Model is Uncertain
NeurIPS 2025 | arXiv
0
citations
Beyond Point Prediction: Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process
ICML 2024
0
citations
To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO
ICML 2024 | arXiv
0
citations
Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective
NeurIPS 2020 | arXiv
0
citations
Differentiable Top-k with Optimal Transport
NeurIPS 2020
0
citations
Towards Understanding Hierarchical Learning: Benefits of Neural Representations
NeurIPS 2020 | arXiv
0
citations
Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL
NeurIPS 2021
0
citations
On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds
NeurIPS 2022 | arXiv
0
citations
Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
NeurIPS 2023 | arXiv
0
citations
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
NeurIPS 2023 | arXiv
0
citations
Module-wise Adaptive Distillation for Multimodality Foundation Models
NeurIPS 2023 | arXiv
0
citations