Tuo Zhao

Papers: 13

Total Citations: 195

Papers (13)

LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models

A Minimalist Example of Edge-of-Stability and Progressive Sharpening

NeurIPS 2025 (arXiv)

Ask a Strong LLM Judge when Your Reward Model is Uncertain

NeurIPS 2025 (arXiv)

Beyond Point Prediction: Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process

To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective

NeurIPS 2020 (arXiv)

Differentiable Top-k with Optimal Transport

Towards Understanding Hierarchical Learning: Benefits of Neural Representations

NeurIPS 2020 (arXiv)

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds

NeurIPS 2022 (arXiv)

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

NeurIPS 2023 (arXiv)

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

NeurIPS 2023 (arXiv)

Module-wise Adaptive Distillation for Multimodality Foundation Models

NeurIPS 2023 (arXiv)