Zhuoran Yang
53
Papers
21
Total Citations
Papers (53)
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems
ICLR 2024
13
citations
More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning
NeurIPS 2016arXiv
6
citations
Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model
ICLR 2025
2
citations
Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning
ICML 2024
0
citations
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
ICML 2024
0
citations
A General Framework for Sequential Decision-Making under Adaptivity Constraints
ICML 2024
0
citations
How Does Goal Relabeling Improve Sample Efficiency?
ICML 2024
0
citations
Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling
ICML 2024
0
citations
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
ICML 2024
0
citations
InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation
ICCV 2025
0
citations
Human Memory Search as Initial-Visit Emitting Random Walk
NeurIPS 2015
0
citations
Estimating High-dimensional Non-Gaussian Multiple Index Models via Stein’s Lemma
NeurIPS 2017
0
citations
Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework
NeurIPS 2020
0
citations
Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach
NeurIPS 2020
0
citations
Provably Efficient Neural GTD for Off-Policy Learning
NeurIPS 2020
0
citations
Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations
NeurIPS 2020
0
citations
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
NeurIPS 2020
0
citations
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
NeurIPS 2020
0
citations
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
NeurIPS 2020
0
citations
BooVI: Provably Efficient Bootstrapped Value Iteration
NeurIPS 2021
0
citations
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
NeurIPS 2021
0
citations
Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL
NeurIPS 2021
0
citations
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
NeurIPS 2021
0
citations
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data
NeurIPS 2021
0
citations
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration
NeurIPS 2021
0
citations
A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum
NeurIPS 2021
0
citations
A Unifying Framework of Off-Policy General Value Function Evaluation
NeurIPS 2022
0
citations
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
NeurIPS 2022
0
citations
Exponential Family Model-Based Reinforcement Learning via Score Matching
NeurIPS 2022
0
citations
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence
NeurIPS 2022
0
citations
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL
NeurIPS 2022
0
citations
Reinforcement Learning with Logarithmic Regret and Policy Switches
NeurIPS 2022
0
citations
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration
NeurIPS 2023
0
citations
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation
NeurIPS 2023
0
citations
Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games
NeurIPS 2023
0
citations
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
NeurIPS 2023
0
citations
Learning Regularized Monotone Graphon Mean-Field Games
NeurIPS 2023
0
citations
Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity
ICML 2016
0
citations
High-dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation
ICML 2017
0
citations
The Edge Density Barrier: Computational-Statistical Tradeoffs in Combinatorial Inference
ICML 2018
0
citations
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
ICML 2018
0
citations
On the statistical rate of nonlinear recovery in generative models with heavy-tailed data
ICML 2019
0
citations
Provable Gaussian Embedding with One Observation
NeurIPS 2018
0
citations
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization
NeurIPS 2018
0
citations
Contrastive Learning from Pairwise Measurements
NeurIPS 2018
0
citations
Statistical-Computational Tradeoff in Single Index Models
NeurIPS 2019
0
citations
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy
NeurIPS 2019
0
citations
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
NeurIPS 2019
0
citations
Variance Reduced Policy Evaluation with Smooth Function Approximation
NeurIPS 2019
0
citations
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
NeurIPS 2019
0
citations
Neural Temporal-Difference Learning Converges to Global Optima
NeurIPS 2019
0
citations
Convergent Policy Optimization for Safe Reinforcement Learning
NeurIPS 2019
0
citations
Dynamic Regret of Policy Optimization in Non-Stationary Environments
NeurIPS 2020
0
citations