Zhuoran Yang

Papers: 53
Total Citations: 21

Papers (53)

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

ICLR 2024
13
citations

More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning

NeurIPS 2016, arXiv
6
citations

Can Neural Networks Achieve Optimal Computational-statistical Tradeoff? An Analysis on Single-Index Model

ICLR 2025
2
citations

Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning

ICML 2024
0
citations

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

ICML 2024
0
citations

A General Framework for Sequential Decision-Making under Adaptivity Constraints

ICML 2024
0
citations

How Does Goal Relabeling Improve Sample Efficiency?

ICML 2024
0
citations

Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling

ICML 2024
0
citations

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

ICML 2024
0
citations

InstaDrive: Instance-Aware Driving World Models for Realistic and Consistent Video Generation

ICCV 2025
0
citations

Human Memory Search as Initial-Visit Emitting Random Walk

NeurIPS 2015
0
citations

Estimating High-dimensional Non-Gaussian Multiple Index Models via Stein’s Lemma

NeurIPS 2017
0
citations

Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework

NeurIPS 2020
0
citations

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach

NeurIPS 2020
0
citations

Provably Efficient Neural GTD for Off-Policy Learning

NeurIPS 2020
0
citations

Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations

NeurIPS 2020
0
citations

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

NeurIPS 2020
0
citations

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

NeurIPS 2020
0
citations

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

NeurIPS 2020
0
citations

BooVI: Provably Efficient Bootstrapped Value Iteration

NeurIPS 2021
0
citations

Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic

NeurIPS 2021
0
citations

Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

NeurIPS 2021
0
citations

Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

NeurIPS 2021
0
citations

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

NeurIPS 2021
0
citations

Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration

NeurIPS 2021
0
citations

A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum

NeurIPS 2021
0
citations

A Unifying Framework of Off-Policy General Value Function Evaluation

NeurIPS 2022
0
citations

Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

NeurIPS 2022
0
citations

Exponential Family Model-Based Reinforcement Learning via Score Matching

NeurIPS 2022
0
citations

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence

NeurIPS 2022
0
citations

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

NeurIPS 2022
0
citations

Reinforcement Learning with Logarithmic Regret and Policy Switches

NeurIPS 2022
0
citations

Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

NeurIPS 2023
0
citations

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

NeurIPS 2023
0
citations

Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games

NeurIPS 2023
0
citations

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

NeurIPS 2023
0
citations

Learning Regularized Monotone Graphon Mean-Field Games

NeurIPS 2023
0
citations

Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity

ICML 2016
0
citations

High-dimensional Non-Gaussian Single Index Models via Thresholded Score Function Estimation

ICML 2017
0
citations

The Edge Density Barrier: Computational-Statistical Tradeoffs in Combinatorial Inference

ICML 2018
0
citations

Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents

ICML 2018
0
citations

On the statistical rate of nonlinear recovery in generative models with heavy-tailed data

ICML 2019
0
citations

Provable Gaussian Embedding with One Observation

NeurIPS 2018
0
citations

Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization

NeurIPS 2018
0
citations

Contrastive Learning from Pairwise Measurements

NeurIPS 2018
0
citations

Statistical-Computational Tradeoff in Single Index Models

NeurIPS 2019
0
citations

Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy

NeurIPS 2019
0
citations

Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games

NeurIPS 2019
0
citations

Variance Reduced Policy Evaluation with Smooth Function Approximation

NeurIPS 2019
0
citations

Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost

NeurIPS 2019
0
citations

Neural Temporal-Difference Learning Converges to Global Optima

NeurIPS 2019
0
citations

Convergent Policy Optimization for Safe Reinforcement Learning

NeurIPS 2019
0
citations

Dynamic Regret of Policy Optimization in Non-Stationary Environments

NeurIPS 2020
0
citations