Qiaomin Xie
16 Papers
8 Total Citations
Papers (16)
Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
AAAI 2024, arXiv
4
citations
Exact Policy Recovery in Offline RL with Both Heavy-Tailed Rewards and Data Corruption
AAAI 2024
2
citations
Stable Offline Value Function Learning with Bisimulation-based Representations
ICML 2025
1
citation
Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent
AAAI 2025
1
citation
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
ICML 2024
0
citations
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
ICML 2024
0
citations
Contextual Online Pricing with (Biased) Offline Data
NeurIPS 2025
0
citations
Optimal Attack and Defense for Reinforcement Learning
AAAI 2024
0
citations
Data Poisoning to Fake a Nash Equilibria for Markov Games
AAAI 2024
0
citations
Roping in Uncertainty: Robustness and Regularization in Markov Games
ICML 2024, arXiv
0
citations
Q-learning with Nearest Neighbors
NeurIPS 2018
0
citations
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
NeurIPS 2020
0
citations
Dynamic Regret of Policy Optimization in Non-Stationary Environments
NeurIPS 2020
0
citations
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
NeurIPS 2020
0
citations
Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
NeurIPS 2023
0
citations
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
NeurIPS 2023
0
citations