Xiaoteng Ma
7
Papers
0
Total Citations
Papers (7)
Learning Diverse Risk Preferences in Population-Based Self-Play
AAAI 2024arXiv
0
citations
Single-Trajectory Distributionally Robust Reinforcement Learning
ICML 2024
0
citations
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning
NeurIPS 2021
0
citations
Mildly Conservative Q-Learning for Offline Reinforcement Learning
NeurIPS 2022
0
citations
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NeurIPS 2022
0
citations
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping
NeurIPS 2022
0
citations
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NeurIPS 2023
0
citations