Papers by QIANG FU
6 papers found
Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning
Hanlin Yang, Jian Yao, Weiming Liu et al.
ICLR 2025 oral
2 citations
Online-to-Offline RL for Agent Alignment
Xu Liu, Haobo Fu, Stefano V. Albrecht et al.
ICLR 2025 poster
Dynamic Discounted Counterfactual Regret Minimization
Hang Xu, Kai Li, Haobo Fu et al.
ICLR 2024 spotlight
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
Yiming Gao, Feiyu Liu, Liang Wang et al.
ICLR 2024 poster
4 citations
Maximum Entropy Heterogeneous-Agent Reinforcement Learning
Jiarong Liu, Yifan Zhong, Siyi Hu et al.
ICLR 2024 spotlight
Towards Offline Opponent Modeling with In-context Learning
Yuheng Jing, Kai Li, Bingyun Liu et al.
ICLR 2024 poster