"out-of-distribution actions" Papers
2 papers found
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao, Yun Qu, Qi Wang et al.
NeurIPS 2025spotlightarXiv:2511.02567
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu, Yang Li, Yixing Lan et al.
ICML 2024posterarXiv:2405.19909