"constrained markov decision process" Papers
2 papers found
Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization
Xiyue Peng, Hengquan Guo, Jiawei Zhang et al.
NEURIPS 2025posterarXiv:2410.19933
5
citations
Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty
Xu Wan, Chao Yang, Cheng Yang et al.
NEURIPS 2025poster