"constrained markov decision processes" Papers
4 papers found
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno et al.
NeurIPS 2025spotlightarXiv:2502.10138
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri, Rahul Jain, Haipeng Luo
ICML 2024poster
Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints
Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.
ICML 2024poster
Truly No-Regret Learning in Constrained MDPs
Adrian Müller, Pragnya Alatur, Volkan Cevher et al.
ICML 2024spotlight