"constrained mdps" Papers
2 papers found
Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs
Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman
NeurIPS 2025spotlightarXiv:2505.12049
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
ICLR 2025posterarXiv:2410.02275
5
citations