Risk-Sensitive Policy Optimization via Predictive CVaR Policy Gradient

#10 in ICML 2024 of 2635 papers

Authors

Ju-Hyun Kim, Seungki Min

Topics

conditional value-at-risk, policy optimization, risk-sensitive control, policy gradient methods, predictive reweighting strategy, reinforcement learning

Abstract

This paper addresses a policy optimization task with the conditional value-at-risk (CVaR) objective. We introduce the predictive CVaR policy gradient, a novel approach that seamlessly integrates risk-neutral policy gradient algorithms with minimal modifications. Our method incorporates a reweighting strategy in gradient calculation: individual cost terms are reweighted in proportion to their predicted contribution to the objective. These weights can be easily estimated through a separate learning procedure. We provide theoretical and empirical analyses, demonstrating the validity and effectiveness of our proposed method.
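
For readers unfamiliar with the objective, the following is a minimal sketch of a sample-based CVaR estimate, written as a weighted sum of costs. It is not the paper's predictive reweighting scheme, which the abstract does not specify. It only illustrates the standard idea that CVaR averages the worst tail of a cost distribution. The function names `tail_weights` and `empirical_cvar`, and the convention that `alpha` is the tail fraction of the largest costs, are assumptions made for illustration.

```python
import numpy as np


def tail_weights(costs, alpha):
    """Weight 1/k on each of the k largest costs, 0 elsewhere.

    Assumption: alpha in (0, 1] is the tail fraction, so k = ceil(alpha * n).
    """
    costs = np.asarray(costs, dtype=float)
    k = max(1, int(np.ceil(alpha * costs.size)))
    worst = np.argsort(costs)[::-1][:k]  # indices of the k largest costs
    weights = np.zeros_like(costs)
    weights[worst] = 1.0 / k
    return weights


def empirical_cvar(costs, alpha):
    """Sample CVaR: the mean of the worst alpha-fraction of costs."""
    costs = np.asarray(costs, dtype=float)
    return float(np.dot(tail_weights(costs, alpha), costs))


if __name__ == "__main__":
    sampled_costs = [1.0, 2.0, 3.0, 4.0]
    print(tail_weights(sampled_costs, 0.5))
    print(empirical_cvar(sampled_costs, 0.5))
```

With `alpha = 1.0` every sample receives equal weight and the estimate reduces to the ordinary mean, which is the risk-neutral case.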
