"bellman equation" Papers
2 papers found
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations
Feng Gao, Liangzhi Shi, Shenao Zhang et al.
ICML 2024 | poster
Optimal Mechanism in a Dynamic Stochastic Knapsack Environment
Jihyeok Jung, Chan-Oi Song, Deok-Joo Lee et al.
AAAI 2024 | paper | arXiv:2402.14269