ICLR 2025 "policy evaluation" Papers
4 papers found
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu, Claire Chen, Shangtong Zhang
ICLR 2025posterarXiv:2410.02226
3
citations
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen, Shuze Liu, Shangtong Zhang
ICLR 2025posterarXiv:2410.05655
1
citations
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Yoav Wald, Mark Goldstein, Yonathan Efroni et al.
ICLR 2025posterarXiv:2503.15890
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.
ICLR 2025oralarXiv:2405.13861
14
citations