2025 Oral "policy evaluation" Papers
3 papers found
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
Feichen Gan, Lu Youcun, Yingying Zhang et al.
NEURIPS 2025oralarXiv:2510.26026
Towards Provable Emergence of In-Context Reinforcement Learning
Jiuqi Wang, Rohan Chandra, Shangtong Zhang
NEURIPS 2025oralarXiv:2509.18389
1
citations
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.
ICLR 2025oralarXiv:2405.13861
14
citations