"policy evaluation" Papers
13 papers found
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
Feichen Gan, Lu Youcun, Yingying Zhang et al.
NeurIPS 2025, oral, arXiv:2510.26026
Estimation and Inference in Distributional Reinforcement Learning
Liangyu Zhang, Yang Peng, Jiadong Liang et al.
NeurIPS 2025, poster, arXiv:2309.17262
4 citations
IRASim: A Fine-Grained World Model for Robot Manipulation
Fangqi Zhu, Hongtao Wu, Song Guo et al.
ICCV 2025, poster, arXiv:2406.14540
21 citations
Time After Time: Deep-Q Effect Estimation for Interventions on When and What to do
Yoav Wald, Mark Goldstein, Yonathan Efroni et al.
ICLR 2025, poster, arXiv:2503.15890
Combining Experimental and Historical Data for Policy Evaluation
Ting Li, Chengchun Shi, Qianglin Wen et al.
ICML 2024, poster
Discerning Temporal Difference Learning
AAAI 2024, paper, arXiv:2310.08091
Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
Shuze Liu, Shangtong Zhang
ICML 2024, poster
Faster Stochastic Variance Reduction Methods for Compositional MiniMax Optimization
Jin Liu, Xiaokang Pan, Junwen Duan et al.
AAAI 2024, paper, arXiv:2308.09604
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery
Yassir Jedra, William Réveillard, Stefan Stojanovic et al.
ICML 2024, poster
Policy-conditioned Environment Models are More Generalizable
Ruifeng Chen, Xiong-Hui Chen, Yihao Sun et al.
ICML 2024, poster
Policy Evaluation for Variance in Average Reward Reinforcement Learning
Shubhada Agrawal, Prashanth L.A., Siva Maguluri
ICML 2024, oral
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee, Josiah Hanna, Robert Nowak
ICML 2024, poster
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
Khurram Javed, Haseeb Shah, Richard Sutton et al.
ICML 2024, poster