Scott Niekum
10
Papers
15
Total Citations
Papers (10)
Learning Optimal Advantage from Preferences and Mistaking It for Reward
AAAI 2024arXiv
15
citations
Policy Evaluation Using the Ω-Return
NeurIPS 2015
0
citations
Bayesian Robust Optimization for Imitation Learning
NeurIPS 2020
0
citations
Adversarial Intrinsic Motivation for Reinforcement Learning
NeurIPS 2021
0
citations
SOPE: Spectrum of Off-Policy Estimators
NeurIPS 2021
0
citations
Universal Off-Policy Evaluation
NeurIPS 2021
0
citations
On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search
ICML 2016
0
citations
Data-Efficient Policy Evaluation Through Behavior Policy Search
ICML 2017
0
citations
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
ICML 2019
0
citations
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
ICML 2019
0
citations