NeurIPS "sample complexity analysis" Papers
7 papers found
Finite-Time Bounds for Average-Reward Fitted Q-Iteration
Jongmin Lee, Ernest Ryu
NeurIPS 2025 · poster · arXiv:2510.17391
FraPPE: Fast and Efficient Preference-Based Pure Exploration
Udvas Das, Apurv Shukla, Debabrota Basu
NeurIPS 2025 · poster · arXiv:2508.16487
Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis
Yunwei Ren, Jason Lee
NeurIPS 2025 · poster · arXiv:2410.09678
5 citations
Linear Mixture Distributionally Robust Markov Decision Processes
Zhishuai Liu, Pan Xu
NeurIPS 2025 · poster · arXiv:2505.18044
3 citations
Offline Actor-Critic for Average Reward MDPs
William Powell, Jeongyeol Kwon, Qiaomin Xie et al.
NeurIPS 2025 · poster
73 citations
On Feasible Rewards in Multi-Agent Inverse Reinforcement Learning
Till Freihaut, Giorgia Ramponi
NeurIPS 2025 · spotlight · arXiv:2411.15046
2 citations
Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits
Fan Chen, Zeyu Jia, Alexander Rakhlin et al.
NeurIPS 2025 · poster · arXiv:2505.20268
3 citations