ICML 2024 "instance-dependent bounds" Papers
2 papers found
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu, Idan Attias, Daniel Roy
ICML 2024posterarXiv:2407.00950
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang, Owen Oertell, Alekh Agarwal et al.
ICML 2024posterarXiv:2402.07198