Poster "minimax regret" Papers
4 papers found
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu, Idan Attias, Daniel Roy
ICML 2024posterarXiv:2407.00950
Feel-Good Thompson Sampling for Contextual Dueling Bandits
Xuheng Li, Heyang Zhao, Quanquan Gu
ICML 2024posterarXiv:2404.06013
Refining Minimax Regret for Unsupervised Environment Design
Michael Beukman, Samuel Coward, Michael Matthews et al.
ICML 2024posterarXiv:2402.12284
Small-loss Adaptive Regret for Online Convex Optimization
Wenhao Yang, Wei Jiang, Yibo Wang et al.
ICML 2024poster