"minimax regret" Papers
4 papers found
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu, Idan Attias, Daniel Roy
ICML 2024 poster, arXiv:2407.00950
Feel-Good Thompson Sampling for Contextual Dueling Bandits
Xuheng Li, Heyang Zhao, Quanquan Gu
ICML 2024 poster
Refining Minimax Regret for Unsupervised Environment Design
Michael Beukman, Samuel Coward, Michael Matthews et al.
ICML 2024 poster
Small-loss Adaptive Regret for Online Convex Optimization
Wenhao Yang, Wei Jiang, Yibo Wang et al.
ICML 2024 poster