Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms

0citations

arXiv:2510.25811 Project

citations

#2324

in NEURIPS 2025

of 5858 papers

Top Authors

Data Points

Top Authors

William Réveillard Richard Combes

Abstract

We consider a stochastic multi-armed bandit problem with i.i.d. rewards where the expected reward function is multimodal with at most $m$ modes. We propose the first known computationally tractable algorithm for computing the solution to the Graves-Lai optimization problem, which in turn enables the implementation of asymptotically optimal algorithms for this bandit problem.

Citation History

Jan 25, 2026

Jan 27, 2026

Jan 28, 2026