A Novel General Framework for Sharp Lower Bounds in Succinct Stochastic Bandits
0 Citations
#1763 of 5858 papers in NeurIPS 2025
2 Authors
4 Data Points
Abstract
Many online learning applications adopt the stochastic bandit problem with a linear reward model in which the unknown parameter exhibits a succinct structure. We study minimax regret lower bounds, which allow one to determine whether more efficient algorithms can be proposed. We introduce a general definition of succinctness and propose a novel framework for constructing minimax regret lower bounds based on an information-regret trade-off. When applied to entry-sparse vectors, our framework sharpens a recent lower bound by Hao et al. (NeurIPS 2020). We further apply our framework to derive novel results. To the best of our knowledge, we provide the first lower bounds for the group-sparse and low-rank matrix settings.
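For context, the quantity being lower-bounded is the minimax cumulative regret. A minimal sketch in standard notation (the action set \(\mathcal{A}\) and the succinct parameter class \(\Theta\) are assumptions, not taken from the paper):

```latex
% Minimax regret for a stochastic linear bandit over horizon T:
% the infimum ranges over policies \pi, the supremum over parameters
% \theta in the succinct class \Theta, and A_t is the action played
% at round t (standard definition; notation is assumed, not the paper's).
R_T^{*} \;=\; \inf_{\pi}\; \sup_{\theta \in \Theta}\;
  \mathbb{E}_{\theta}\!\left[\, \sum_{t=1}^{T}
    \Big( \max_{a \in \mathcal{A}} \langle a, \theta \rangle
      \;-\; \langle A_t, \theta \rangle \Big) \right]
```

A lower bound on \(R_T^{*}\) of the form described in the abstract certifies that no algorithm, however designed, can achieve smaller worst-case regret over the class \(\Theta\).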
Citation History
Jan 25, 2026: 0
Jan 27, 2026: 0
Jan 30, 2026: 0