"bandit feedback" Papers
10 papers found
Comparing Uniform Price and Discriminatory Multi-Unit Auctions through Regret Minimization
Marius Potfer, Vianney Perchet
NeurIPS 2025 poster arXiv:2510.19591
Efficient Online Set-valued Classification with Bandit Feedback
Zhou Wang, Xingye Qiao
ICML 2024 poster
Federated Combinatorial Multi-Agent Multi-Armed Bandits
Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal
ICML 2024 poster
Handling Heterogeneous Curvatures in Bandit LQR Control
Yu-Hu Yan, Jing Wang, Peng Zhao
ICML 2024 spotlight
On Interpolating Experts and Multi-Armed Bandits
Houshuang Chen, Yuchen He, Chihao Zhang
ICML 2024 poster
Performative Prediction with Bandit Feedback: Learning through Reparameterization
Yatong Chen, Wei Tang, Chien-Ju Ho et al.
ICML 2024 poster
Projection-Free Online Convex Optimization with Time-Varying Constraints
Dan Garber, Ben Kretzu
ICML 2024 poster
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback
Guojun Xiong, Jian Li
ICML 2024 poster
Quantum Algorithm for Online Exp-concave Optimization
Jianhao He, Chengchang Liu, Xutong Liu et al.
ICML 2024 poster
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman, Alon Cohen, Tomer Koren et al.
ICML 2024 poster