"multi-armed bandit" Papers
3 papers found
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
NeurIPS 2025posterarXiv:2509.23666
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin, Hao-Lun Hsu, William Chang et al.
AAAI 2024paperarXiv:2312.15549
On Multi-Armed Bandit with Impatient Arms
Yuming Shao, Zhixuan Fang
ICML 2024poster