2025 "multi-armed bandit" Papers
3 papers found
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
NEURIPS 2025posterarXiv:2509.23666
Precise Asymptotics and Refined Regret of Variance-Aware UCB
Yingying Fan, Yuxuan Han, Jinchi Lv et al.
NEURIPS 2025spotlightarXiv:2412.08843
1
citations
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
Mingqi Yuan, Bo Li, Xin Jin et al.
ICCV 2025posterarXiv:2503.06101
1
citations