2025 Poster "multi-armed bandit" Papers
2 papers found
Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
NEURIPS 2025posterarXiv:2509.23666
ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
Mingqi Yuan, Bo Li, Xin Jin et al.
ICCV 2025posterarXiv:2503.06101
1
citations