ICML "regret analysis" Papers
7 papers found
A General Online Algorithm for Optimizing Complex Performance Metrics
Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.
ICML 2024 poster
High-dimensional Linear Bandits with Knapsacks
Wanteng Ma, Dong Xia, Jiashuo Jiang
ICML 2024 poster arXiv:2311.01327
Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu
ICML 2024 poster arXiv:2405.17968
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
Kwang-Sung Jun, Jungtaek Kim
ICML 2024 poster arXiv:2402.07341
On Multi-Armed Bandit with Impatient Arms
Yuming Shao, Zhixuan Fang
ICML 2024 poster
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Misra, Aldo Pacchiano, Robert Schapire
ICML 2024 poster arXiv:2404.09123
Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation
Tonghe Zhang, Yu Chen, Longbo Huang
ICML 2024 poster arXiv:2402.18149