"regret analysis" Papers
14 papers found
MALinZero: Efficient Low-Dimensional Search for Mastering Complex Multi-Agent Planning
Sizhe Tang, Jiayu Chen, Tian Lan
NeurIPS 2025posterarXiv:2511.06142
1
citations
Online Two-Stage Submodular Maximization
Iasonas Nikolaou, Miltiadis Stouras, Stratis Ioannidis et al.
NeurIPS 2025posterarXiv:2510.19480
Pareto Optimal Risk-Agnostic Distributional Bandits with Heavy-Tail Rewards
Kyungjae Lee, Dohyeong Kim, Taehyun Cho et al.
NeurIPS 2025poster
True Impact of Cascade Length in Contextual Cascading Bandits
Hyun-jun Choi, Joongkyu Lee, Min-hwan Oh
NeurIPS 2025poster
A General Online Algorithm for Optimizing Complex Performance Metrics
Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.
ICML 2024poster
High-dimensional Linear Bandits with Knapsacks
Wanteng Ma, Dong Xia, Jiashuo Jiang
ICML 2024poster
Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu
ICML 2024poster
Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization
Kwang-Sung Jun, Jungtaek Kim
ICML 2024poster
On Multi-Armed Bandit with Impatient Arms
Yuming Shao, Zhixuan Fang
ICML 2024poster
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Misra, Aldo Pacchiano, Robert Schapire
ICML 2024poster
Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation
Tonghe Zhang, Yu Chen, Longbo Huang
ICML 2024poster
Regret Analysis of Repeated Delegated Choice
Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.
AAAI 2024paperarXiv:2310.04884
7
citations
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach
Wen Huang, Xintao Wu
AAAI 2024paperarXiv:2312.12731
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
Meshal Alharbi, Mardavij Roozbehani, Munther Dahleh
AAAI 2024paperarXiv:2312.12558