2024 "regret minimization" Papers
24 papers found
Best of Both Worlds Guarantees for Smoothed Online Quadratic Optimization
Neelkamal Bhuyan, Debankur Mukherjee, Adam Wierman
Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits
Nikolai Karpov, Qin Zhang
Decoupling Learning and Decision-Making: Breaking the $\mathcal{O}(\sqrt{T})$ Barrier in Online Resource Allocation with First-Order Methods
Wenzhi Gao, Chunlin Sun, Chenyu Xue et al.
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy, Asaf Cassel, Alon Cohen et al.
Equilibrium of Data Markets with Externality
Safwan Hossain, Yiling Chen
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin, Hao-Lun Hsu, William Chang et al.
Graph-Triggered Rising Bandits
Gianmarco Genalti, Marco Mussi, Nicola Gatti et al.
Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness Requirements
Naman Agarwal, Satyen Kale, Karan Singh et al.
Incentivized Learning in Principal-Agent Bandit Games
Antoine Scheid, Daniil Tiapkin, Etienne Boursier et al.
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery
Yassir Jedra, William Réveillard, Stefan Stojanovic et al.
Monotone Individual Fairness
Yahav Bechavod
Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning
Joon Suk Huh, Kirthevasan Kandasamy
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf Cassel, Haipeng Luo, Aviv Rosenberg et al.
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao, Yu-Xiang Wang
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran, Alberto Maria Metelli, Matteo Papini et al.
Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints
Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.
Online Learning with Bounded Recall
Jon Schneider, Kiran Vodrahalli
Online Matrix Completion: A Collaborative Approach with Hott Items
Dheeraj Baby, Soumyabrata Pal
Pricing with Contextual Elasticity and Heteroscedastic Valuation
Jianyu Xu, Yu-Xiang Wang
Projection-Free Online Convex Optimization with Time-Varying Constraints
Dan Garber, Ben Kretzu
Prospective Side Information for Latent MDPs
Jeongyeol Kwon, Yonathan Efroni, Shie Mannor et al.
Quantum Algorithm for Online Exp-concave Optimization
Jianhao He, Chengchang Liu, Xutong Liu et al.
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman, Alon Cohen, Tomer Koren et al.
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti, Aviv Tamar