"regret minimization" Papers
39 papers found
Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits
Zihan Zhang, Xiangyang Ji, Yuan Zhou
An Online Learning Theory of Trading-Volume Maximization
Tommaso Cesari, Roberto Colomboni
Comparator-Adaptive $\Phi$-Regret: Improved Bounds, Simpler Algorithms, and Applications to Games
Soumita Hait, Ping Li, Haipeng Luo et al.
Comparing Uniform Price and Discriminatory Multi-Unit Auctions through Regret Minimization
Marius Potfer, Vianney Perchet
Feature-Based Online Bilateral Trade
Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.
Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality
Junyan Liu, Ziyun Chen, Kun Wang et al.
Learning from Imperfect Human Feedback: A Tale from Corruption-Robust Dueling
Yuwei Cheng, Fan Yao, Xuefeng Liu et al.
Markov Persuasion Processes: Learning to Persuade From Scratch
Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni et al.
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
Zixian Yang, Sushil Varma, Lei Ying
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Optimal Regret of Bandits under Differential Privacy
Achraf Azize, Yulian Wu, Junya Honda et al.
Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation
Toshinori Kitamura, Arnob Ghosh, Tadashi Kozuno et al.
Regretful Decisions under Label Noise
Sujay Nagaraj, Yang Liu, Flavio Calmon et al.
REINFORCEMENT LEARNING FOR INDIVIDUAL OPTIMAL POLICY FROM HETEROGENEOUS DATA
Rui Miao, Babak Shahbaba, Annie Qu
Stable Matching with Ties: Approximation Ratios and Learning
Shiyun Lin, Simon Mauras, Nadav Merlis et al.
Best of Both Worlds Guarantees for Smoothed Online Quadratic Optimization
Neelkamal Bhuyan, Debankur Mukherjee, Adam Wierman
Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits
Nikolai Karpov, Qin Zhang
Decoupling Learning and Decision-Making: Breaking the $\mathcal{O}(\sqrt{T})$ Barrier in Online Resource Allocation with First-Order Methods
Wenzhi Gao, Chunlin Sun, Chenyu Xue et al.
Eluder-based Regret for Stochastic Contextual MDPs
Orin Levy, Asaf Cassel, Alon Cohen et al.
Equilibrium of Data Markets with Externality
Safwan Hossain, Yiling Chen
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs
Tianyuan Jin, Hao-Lun Hsu, William Chang et al.
Graph-Triggered Rising Bandits
Gianmarco Genalti, Marco Mussi, Nicola Gatti et al.
Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness Requirements
Naman Agarwal, Satyen Kale, Karan Singh et al.
Incentivized Learning in Principal-Agent Bandit Games
Antoine Scheid, Daniil Tiapkin, Etienne Boursier et al.
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery
Yassir Jedra, William Réveillard, Stefan Stojanovic et al.
Monotone Individual Fairness
Yahav Bechavod
Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning
Joon Suk Huh, Kirthevasan Kandasamy
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Asaf Cassel, Haipeng Luo, Aviv Rosenberg et al.
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao, Yu-Xiang Wang
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran, Alberto Maria Metelli, Matteo Papini et al.
Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints
Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.
Online Learning with Bounded Recall
Jon Schneider, Kiran Vodrahalli
Online Matrix Completion: A Collaborative Approach with Hott Items
Dheeraj Baby, Soumyabrata Pal
Pricing with Contextual Elasticity and Heteroscedastic Valuation
Jianyu Xu, Yu-Xiang Wang
Projection-Free Online Convex Optimization with Time-Varying Constraints
Dan Garber, Ben Kretzu
Prospective Side Information for Latent MDPs
Jeongyeol Kwon, Yonathan Efroni, Shie Mannor et al.
Quantum Algorithm for Online Exp-concave Optimization
Jianhao He, Chengchang Liu, Xutong Liu et al.
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Uri Sherman, Alon Cohen, Tomer Koren et al.
Test-Time Regret Minimization in Meta Reinforcement Learning
Mirco Mutti, Aviv Tamar