"dynamic regret" Papers
6 papers found
Constrained Feedback Learning for Non-Stationary Multi-Armed Bandits
Shaoang Li, Jian Li
NeurIPS 2025 poster arXiv:2509.15073
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
ICLR 2025 poster arXiv:2403.12950
2 citations
Non-stationary Online Convex Optimization with Arbitrary Delays
Yuanyu Wan, Chang Yao, Mingli Song et al.
ICML 2024 poster
Online Linear Regression in Dynamic Environments via Discounting
Andrew Jacobsen, Ashok Cutkosky
ICML 2024 poster
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee, Ming Jin, Javad Lavaei et al.
ICML 2024 oral
Percentile Risk-Constrained Budget Pacing for Guaranteed Display Advertising in Online Optimization
Liang Dai, Kejie Lyu, Chengcheng Zhang et al.
AAAI 2024 paper arXiv:2312.06174
1 citation