Poster "regret bounds" Papers

12 papers found

Filters:poster regret bounds Clear all

Conference

AAAI 2025 (3,028)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NeurIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,140)oral (1,594)spotlight (1,421)highlight (975)

Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds

Hao Liang, Zhiquan Luo

NeurIPS 2025posterarXiv:2210.14051

Contextual Thompson Sampling via Generation of Missing Data

Kelly W Zhang, Tianhui Cai, Hongseok Namkoong et al.

NeurIPS 2025posterarXiv:2502.07064

Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions

Marc Brooks, Gabriel Durham, Kihyuk Hong et al.

NeurIPS 2025posterarXiv:2505.16311

Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards

Artin Tajdini, Jonathan Scarlett, Kevin Jamieson

NeurIPS 2025posterarXiv:2506.04775

Learning Across the Gap: Hybrid Multi-armed Bandits with Heterogeneous Offline and Online Data

Qijia He, Minghan Wang, Xutong Liu et al.

NeurIPS 2025poster

Prediction with expert advice under additive noise

Alankrita Bhatt, Victoria Kostina

NeurIPS 2025poster

Robust Satisficing Gaussian Process Bandits Under Adversarial Attacks

Artun Saday, Yaşar Cahit Yıldırım, Cem Tekin

NeurIPS 2025posterarXiv:2506.01625

Statistical Parity with Exponential Weights

Stephen Pasteris, Chris Hicks, Vasilios Mavroudis

NeurIPS 2025poster

Uniform Wrappers: Bridging Concave to Quadratizable Functions in Online Optimization

Mohammad Pedramfar, Christopher Quinn, Vaneet Aggarwal

NeurIPS 2025poster

$\mathtt{VITS}$ : Variational Inference Thompson Sampling for contextual bandits

Pierre Clavier, Tom Huix, Alain Oliviero Durmus

ICML 2024poster

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond

Xutong Liu, Siwei Wang, Jinhang Zuo et al.

ICML 2024poster

Reinforcement Learning and Regret Bounds for Admission Control

Lucas Weber, Ana Busic, Jiamin ZHU

ICML 2024poster