Poster "best-of-both-worlds algorithms" Papers
2 papers found
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
Shinji Ito, Kevin Jamieson, Haipeng Luo et al.
NEURIPS 2025posterarXiv:2510.17103
1
citations
Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring
Taira Tsuchiya, Shinji Ito, Junya Honda
ICML 2024posterarXiv:2402.08321