Poster "regret upper bounds" Papers
2 papers found
Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits
Kyoungseok Jang, Chicheng Zhang, Kwang-Sung Jun
ICML 2024posterarXiv:2402.11156
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen, XiangCheng Zhang, Siwei Wang et al.
ICML 2024poster