ICML Poster "linear function approximation" Papers
2 papers found
Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization
Yihan Du, Anna Winnicki, Gal Dalal et al.
ICML 2024posterarXiv:2402.10342
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
Han Zhong, Jiachen Hu, Yecheng Xue et al.
ICML 2024posterarXiv:2302.10796