"online exploration" Papers
3 papers found
Self-Improvement in Language Models: The Sharpening Mechanism
Audrey Huang, Adam Block, Dylan Foster et al.
ICLR 2025posterarXiv:2412.01951
55
citations
Meta-Reinforcement Learning Robust to Distributional Shift Via Performing Lifelong In-Context Learning
TengYe Xu, Zihao Li, Qinyuan Ren
ICML 2024poster
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
Han Zhong, Jiachen Hu, Yecheng Xue et al.
ICML 2024poster