2025 Poster "online reinforcement learning" Papers
7 papers found
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
Zhiyuan Zhou, Andy Peng, Qiyang Li et al.
ICLR 2025 · poster · arXiv:2412.07762
27 citations
Flow-Based Policy for Online Reinforcement Learning
Lei Lv, Yunfei Li, Yu Luo et al.
NeurIPS 2025 · poster · arXiv:2506.12811
9 citations
Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach
Haitong Ma, Haoran Yu, Haobo Fu et al.
NeurIPS 2025 · poster
Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits
Fan Chen, Zeyu Jia, Alexander Rakhlin et al.
NeurIPS 2025 · poster · arXiv:2505.20268
3 citations
Prioritized Generative Replay
Ren Wang, Kevin Frans, Pieter Abbeel et al.
ICLR 2025 · poster · arXiv:2410.18082
9 citations
Training Language Models to Self-Correct via Reinforcement Learning
Aviral Kumar, Vincent Zhuang, Rishabh Agarwal et al.
ICLR 2025 · poster · arXiv:2409.12917
305 citations
Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings
Hongling Zheng, Li Shen, Yong Luo et al.
NeurIPS 2025 · poster