Poster "online policy optimization" Papers
2 papers found
Maximizing the Value of Predictions in Control: Accuracy Is Not Enough
Yiheng Lin, Christopher Yeh, Zaiwei Chen et al.
NeurIPS 2025, poster, arXiv:2506.04497
Human Alignment of Large Language Models through Online Preference Optimisation
Daniele Calandriello, Zhaohan Guo, Rémi Munos et al.
ICML 2024, poster, arXiv:2403.08635