"human feedback learning" Papers
2 papers found
Efficient Preference-Based Reinforcement Learning: Randomized Exploration meets Experimental Design
Andreas Schlaginhaufen, Reda Ouhamma, Maryam Kamgarpour
NeurIPS 2025posterarXiv:2506.09508
1
citations
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
Heewoong Choi, Sangwon Jung, Hongjoon Ahn et al.
ICML 2024poster