Paper by Barbara E Engelhardt Papers
2 papers found
Conference
Sample Efficient Preference Alignment in LLMs via Active Exploration
Viraj Mehta, Syrine Belakaria, Vikramjeet Das et al.
COLM 2025paperarXiv:2312.00267
10
citations
Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF
Syrine Belakaria, Joshua Kazdan, Charles Marx et al.
COLM 2025paperarXiv:2503.22137
2
citations