NeurIPS 2025 "llm alignment" Papers
3 papers found
Avoiding exp(R) scaling in RLHF through Preference-based Exploration
Mingyu Chen, Yiding Chen, Wen Sun et al.
NeurIPS 2025poster
3
citations
Fixing It in Post: A Comparative Study of LLM Post-Training Data Quality and Model Performance
Aladin Djuhera, Swanand Kadhe, Syed Zawad et al.
NeurIPS 2025spotlightarXiv:2506.06522
Meta-Learning Objectives for Preference Optimization
Carlo Alfano, Silvia Sapora, Jakob Foerster et al.
NeurIPS 2025posterarXiv:2411.06568
2
citations