2025 "preference alignment" Papers
7 papers found
Advantage-Guided Distillation for Preference Alignment in Small Language Models
Shiping Gao, Fanqi Wan, Jiajian Guo et al.
ICLR 2025posterarXiv:2502.17927
4
citations
Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Model Using Implicit Feedback from Pre-training Demonstrations
Thomas Tian, Kratarth Goel
ICLR 2025posterarXiv:2503.20105
4
citations
More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness
Aaron J. Li, Satyapriya Krishna, Hima Lakkaraju
ICLR 2025posterarXiv:2404.18870
10
citations
Multi-domain Distribution Learning for De Novo Drug Design
Arne Schneuing, Ilia Igashov, Adrian Dobbelstein et al.
ICLR 2025posterarXiv:2508.17815
11
citations
No Preference Left Behind: Group Distributional Preference Optimization
Binwei Yao, Zefan Cai, Yun-Shiuan Chuang et al.
ICLR 2025posterarXiv:2412.20299
17
citations
PersonalLLM: Tailoring LLMs to Individual Preferences
Thomas Zollo, Andrew Siah, Naimeng Ye et al.
ICLR 2025posterarXiv:2409.20296
27
citations
Uncertainty-aware Preference Alignment for Diffusion Policies
Runqing Miao, Sheng Xu, Runyi Zhao et al.
NeurIPS 2025poster