NeurIPS 2025 "preference learning" Papers
4 papers found
Bayesian Optimization with Preference Exploration using a Monotonic Neural Network Ensemble
Hanyang Wang, Juergen Branke, Matthias Poloczek
NeurIPS 2025posterarXiv:2501.18792
Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections
Bo Wang, Qinyuan Cheng, Runyu Peng et al.
NeurIPS 2025posterarXiv:2507.00018
14
citations
Preference Learning with Response Time: Robust Losses and Guarantees
Ayush Sawarni, Sahasrajit Sarmasarkar, Vasilis Syrgkanis
NeurIPS 2025oralarXiv:2505.22820
1
citations
Self-Refining Language Model Anonymizers via Adversarial Distillation
Kyuyoung Kim, Hyunjun Jeon, Jinwoo Shin
NeurIPS 2025posterarXiv:2506.01420
3
citations