2025 "preference modeling" Papers
7 papers found
Beyond Scalar Rewards: An Axiomatic Framework for Lexicographic MDPs
Mehran Shakerinava, Siamak Ravanbakhsh, Adam Oberman
NeurIPS 2025, spotlight, arXiv:2505.12049
Efficient and Near-Optimal Algorithm for Contextual Dueling Bandits with Offline Regression Oracles
Aadirupa Saha, Robert Schapire
NeurIPS 2025, poster
Generalized Top-k Mallows Model for Ranked Choices
Shahrzad Haddadan, Sara Ahmadian
NeurIPS 2025, spotlight, arXiv:2510.22040
Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning
Yuheng Zhang, Dian Yu, Baolin Peng et al.
ICLR 2025, poster, arXiv:2407.00617, 31 citations
Logic-Logit: A Logic-Based Approach to Choice Modeling
Shuhan Zhang, Wendi Ren, Shuang Li
ICLR 2025, poster
Pairwise Calibrated Rewards for Pluralistic Alignment
Daniel Halpern, Evi Micha, Ariel Procaccia et al.
NeurIPS 2025, poster, arXiv:2506.06298
Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Ángela López-Cardona, Carlos Segura, Alexandros Karatzoglou et al.
ICLR 2025, poster, arXiv:2410.01532, 8 citations