Poster "offline alignment" Papers
2 papers found
Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions
Simon Matrenok, Skander Moalla, Caglar Gulcehre
NEURIPS 2025posterarXiv:2507.08068
Generalized Preference Optimization: A Unified Approach to Offline Alignment
Yunhao Tang, Zhaohan Guo, Zeyu Zheng et al.
ICML 2024posterarXiv:2402.05749