2025 "reward alignment" Papers
4 papers found
Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
Jing-An Sun, Hang Fan, Junchao Gong et al.
NEURIPS 2025posterarXiv:2505.22008
2
citations
Learning Preferences without Interaction for Cooperative AI: A Hybrid Offline-Online Approach
Haitong Ma, Haoran Yu, Haobo Fu et al.
NEURIPS 2025poster
Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models
Jisung Hwang, Jaihoon Kim, Minhyuk Sung
NEURIPS 2025posterarXiv:2509.07027
Unhackable Temporal Reward for Scalable Video MLLMs
En Yu, Kangheng Lin, Liang Zhao et al.
ICLR 2025oralarXiv:2502.12081
1
citations