NeurIPS "reinforcement learning alignment" Papers
3 papers found
Improving Video Generation with Human Feedback
Jie Liu, Gongye Liu, Jiajun Liang et al.
NeurIPS 2025, poster, arXiv:2501.13918
112 citations
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Qingming LIU, Zhen Liu, Dinghuai Zhang et al.
NeurIPS 2025, poster, arXiv:2506.15684
2 citations
PurpCode: Reasoning for Safer Code Generation
Jiawei Liu, Nirav Diwan, Zhe Wang et al.
NeurIPS 2025, poster, arXiv:2507.19060
7 citations