"reward finetuning" Papers
2 papers found
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Zhen Liu, Tim Xiao, Weiyang Liu et al.
ICLR 2025, poster, arXiv:2412.07775
19 citations
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Qingming LIU, Zhen Liu, Dinghuai Zhang et al.
NeurIPS 2025, poster, arXiv:2506.15684
2 citations