2025 Poster "coding tasks" Papers
2 papers found
HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages
Zhilin Wang, Jiaqi Zeng, Olivier Delalleau et al.
NEURIPS 2025posterarXiv:2505.11475
31
citations
Preference Optimization for Reasoning with Pseudo Feedback
Fangkai Jiao, Geyang Guo, Xingxing Zhang et al.
ICLR 2025posterarXiv:2411.16345
34
citations