Poster Papers by Feng Yao
2 papers found
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Jorge (Zhoujun) Cheng, Shibo Hao, Tianyang Liu et al.
NeurIPS 2025, poster, arXiv:2506.14965
35 citations
Training Language Models to Generate Quality Code with Program Analysis Feedback
Feng Yao, Zilong Wang, Liyuan Liu et al.
NeurIPS 2025, poster, arXiv:2505.22704