Poster by Zhengran Zeng Papers
2 papers found
Conference
Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation
Zhuohao Yu, Weizheng Gu, Yidong Wang et al.
ICML 2025arXiv:2412.15118
10
citations
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization
Yidong Wang, Zhuohao Yu, Wenjin Yao et al.
ICLR 2024arXiv:2306.05087
336
citations