by Shengcai Liu Papers
2 papers found
Conference
Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
Zhangyin Feng, Qianglong Chen, Ning Lu et al.
NEURIPS 2025arXiv:2505.11227
7
citations
Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets
Ning LU, Shengcai Liu, Jiahao Wu et al.
ICML 2025arXiv:2505.12038
13
citations