"process reward models" Papers
5 papers found
DreamPRM: Domain-reweighted Process Reward Model for Multimodal Reasoning
Qi Cao, Ruiyi Wang, Ruiyi Zhang et al.
NeurIPS 2025 (poster), arXiv:2505.20241
5 citations
Know What You Don't Know: Uncertainty Calibration of Process Reward Models
Young-Jin Park, Kristjan Greenewald, Kaveh Alimohammadi et al.
NeurIPS 2025 (poster), arXiv:2506.09338
4 citations
Unlocking Multimodal Mathematical Reasoning via Process Reward Model
Ruilin Luo, Zhuofan Zheng, Lei Wang et al.
NeurIPS 2025 (poster), arXiv:2501.04686
29 citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang, Jin Zhou, Jonathan Chang et al.
NeurIPS 2025 (poster), arXiv:2505.17373
7 citations
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents
Hyungjoo Chae, Seonghwan Kim, Junhee Cho et al.
NeurIPS 2025 (spotlight), arXiv:2505.15277
8 citations