Shizhe Diao
3
Papers
121
Total Citations
Papers (3)
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
NeurIPS 2025
96
citations
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models
ICML 2025
25
citations
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts
ICCV 2023
0
citations