Wanjia Zhao
3
Papers
152
Total Citations
1
Affiliations
Affiliations
Stanford University
Papers (3)
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
ICLR 2025, arXiv
134 citations
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
NeurIPS 2025, arXiv
18 citations
Don’t Forget the Enjoin: FocalLoRA for Instruction Hierarchical Alignment in Large Language Models
NeurIPS 2025
0 citations