α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zhiyu Mei
Zhiyu Mei
3
Papers
103
Total Citations
Papers (3)
AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
NeurIPS 2025
arXiv
95
citations
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
ICLR 2024
arXiv
8
citations
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
ICML 2024
arXiv
0
citations