Wei Fu
Papers: 3
Total Citations: 8
Papers (3)
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores (ICLR 2024, arXiv; 8 citations)
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study (ICML 2024, arXiv; 0 citations)
Iteratively Learn Diverse Strategies with State Distance Information (NeurIPS 2023, arXiv; 0 citations)