α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Qining Zhang
Qining Zhang
1
Papers
9
Total Citations
Papers (1)
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
ICLR 2025
9
citations