α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Jiantao Jiao
Jiantao Jiao
3
Papers
95
Total Citations
Papers (3)
How to Evaluate Reward Models for RLHF
ICLR 2025
50
citations
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
ICML 2025
45
citations
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
ICML 2024
0
citations