Yuda Song
5
Papers
12
Total Citations
1
Affiliations
Affiliations
Carnegie Mellon University
Papers (5)
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
ICLR 2024
8
citations
Accelerating Unbiased LLM Evaluation via Synthetic Feedback
ICML 2025
3
citations
To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable RL
NeurIPS 2025
1
citations
Hybrid Reinforcement Learning from Offline Observation Alone
ICML 2024arXiv
0
citations
Rich-Observation Reinforcement Learning with Continuous Latent Dynamics
ICML 2024arXiv
0
citations