Lei Ying
Papers: 3
Total Citations: 9
Papers (3)
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
ICLR 2025
9 citations
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
NeurIPS 2025 (arXiv)
0 citations
Graph Mixup on Approximate Gromov–Wasserstein Geodesics
ICML 2024
0 citations