Vincent Zhuang
3
Papers
351
Total Citations
Papers (3)
Training Language Models to Self-Correct via Reinforcement Learning
ICLR 2025arXiv
305
citations
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
ICLR 2025
43
citations
Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning
ICLR 2025
3
citations