Vincent Zhuang

3

Papers

351

Total Citations

Papers (3)

Training Language Models to Self-Correct via Reinforcement Learning

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning