Aleksandra Faust
Papers: 5
Total Citations: 369
Papers (5)
Training Language Models to Self-Correct via Reinforcement Learning
ICLR 2025, arXiv
305 citations
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
ICLR 2025, arXiv
43 citations
ElasticTok: Adaptive Tokenization for Image and Video
ICLR 2025, arXiv
21 citations
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
ICML 2024
0 citations
Position: Levels of AGI for Operationalizing Progress on the Path to AGI
ICML 2024, arXiv
0 citations