by Charlie Snell Papers
4 篇论文
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Marwa Abdulhai, Isadora White, Charlie Snell et al.
ICML 2025oral
63
citations
Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Parameters for Reasoning
Charlie Snell, Jaehoon Lee, Kelvin Xu et al.
ICLR 2025poster
Value-Based Deep RL Scales Predictably
Oleh Rybkin, Michal Nauman, Preston Fu et al.
ICML 2025poster
The False Promise of Imitating Proprietary Language Models
Arnav Gudibande, Eric Wallace, Charlie Snell et al.
ICLR 2024spotlight