"reasoning llms" Papers
3 papers found
Conference
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Pranjal Aggarwal, Sean Welleck
COLM 2025paperarXiv:2503.04697
247
citations
LIFEBENCH: Evaluating Length Instruction Following in Large Language Models
Wei Zhang, Zhenhong Zhou, Kun Wang et al.
NEURIPS 2025posterarXiv:2505.16234
1
citations
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Bingchen Zhao, Despoina Magka, Minqi Jiang et al.
NEURIPS 2025posterarXiv:2506.22419
2
citations