Poster "reasoning llms" Papers
2 papers found
Conference
LIFEBENCH: Evaluating Length Instruction Following in Large Language Models
Wei Zhang, Zhenhong Zhou, Kun Wang et al.
NEURIPS 2025posterarXiv:2505.16234
1
citations
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Bingchen Zhao, Despoina Magka, Minqi Jiang et al.
NEURIPS 2025posterarXiv:2506.22419
2
citations