"test-time compute" Papers
2 篇论文
Interpreting Emergent Planning in Model-Free Reinforcement Learning
Thomas Bush, Stephen Chung, Usman Anwar et al.
ICLR 2025posterarXiv:1901.03559
124
citations
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning
Wenkai Yang, Shuming Ma, Yankai Lin et al.
NeurIPS 2025posterarXiv:2502.18080
96
citations