"test-time compute scaling" Papers
5 papers found
Chain-of-Retrieval Augmented Generation
Liang Wang, Haonan Chen, Nan Yang et al.
NeurIPS 2025, poster, arXiv:2501.14342
26 citations
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
Yulei Qin, Gang Li, Zongyi Li et al.
NeurIPS 2025, poster, arXiv:2506.01413
4 citations
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
Ziniu Li, Congliang Chen, Tian Xu et al.
ICLR 2025, poster, arXiv:2408.16673
33 citations
Thoughts Are All Over the Place: On the Underthinking of Long Reasoning Models
Yue Wang, Qiuzhi Liu, Jiahao Xu et al.
NeurIPS 2025, spotlight
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang, Jin Zhou, Jonathan Chang et al.
NeurIPS 2025, poster, arXiv:2505.17373
7 citations