"reasoning" Papers
14 papers found
Conference
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert, Hardik Bhatnagar, Vishaal Udandarao et al.
COLM 2025paperarXiv:2504.07086
70
citations
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining
Rosie Zhao, Alexandru Meterez, Sham M. Kakade et al.
COLM 2025paperarXiv:2504.07912
87
citations
Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection
Kabir Ahuja, Melanie Sclar, Yulia Tsvetkov
COLM 2025paperarXiv:2504.11900
15
citations
From Next-Token to Mathematics: The Learning Dynamics of Mathematical Reasoning in Language Models
Shubhra Mishra, Gabriel Poesia, Noah Goodman
COLM 2025paperarXiv:2407.00900
4
citations
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
Hyunwoo Kim, Melanie Sclar, Tan Zhi-Xuan et al.
COLM 2025paperarXiv:2502.11881
12
citations
Learning Adaptive Parallel Reasoning with Language Models
Jiayi Pan, Xiuyu Li, Long Lian et al.
COLM 2025paperarXiv:2504.15466
49
citations
Learning to Reason for Long-Form Story Generation
Alexander Gurung, Mirella Lapata
COLM 2025paper
19
citations
MALT: Improving Reasoning with Multi-Agent LLM Training
Sumeet Ramesh Motwani, Chandler Smith, Rocktim Jyoti Das et al.
COLM 2025paperarXiv:2412.01928
37
citations
Partial Perspectives: How LLMs Handle Logically Inconsistent Knowledge in Reasoning Tasks
Zichao Li, Ines Arous, Jackie CK Cheung
COLM 2025paper
ReasonIR: Training Retrievers for Reasoning Tasks
Rulin Shao, Rui Qiao, Varsha Kishore et al.
COLM 2025paperarXiv:2504.20595
44
citations
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Bowen Jin, Hansi Zeng, Zhenrui Yue et al.
COLM 2025paperarXiv:2503.09516
685
citations
Self-Steering Language Models
Gabriel Grand, Joshua B. Tenenbaum, Vikash Mansinghka et al.
COLM 2025paperarXiv:2504.07081
6
citations
Training Large Language Models to Reason in a Continuous Latent Space
Shibo Hao, Sainbayar Sukhbaatar, DiJia Su et al.
COLM 2025paperarXiv:2412.06769
349
citations
Weight ensembling improves reasoning in language models
Xingyu Dang, Christina Baek, Kaiyue Wen et al.
COLM 2025paperarXiv:2504.10478
25
citations