NEURIPS Poster "reasoning capabilities" Papers
11 papers found
Activation-Guided Consensus Merging for Large Language Models
Yuxuan Yao, Shuqi LIU, Zehua Liu et al.
NEURIPS 2025posterarXiv:2505.14009
1
citations
Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms
Mingjie Li, Wai Man Si, Michael Backes et al.
NEURIPS 2025poster
1
citations
General-Reasoner: Advancing LLM Reasoning Across All Domains
Xueguang Ma, Qian Liu, Dongfu Jiang et al.
NEURIPS 2025posterarXiv:2505.14652
81
citations
GRIP: A Graph-Based Reasoning Instruction Producer
Jiankang Wang, Jianjun Xu, Xiaorui Wang et al.
NEURIPS 2025posterarXiv:2412.08864
2
citations
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Mingjie Liu, Shizhe Diao, Ximing Lu et al.
NEURIPS 2025posterarXiv:2505.24864
101
citations
RAST: Reasoning Activation in LLMs via Small-model Transfer
Siru Ouyang, Xinyu Zhu, Zilin Xiao et al.
NEURIPS 2025posterarXiv:2506.15710
1
citations
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
Yiran Guo, Lijie Xu, Jie Liu et al.
NEURIPS 2025posterarXiv:2505.23564
15
citations
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
Junteng Liu, Yuanxiang Fan, Jiang Zhuo et al.
NEURIPS 2025posterarXiv:2505.19641
22
citations
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Parshin Shojaee, Iman Mirzadeh, Keivan Alizadeh vahid et al.
NEURIPS 2025posterarXiv:2506.06941
271
citations
Thinker: Learning to Think Fast and Slow
Stephen Chung, Wenyu Du, Jie Fu
NEURIPS 2025posterarXiv:2505.21097
7
citations
When Can Model-Free Reinforcement Learning be Enough for Thinking?
Josiah Hanna, Nicholas Corrado
NEURIPS 2025posterarXiv:2506.17124