2025 "reasoning benchmarks" Papers
7 papers found
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking
Heli Ben-Hamu, Itai Gat, Daniel Severo et al.
NeurIPS 2025posterarXiv:2505.24857
40
citations
HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts
Neil He, Rishabh Anand, Hiren Madhu et al.
NeurIPS 2025posterarXiv:2505.24722
8
citations
KLASS: KL-Guided Fast Inference in Masked Diffusion Models
Seo Hyun Kim, Sunwoo Hong, Hojung Jung et al.
NeurIPS 2025spotlightarXiv:2511.05664
SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought
Guanghao Li, Wenhao Jiang, Mingfeng Chen et al.
NeurIPS 2025posterarXiv:2505.24181
2
citations
SeRL: Self-play Reinforcement Learning for Large Language Models with Limited Data
Wenkai Fang, Shunyu Liu, Yang Zhou et al.
NeurIPS 2025posterarXiv:2505.20347
19
citations
SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning
Rui Pan, Yinwei Dai, Zhihao Zhang et al.
NeurIPS 2025posterarXiv:2504.07891
35
citations
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Haipeng Luo, Qingfeng Sun, Can Xu et al.
ICLR 2025posterarXiv:2308.09583
637
citations