"self-correction mechanisms" Papers
2 papers found
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Mingyang Chen, Linzhuang Sun, Tianpeng Li et al.
NeurIPS 2025posterarXiv:2503.19470
56
citations
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
Ling Yang, Zhaochen Yu, Tianjun Zhang et al.
ICLR 2025posterarXiv:2410.09008
12
citations