🧬Language Models

Chain-of-Thought Reasoning

Step-by-step reasoning in language models

100 papers10,952 total citations

Compare with other topics

Feb '24 — Jan '261225 papers

Top Conferences

ICLR: 38 NeurIPS: 24 AAAI: 17 CVPR: 10 ICML: 5 ICCV: 3

Top Papers

#1

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

Pan Lu, Hritik Bansal, Tony Xia et al.

LISA: Reasoning Segmentation via Large Language Model

Xin Lai, Zhuotao Tian, Yukang Chen et al.

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

Chenhao Tan, Robert Ness, Amit Sharma et al.

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Qidong Huang, Xiaoyi Dong, Pan Zhang et al.

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

Guowei Xu, Peng Jin, ZiangWu ZiangWu et al.

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

Samuel Marks, Can Rager, Eric Michaud et al.

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

yi yang, Xiaoxuan He, Hongkun Pan et al.

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Parshin Shojaee, Iman Mirzadeh, Keivan Alizadeh vahid et al.

Video-R1: Reinforcing Video Reasoning in MLLMs

Kaituo Feng, Kaixiong Gong, Bohao Li et al.

NeurIPS 2025arXiv:2503.21776

rule-based reinforcement learningmultimodal large language modelsvideo reasoningtemporal modeling+3

236

citations

#10

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Zayne Sprague, Fangcong Yin, Juan Rodriguez et al.

Listen, Think, and Understand

Yuan Gong, Hongyin Luo, Alexander Liu et al.

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Jingyi Zhang, Jiaxing Huang, Huanjin Yao et al.

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

Qingqing Zhao, Yao Lu, Moo Jin Kim et al.

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Aojun Zhou, Ke Wang, Zimu Lu et al.

Think before you speak: Training Language Models With Pause Tokens

Sachin Goyal, Ziwei Ji, Ankit Singh Rawat et al.

Advancing LLM Reasoning Generalists with Preference Trees

Lifan Yuan, Ganqu Cui, Hanbin Wang et al.

ICLR 2025arXiv:2404.02078

large language modelsreasoning taskspreference learningalignment dataset+4

179

citations

#17

Compositional Chain-of-Thought Prompting for Large Multimodal Models

Chancharik Mitra, Brandon Huang, Trevor Darrell et al.

Can Large Language Models Infer Causation from Correlation?

Zhijing Jin, Jiarui Liu, Zhiheng LYU et al.

Training Language Models to Reason Efficiently

Daman Arora, Andrea Zanette

Hypothesis Search: Inductive Reasoning with Language Models

Ruocheng Wang, Eric Zelikman, Gabriel Poesia et al.

Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

Xingxuan Li, Ruochen Zhao, Yew Ken Chia et al.

Physics of Language Models: Part 3.2, Knowledge Manipulation

Zeyuan Allen-Zhu, Yuanzhi Li

Linearity of Relation Decoding in Transformer Language Models

Evan Hernandez, Arnab Sen Sharma, Tal Haklay et al.

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Jonas Geiping, Sean McLeish, Neel Jain et al.

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Zayne Sprague, Xi Ye, Kaj Bostrom et al.

Large Language Models as Analogical Reasoners

Michihiro Yasunaga, Xinyun Chen, Yujia Li et al.

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

Chengzu Li, Wenshan Wu, Huanyu Zhang et al.

C3oT: Generating Shorter Chain-of-Thought Without Compromising Effectiveness

Yu Kang, Xianghui Sun, Liangyu Chen et al.

Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-Based Retrofitting

Xinyan Guan, Yanjiang Liu, Hongyu Lin et al.

AAAI 2024arXiv:2311.13314

knowledge graph integrationlarge language model hallucinationfactual knowledge retrievalautonomous knowledge verification+2

108

citations

#30

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Han Zhao, Min Zhang, Wei Zhao et al.

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Tian Ye, Zicheng Xu, Yuanzhi Li et al.

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Wenkai Yang, Shuming Ma, Yankai Lin et al.

NeurIPS 2025arXiv:2502.18080

test-time computechain of thoughtmathematical reasoningreasoning performance+3

96

citations

#33

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Mingjie Liu, Shizhe Diao, Ximing Lu et al.

At Which Training Stage Does Code Data Help LLMs Reasoning?

ma yingwei, Yue Liu, Yue Yu et al.

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Dongzhi Jiang, Renrui Zhang, Ziyu Guo et al.

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

Weiyun Wang Weiyun, yiming ren, Haowen Luo et al.

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Weijia Shi, Sewon Min, Maria Lomeli et al.

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

Zhen Xiang, Fengqing Jiang, Zidi Xiong et al.

AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation

Jiafei Duan, Wilbert Pumacay, Nishanth Kumar et al.

KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning

Debjyoti Mondal, Suraj Modi, Subhadarshi Panda et al.

AAAI 2024arXiv:2401.12863

chain-of-thought reasoningmultimodal reasoningknowledge graph integrationlarge language models+3

78

citations

#41

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning

Yiwei Li, Peiwen Yuan, Shaoxiong Feng et al.

Towards Foundation Models for Knowledge Graph Reasoning

Mikhail Galkin, Xinyu Yuan, Hesham Mostafa et al.

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Javier Ferrando, Oscar Obeso, Senthooran Rajamanoharan et al.

ICLR 2025arXiv:2411.14257

sparse autoencodershallucination mechanismsentity recognitionknowledge awareness+3

77

citations

#44

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Xinyu Zhu, Mengzhou Xia, Zhepei Wei et al.

NeurIPS 2025arXiv:2506.01347

reinforcement learningmathematical reasoninglanguage modelspolicy gradients+4

74

citations

#45

General-Reasoner: Advancing LLM Reasoning Across All Domains

Xueguang Ma, Qian Liu, Dongfu Jiang et al.

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Davide Paglieri, Bartłomiej Cupiał, Samuel Coward et al.

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

Shanshan Zhong, Zhongzhan Huang, Shanghua Gao et al.

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Hritik Bansal, Arian Hosseini, Rishabh Agarwal et al.

Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning

Yiming Huang, Xiao Liu, Yeyun Gong et al.

Arithmetic Without Algorithms: Language Models Solve Math with a Bag of Heuristics

Yaniv Nikankin, Anja Reusch, Aaron Mueller et al.

GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing

Rongyao Fang, Chengqi Duan, Kun Wang et al.

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons

Yuheng Chen, Pengfei Cao, Yubo Chen et al.

AAAI 2024arXiv:2308.13198

knowledge neuronsmultilingual language modelsfactual knowledge storageintegrated gradients method+4

59

citations

#53

Thinkless: LLM Learns When to Think

Gongfan Fang, Xinyin Ma, Xinchao Wang

Magnushammer: A Transformer-Based Approach to Premise Selection

Maciej Mikuła, Szymon Tworkowski, Szymon Antoniak et al.

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Mingyang Chen, Linzhuang Sun, Tianpeng Li et al.

Large Language Models Are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

Taeyoon Kwon, Kai Ong, Dongjin Kang et al.

AAAI 2024arXiv:2312.07399

clinical reasoninglarge language modelsdisease diagnosisprompt-based learning+3

57

citations

#57

TEILP: Time Prediction over Knowledge Graphs via Logical Reasoning

Siheng Xiong, Yuan Yang, Ali Payani et al.

AAAI 2024arXiv:2312.15816

temporal knowledge graphsevent time predictionlogical reasoningtemporal event knowledge graph+3

54

citations

#58

Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge

Swarnadeep Saha, Xian Li, Marjan Ghazvininejad et al.

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Ziyang Ma, Yinghao Ma, Yanqiao Zhu et al.

NeurIPS 2025arXiv:2505.13032

audio-language modelsmultimodal audio reasoningchain-of-thought rationaleaudio question answering+4

52

citations

#60

Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark

Fangjun Li, David C. Hogg, Anthony G. Cohn

AAAI 2024arXiv:2401.03991

spatial reasoninglarge language modelsbenchmark evaluationtemplate-to-relation mapping+4

51

citations

#61

Discovering and Mitigating Visual Biases through Keyword Explanation

Younghyun Kim, Sangwoo Mo, Minkyu Kim et al.

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples

Phillip Howard, Avinash Madasu, Tiep Le et al.

Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference

Aniket Vashishtha, Abbavaram Gowtham Reddy, Abhinav Kumar et al.

GRIT: Teaching MLLMs to Think with Images

Yue Fan, Xuehai He, Diji Yang et al.

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Tian Ye, Zicheng Xu, Yuanzhi Li et al.

What does the Knowledge Neuron Thesis Have to do with Knowledge?

Jingcheng Niu, Andrew Liu, Zining Zhu et al.

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

Yixuan Wu, Yizhou Wang, Shixiang Tang et al.

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Weizhe Yuan, Jane Yu, Song Jiang et al.

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Zhuoqun Li, Xuanang Chen, Haiyang Yu et al.

ICLR 2025arXiv:2410.08815

retrieval-augmented generationknowledge-intensive reasoninginformation structurizationlarge language models+3

46

citations

#70

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Junfei Wu, Jian Guan, Kaituo Feng et al.

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Andy (DiJia) Su, Hanlin Zhu, Yingchen Xu et al.

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Mehul Damani, Idan Shenfeld, Andi Peng et al.

Atom of Thoughts for Markov LLM Test-Time Scaling

Fengwei Teng, Quan Shi, Zhaoyang Yu et al.

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Xingyu Fu, Minqian Liu, Zhengyuan Yang et al.

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Yibin Wang, li zhimin, Yuhang Zang et al.

RATT: A Thought Structure for Coherent and Correct LLM Reasoning

Jinghan Zhang, Xiting Wang, Weijieying Ren et al.

Reasoning Gym: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Zafir Stojanovski, Oliver Stanley, Joe Sharratt et al.

MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation

Zhongshen Zeng, Pengguang Chen, Shu Liu et al.

Combining Induction and Transduction for Abstract Reasoning

Wen-Ding Li, Keya Hu, Carter Larsen et al.

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Zhen Zhang, Xuehai He, Weixiang Yan et al.

Causal Prompting: Debiasing Large Language Model Prompting Based on Front-Door Adjustment

Congzhi Zhang, Linhai Zhang, Jialong Wu et al.

ReMA: Learning to Meta-Think for LLMs with Multi-agent Reinforcement Learning

Ziyu Wan, Yunxiang Li, Xiaoyu Wen et al.

NeurIPS 2025arXiv:2503.09501

meta-thinkingmulti-agent reinforcement learninglarge language modelsreasoning processes+4

36

citations

#83

TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models

Ziyao Shangguan, Chuhan Li, Yuxuan Ding et al.

Making Large Language Models Better Planners with Reasoning-Decision Alignment

Zhijian Huang, Tao Tang, Shaoxiang Chen et al.

Think Only When You Need with Large Hybrid-Reasoning Models

Lingjie Jiang, Xun Wu, Shaohan Huang et al.

NeurIPS 2025arXiv:2505.14631

large reasoning modelshybrid reasoning modelsadaptive thinking selectionreinforcement learning optimization+4

35

citations

#86

Multi-Modal Latent Space Learning for Chain-of-Thought Reasoning in Language Models

Liqi He, Zuchao Li, Xiantao Cai et al.

AAAI 2024arXiv:2312.08762

chain-of-thought reasoningmulti-modal reasoninglatent space learningdiffusion processes+4

34

citations

#87

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning

Ji Qi, Ming Ding, Weihan Wang et al.

ICLR 2025arXiv:2402.04236

vision-language modelschain-of-manipulations reasoningvisual reasoningmulti-turn multi-image architecture+4

33

citations

#88

Preference Optimization for Reasoning with Pseudo Feedback

Fangkai Jiao, Geyang Guo, Xingxing Zhang et al.

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Xiangyu Zhao, Peiyuan Zhang, Kexian Tang et al.

ExACT: Language-guided Conceptual Reasoning and Uncertainty Estimation for Event-based Action Recognition and More

Jiazhou Zhou, Xu Zheng, Yuanhuiyi Lyu et al.

Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Barys Liskavets, Maxim Ushakov, Shuvendu Roy et al.

Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation

Derong Xu, Xinhang Li, Ziheng Zhang et al.

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Kunhao Zheng, Juliette Decugis, Jonas Gehring et al.

Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Xinlu Zhang, Zhiyu Zoey Chen, Xi Ye et al.

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

Zihui Cheng, Qiguang Chen, Jin Zhang et al.

Reasoning Models Better Express Their Confidence

Dongkeun Yoon, Seungone Kim, Sohee Yang et al.

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Di Zhang, Jingdi Lei, Junxian Li et al.

CVPR 2025arXiv:2411.18203

vision-language modelsmultimodal reasoningactor-critic paradigmpreference optimization+4

30

citations

#98

Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key

Zhihe Yang, Xufang Luo, Dongqi Han et al.

Unlocking Multimodal Mathematical Reasoning via Process Reward Model

Ruilin Luo, Zhuofan Zheng, Lei Wang et al.

NeurIPS 2025arXiv:2501.04686

multimodal mathematical reasoningprocess reward modelstest-time scalingmultimodal large language models+4

29

citations

#100

Chinese Spelling Correction as Rephrasing Language Model

Linfeng Liu, Hongqiu Wu, Hai Zhao

AAAI 2024arXiv:2308.08796

chinese spelling correctionsequence tagging taskrephrasing language modelzero-shot learning+2

29

citations

Chain-of-Thought Reasoning

Top Conferences

Related Topics (Language Models)

Top Papers

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

LISA: Reasoning Segmentation via Large Language Model

Causal Reasoning and Large Language Models: Opening a New Frontier for Causality

OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Video-R1: Reinforcing Video Reasoning in MLLMs

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Listen, Think, and Understand

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Think before you speak: Training Language Models With Pause Tokens

Advancing LLM Reasoning Generalists with Preference Trees

Compositional Chain-of-Thought Prompting for Large Multimodal Models

Can Large Language Models Infer Causation from Correlation?

Training Language Models to Reason Efficiently

Hypothesis Search: Inductive Reasoning with Language Models

Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources

Physics of Language Models: Part 3.2, Knowledge Manipulation

Linearity of Relation Decoding in Transformer Language Models

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Large Language Models as Analogical Reasoners

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought

C3oT: Generating Shorter Chain-of-Thought Without Compromising Effectiveness

Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-Based Retrofitting

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

At Which Training Stage Does Code Data Help LLMs Reasoning?

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

In-Context Pretraining: Language Modeling Beyond Document Boundaries

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation

KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning

Towards Foundation Models for Knowledge Graph Reasoning

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

General-Reasoner: Advancing LLM Reasoning Across All Domains

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning

Arithmetic Without Algorithms: Language Models Solve Math with a Bag of Heuristics

GoT: Unleashing Reasoning Capability of MLLM for Visual Generation and Editing

Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons

Thinkless: LLM Learns When to Think

Magnushammer: A Transformer-Based Approach to Premise Selection

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Large Language Models Are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

TEILP: Time Prediction over Knowledge Graphs via Logical Reasoning

Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark

Discovering and Mitigating Visual Biases through Keyword Explanation

SocialCounterfactuals: Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples

Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference

GRIT: Teaching MLLMs to Think with Images

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

What does the Knowledge Neuron Thesis Have to do with Knowledge?

DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation

Atom of Thoughts for Markov LLM Test-Time Scaling

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

RATT: A Thought Structure for Coherent and Correct LLM Reasoning