2025 "information-theoretic perspective" Papers
2 papers found
Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning
Chen Qian, Dongrui Liu, Hao Wen et al.
NEURIPS 2025arXiv:2506.02867
22
citations
Information-Theoretic Reward Decomposition for Generalizable RLHF
Liyuan Mao, Haoran Xu, Amy Zhang et al.
NEURIPS 2025posterarXiv:2504.06020
3
citations