2025 "proper scoring rules" Papers
2 papers found
ConfTuner: Training Large Language Models to Express Their Confidence Verbally
Yibo Li, Miao Xiong, Jiaying Wu et al.
NeurIPS 2025posterarXiv:2508.18847
10
citations
Consistency Checks for Language Model Forecasters
Daniel Paleka, Abhimanyu Pallavi Sudhir, Alejandro Alvarez et al.
ICLR 2025posterarXiv:2412.18544
10
citations