🧬Robustness

Model Calibration

Calibrating confidence estimates

100 papers, 236 total citations
Mar '24 to Feb '26: 85 papers
Also includes: model calibration, calibration, confidence calibration, temperature scaling

Top Papers

#1

Calibrating Large Language Models with Sample Consistency

Qing Lyu, Kumar Shridhar, Chaitanya Malaviya et al.

AAAI 2025, arXiv:2402.13904
48
citations
#2

Smooth ECE: Principled Reliability Diagrams via Kernel Smoothing

Jaroslaw Blasiok, Preetum Nakkiran

ICLR 2024
46
citations
#3

Reasoning Models Better Express Their Confidence

Dongkeun Yoon, Seungone Kim, Sohee Yang et al.

NeurIPS 2025, arXiv:2505.14489
confidence calibration, chain-of-thought reasoning, large language models, slow thinking behaviors, +2 more
30
citations
#4

Addressing Misspecification in Simulation-based Inference through Data-driven Calibration

Antoine Wehenkel, Juan L. Gamella, Ozan Sener et al.

ICML 2025, arXiv:2405.08719
23
citations
#5

Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models

Gianni Franchi, Olivier Laurent, Maxence Leguéry et al.

CVPR 2024, arXiv:2312.15297
15
citations
#6

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Yibo Li, Miao Xiong, Jiaying Wu et al.

NeurIPS 2025, arXiv:2508.18847
confidence calibration, verbalized uncertainty, large language models, proper scoring rules, +4 more
10
citations
#7

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025, arXiv:2402.05806
9
citations
#8

Simultaneous Swap Regret Minimization via KL-Calibration

Haipeng Luo, Spandan Senapati, Vatsal Sharan

NeurIPS 2025, arXiv:2502.16387
swap regret minimization, kl-calibration, calibration measures, proper loss functions, +3 more
6
citations
#9

Epistemic Uncertainty Quantification For Pre-Trained Neural Networks

Hanjing Wang, Qiang Ji

CVPR 2024, arXiv:2404.10124
6
citations
#10

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025, arXiv:2410.19796
5
citations
#11

QA-Calibration of Language Model Confidence Scores

Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.

ICLR 2025, arXiv:2410.06615
5
citations
#12

AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Javier Tirado-Garín, Javier Civera

ICCV 2025, arXiv:2503.12701
camera calibration, single-view calibration, model-agnostic calibration, intrinsic parameter estimation, +4 more
5
citations
#13

Generalized Venn and Venn-Abers Calibration with Applications in Conformal Prediction

Lars van der Laan, Ahmed Alaa

ICML 2025, arXiv:2502.05676
4
citations
#14

Unlocking the Potential of Model Calibration in Federated Learning

Yun-Wei Chu, Dong-Jun Han, Seyyedali Hosseinalipour et al.

ICLR 2025
4
citations
#15

Towards Calibrated Deep Clustering Network

Yuheng Jia, Jianhong Cheng, Hui Liu et al.

ICLR 2025, arXiv:2403.02998
3
citations
#16

Calibrating LLMs with Information-Theoretic Evidential Deep Learning

Yawei Li, David Rügamer, Bernd Bischl et al.

ICLR 2025, arXiv:2502.06351
3
citations
#17

Discretization-free Multicalibration through Loss Minimization over Tree Ensembles

Hongyi Henry Jin, Zijun Ding, Dung Daniel Ngo et al.

NeurIPS 2025
2
citations
#18

Calibrated Language Models and How to Find Them with Label Smoothing

Jerry Huang, Peng Lu, Qiuhao Zeng

ICML 2025, arXiv:2508.00264
2
citations
#19

Beyond One-Hot Labels: Semantic Mixing for Model Calibration

Haoyang Luo, Linwei Tao, Minjing Dong et al.

ICML 2025, arXiv:2504.13548
2
citations
#20

Leveraging Uncertainty Estimates To Improve Classifier Performance

Gundeep Arora, Srujana Merugu, Anoop Saladi et al.

ICLR 2024, arXiv:2311.11723
1
citations
#21

Model Uncertainty Quantification by Conformal Prediction in Continual Learning

Rui Gao, Weiwei Liu

ICML 2025
1
citations
#22

Combining Priors with Experience: Confidence Calibration Based on Binomial Process Modeling

Jinzong Dong, Zhaohui Jiang, Dong Pan et al.

AAAI 2025, arXiv:2412.10658
1
citations
#23

Real-Time Calibration Model for Low-Cost Sensor in Fine-Grained Time Series

Seokho Ahn, Hyungjin Kim, Sungbok Shin et al.

AAAI 2025, arXiv:2412.20170
1
citations
#24

Modeling Stereo-Confidence out of the End-to-End Stereo-Matching Network via Disparity Plane Sweep

Jae Young Lee, Woonghyun Ka, Jaehyun Choi et al.

AAAI 2024, arXiv:2401.12001
stereo confidence, disparity plane sweep, stereo matching networks, cost volume, +3 more
1
citations
#25

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Kejia Zhang, Juanjuan Weng, Zhiming Luo et al.

ICCV 2025, arXiv:2408.06079
1
citations
#26

Multivariate Latent Recalibration for Conditional Normalizing Flows

Victor Dheur, Souhaib Ben Taieb

NeurIPS 2025, arXiv:2505.16636
conditional normalizing flows, multivariate density estimation, model recalibration, latent calibration, +4 more
1
citations
#27

Quantifying Uncertainty in the Presence of Distribution Shifts

Yuli Slavutsky, David Blei

NeurIPS 2025, arXiv:2506.18283
uncertainty estimation, covariate shift, bayesian framework, adaptive prior, +3 more
1
citations
#28

Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

Beier Luo, Shuoyuan Wang, Sharon Li et al.

NeurIPS 2025
not collected
#29

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

Feichen Gan, Lu Youcun, Yingying Zhang et al.

NeurIPS 2025, arXiv:2510.26026
conformal prediction, policy evaluation, reinforcement learning, uncertainty quantification, +4 more
not collected
#30

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Jixuan Leng, Chengsong Huang, Banghua Zhu et al.

ICLR 2025
not collected
#31

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025, arXiv:2410.04315
certainty calibration, linguistic expressions, uncertainty distributions, post-hoc calibration, +3 more
not collected
#32

Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity

Victor Li, Baiting Chen, Yuzhen Mao et al.

NeurIPS 2025, arXiv:2505.24097
risk control, performative predictions, model calibration, strategic manipulation, +3 more
not collected
#33

Improving Perturbation-based Explanations by Understanding the Role of Uncertainty Calibration

Thomas Decker, Volker Tresp, Florian Buettner

NeurIPS 2025, arXiv:2511.10439
not collected
#34

SteerConf: Steering LLMs for Confidence Elicitation

Ziang Zhou, Tianyuan Jin, Jieming Shi et al.

NeurIPS 2025, arXiv:2503.02863
confidence elicitation, model calibration, steering prompt strategy, confidence consistency, +3 more
not collected
#35

On Calibration of LLM-based Guard Models for Reliable Content Moderation

Hongfu Liu, Hengguan Huang, Xiangming Gu et al.

ICLR 2025
not collected
#36

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity

Zhengping Jiang, Anqi Liu, Ben Van Durme

NeurIPS 2025, arXiv:2502.19110
linguistic calibration, uncertainty quantification, conformal prediction, answer set prediction, +3 more
not collected
#37

Understanding Model Calibration - A gentle introduction and visual exploration of calibration and the expected calibration error (ECE)

Maja Pavlovic

ICLR 2025
not collected
#38

Know What You Don't Know: Uncertainty Calibration of Process Reward Models

Young-Jin Park, Kristjan Greenewald, Kaveh Alimohammadi et al.

NeurIPS 2025, arXiv:2506.09338
uncertainty calibration, process reward models, quantile regression, instance-adaptive scaling, +4 more
not collected
#39

On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines

Selim Kuzucu, Kemal Oksuz, Jonathan Sadeghi et al.

ECCV 2024
not collected
#40

Provable Uncertainty Decomposition via Higher-Order Calibration

Gustaf Ahdritz, Aravind Gollakota, Parikshit Gopalan et al.

ICLR 2025
not collected
#41

Aligning Evaluation with Clinical Priorities: Calibration, Label Shift, and Error Costs

Gerardo Flores, Alyssa H. Smith, Julia Fukuyama et al.

NeurIPS 2025
not collected
#42

Quantifying Uncertainty in Error Consistency: Towards Reliable Behavioral Comparison of Classifiers

Thomas Klein, Sascha Meyen, Wieland Brendel et al.

NeurIPS 2025
not collected
#43

Approximating Full Conformal Prediction for Neural Network Regression with Gauss-Newton Influence

Dharmesh Tailor, Alvaro Correia, Eric Nalisnick et al.

ICLR 2025
not collected
#44

Towards Unbiased Calibration using Meta-Regularization

Jacek Golebiowski, Cheng Wang

ICLR 2025
not collected
#45

Optimal and Provable Calibration in High-Dimensional Binary Classification: Angular Calibration and Platt Scaling

Yufan Li, Pragya Sur

NeurIPS 2025, arXiv:2502.15131
binary classification, calibration theory, high-dimensional statistics, bregman divergence, +4 more
not collected
#46

General Uncertainty Estimation with Delta Variances

Simon Schmitt, John Shawe-Taylor, Hado van Hasselt

AAAI 2025, arXiv:2502.14698
not collected
#47

Reassessing How to Compare and Improve the Calibration of Machine Learning Models

Muthu Chidambaram, Rong Ge

ICLR 2025
not collected
#48

Dirichlet-Based Prediction Calibration for Learning with Noisy Labels

Chen-Chen Zong, Ye-Wen Wang, Ming-Kun Xie et al.

AAAI 2024
not collected
#49

Catalyst for Clustering-Based Unsupervised Object Re-identification: Feature Calibration

Huafeng Li, Qingsong Hu, Zhanxuan Hu

AAAI 2024
not collected
#50

On the Asymptotic Optimality of Confidence Interval Based Algorithms for Fixed Confidence MABs

Kushal Kejriwal, Nikhil Karamchandani, Jayakrishnan Nair

AAAI 2025
not collected
#51

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

Mohsin Hasan, Guojun Zhang, Kaiyang Guo et al.

AAAI 2024, arXiv:2312.09817
not collected
#52

Intelligent Calibration for Bias Reduction in Sentiment Corpora Annotation Process

Idan Toker, David Sarne, Jonathan Schler

AAAI 2024
not collected
#53

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz et al.

ICLR 2025, arXiv:2405.13922
uncertainty calibration, adversarial attacks, certification methods, model calibration, +3 more
not collected
#54

Generative Calibration of Inaccurate Annotation for Label Distribution Learning

Liang He, Yunan Lu, Weiwei Li et al.

AAAI 2024
not collected
#55

Attack-inspired Calibration Loss for Calibrating Crack Recognition

Zhuangzhuang Chen, Qiangyu Chen, Jiahao Zhang et al.

AAAI 2025
not collected
#56

CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration

Fu-Zhao Ou, Chongyi Li, Shiqi Wang et al.

CVPR 2024
not collected
#57

Improving Model Probability Calibration by Integration of Large Data Sources with Biased Labels

Renat Sergazinov, Richard Chen, Cheng Ji et al.

AAAI 2025
not collected
#58

Conformalized Interval Arithmetic with Symmetric Calibration

Rui Luo, Zhixin Zhou

AAAI 2025, arXiv:2408.10939
not collected
#59

Parametric ρ-Norm Scaling Calibration

Siyuan Zhang, Linbo Xie

AAAI 2025
not collected
#60

Inlier Confidence Calibration for Point Cloud Registration

Yongzhe Yuan, Yue Wu, Xiaolong Fan et al.

CVPR 2024
not collected
#61

Self-Calibrating Vicinal Risk Minimisation for Model Calibration

Jiawei Liu, Changkun Ye, Ruikai Cui et al.

CVPR 2024
not collected
#62

Calibration Bottleneck: Over-compressed Representations are Less Calibratable

Deng-Bao Wang, Min-Ling Zhang

ICML 2024
uncertainty calibration, model calibratability, post-hoc calibration, weight decay regularizer, +3 more
not collected
#63

Parametric Scaling Law of Tuning Bias in Conformal Prediction

Hao Zeng, Kangdao Liu, Bingyi Jing et al.

ICML 2025, arXiv:2502.03023
not collected
#64

How Flawed Is ECE? An Analysis via Logit Smoothing

Muthu Chidambaram, Holden Lee, Colin McSwiggen et al.

ICML 2024
model calibration, expected calibration error, miscalibration metric, logit smoothing, +4 more
not collected
#65

Rectifying Conformity Scores for Better Conditional Coverage

Vincent Plassier, Alexander Fishkov, Victor Dheur et al.

ICML 2025, arXiv:2502.16336
not collected
#66

Set Learning for Accurate and Calibrated Models

Lukas Muttenthaler, Robert A Vandermeulen, Qiuyi (Richard) Zhang et al.

ICLR 2024, arXiv:2307.02245
not collected
#67

Linguistic Calibration of Long-Form Generations

Neil Band, Xuechen Li, Tengyu Ma et al.

ICML 2024
linguistic calibration, long-form generation, confidence statements, decision-making, +3 more
not collected
#68

Pointwise Information Measures as Confidence Estimators in Deep Neural Networks: A Comparative Study

Shelvia Wongso, Rohan Ghosh, Mehul Motani

ICML 2025
not collected
#69

Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models

Shuoyuan Wang, Sharon Li, Hongxin Wei

ICML 2025, arXiv:2410.02681
not collected
#70

Improving Multi-Class Calibration through Normalization-Aware Isotonic Techniques

Alon Arad, Saharon Rosset

ICML 2025, arXiv:2512.09054
not collected
#71

Minimal Perspective Autocalibration

Andrea Porfiri Dal Cin, Timothy Duff, Luca Magri et al.

CVPR 2024, arXiv:2405.05605
not collected
#72

Unbiased Estimator for Distorted Conics in Camera Calibration

Chaehyeon Song, Jaeho Shin, Myung-Hwan Jeon et al.

CVPR 2024, arXiv:2403.04583
not collected
#73

PAC-Bayes Analysis for Recalibration in Classification

Masahiro Fujisawa, Futoshi Futami

ICML 2025, arXiv:2406.06227
not collected
#74

T-Cal: An Optimal Test for the Calibration of Predictive Models

Donghwan Lee, Xinmeng Huang, Hamed Hassani et al.

ICML 2024
predictive model calibration, uncertainty quantification, hypothesis testing, expected calibration error, +4 more
not collected
#75

Learning model uncertainty as variance-minimizing instance weights

Nishant Jain, Karthikeyan Shanmugam, Pradeep Shenoy

ICLR 2024
not collected
#76

Improving the Statistical Efficiency of Cross-Conformal Prediction

ICML 2025, arXiv:2503.01495
not collected
#77

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

Miao Xiong, Zhiyuan Hu, Xinyang Lu et al.

ICLR 2024, arXiv:2306.13063
not collected
#78

Uncertainty Quantification for LLM-Based Survey Simulations

Chengpiao Huang, Yuhang Wu, Kaizheng Wang

ICML 2025
not collected
#79

Multicalibration for Confidence Scoring in LLMs

Gianluca Detommaso, Martin A Bertran, Riccardo Fogliato et al.

ICML 2024, arXiv:2404.04689
confidence scoring, multicalibration, large language models, calibration methods, +4 more
not collected
#80

Enhancing Post-training Quantization Calibration through Contrastive Learning

Yuzhang Shang, Gaowen Liu, Ramana Kompella et al.

CVPR 2024
not collected
#81

Algorithms with Calibrated Machine Learning Predictions

Judy Hanwen Shen, Ellen Vitercik, Anders Wikum

ICML 2025
not collected
#82

Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs

Daniel D. Johnson, Daniel Tarlow, David Duvenaud et al.

ICML 2024
epistemic uncertainty quantification, second-order calibration, conditional distribution prediction, hallucination detection, +2 more
not collected
#83

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses

Xin Liu, Muhammad Khalifa, Lu Wang

ICLR 2024, arXiv:2310.19208
not collected
#84

FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized Scaler

Hongyi Peng, Han Yu, Xiaoli Tang et al.

ICML 2024, arXiv:2405.15458
federated learning, model calibration, data heterogeneity, non-iid data, +4 more
not collected
#85

An Empirical Study Into What Matters for Calibrating Vision-Language Models

Weijie Tu, Weijian Deng, Dylan Campbell et al.

ICML 2024
vision-language models, uncertainty estimation, model calibration, temperature scaling, +3 more
not collected
#86

On the Calibration of Human Pose Estimation

Kerui Gu, Rongyu Chen, Xuanlong Yu et al.

ICML 2024, arXiv:2311.17105
human pose estimation, confidence calibration, keypoint localization, heatmap analysis, +3 more
not collected
#87

Open-Vocabulary Calibration for Fine-tuned CLIP

Shuoyuan Wang, Jindong Wang, Guoqing Wang et al.

ICML 2024, arXiv:2402.04655
vision-language models, open-vocabulary tasks, confidence calibration, parameter-efficient fine-tuning, +3 more
not collected
#88

Tilt and Average: Geometric Adjustment of the Last Layer for Recalibration

Gyusang Cho, Chan-Hyun Youn

ICML 2024
neural network calibration, confidence alignment, last layer adjustment, geometric adjustment, +2 more
not collected
#89

Sampling-based Multi-dimensional Recalibration

Youngseog Chung, Ian Char, Jeff Schneider

ICML 2024
probabilistic forecast calibration, multi-dimensional regression, sample-based uncertainty, highest density regions, +4 more
not collected
#90

Conformalized Survival Distributions: A Generic Post-Process to Increase Calibration

Shi-ang Qi, Yakun Yu, Russell Greiner

ICML 2024, arXiv:2405.07374
survival analysis, model calibration, discrimination performance, conformal regression, +2 more
not collected
#91

Confidence Self-Calibration for Multi-Label Class-Incremental Learning

Kaile Du, Yifan Zhou, Fan Lyu et al.

ECCV 2024
not collected
#92

Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset

Mijoo Kim, Junseok Kwon

ECCV 2024
not collected
#93

Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator

Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi et al.

ECCV 2024
not collected
#94

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.

ECCV 2024, arXiv:2403.07263
conformal prediction, uncertainty quantification, object detection, bounding box localization, +4 more
not collected
#95

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Jiayi Guo, Chuanhao Yan, Xingqian Xu et al.

ICCV 2025, arXiv:2509.26231
diffusion model alignment, multimodal guidance, image-text alignment, preference optimization, +3 more
not collected
#96

Improving Accuracy and Calibration via Differentiated Deep Mutual Learning

Han Liu, Peng Cui, Bingning Wang et al.

CVPR 2025
not collected
#97

Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Eric Slyman, Mehrab Tanjim, Kushal Kafle et al.

ICCV 2025
not collected
#98

CaliMatch: Adaptive Calibration for Improving Safe Semi-supervised Learning

Jinsoo Bae, Seoung Bum Kim, Hyungrok Do

ICCV 2025
not collected
#99

Uncertainty Weighted Gradients for Model Calibration

Jinxu Lin, Linwei Tao, Minjing Dong et al.

CVPR 2025, arXiv:2503.22725
model calibration, uncertainty estimation, loss functions, gradient weighting, +4 more
not collected
#100

Deterministic Object Pose Confidence Region Estimation

Jinghao Wang, Zhang Li, Zi Wang et al.

ICCV 2025
not collected