🧬Robustness

Model Calibration

Calibrating confidence estimates

100 papers, 965 total citations
Feb '24 to Jan '26, 436 papers
Also includes: model calibration, calibration, confidence calibration, temperature scaling

Top Papers

#1

Conformal Risk Control

Anastasios Angelopoulos, Stephen Bates, Adam Fisch et al.

ICLR 2024
193
citations
#2

Calibrating Large Language Models with Sample Consistency

Qing Lyu, Kumar Shridhar, Chaitanya Malaviya et al.

AAAI 2025
48
citations
#3

Smooth ECE: Principled Reliability Diagrams via Kernel Smoothing

Jaroslaw Blasiok, Preetum Nakkiran

ICLR 2024
46
citations
#4

Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement

Jaehun Jung, Faeze Brahman, Yejin Choi

ICLR 2025
42
citations
#5

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda et al.

ICML 2025
33
citations
#6

Reasoning Models Better Express Their Confidence

Dongkeun Yoon, Seungone Kim, Sohee Yang et al.

NeurIPS 2025
30
citations
#7

Copula Conformal prediction for multi-step time series prediction

Sophia Sun, Rose Yu

ICLR 2024
29
citations
#8

GeoCalib: Learning Single-image Calibration with Geometric Optimization

Alexander Veicht, Paul-Edouard Sarlin, Philipp Lindenberger et al.

ECCV 2024
23
citations
#9

Addressing Misspecification in Simulation-based Inference through Data-driven Calibration

Antoine Wehenkel, Juan L. Gamella, Ozan Sener et al.

ICML 2025
23
citations
#10

A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified Benchmark

Jakub Paplham, Vojtech Franc

CVPR 2024
20
citations
#11

Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation

Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel

ICLR 2024
19
citations
#12

A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal Regression

Victor Dheur, Matteo Fontana, Yorick Estievenart et al.

ICML 2025
16
citations
#13

Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption

Du CHEN, Tianhe Wu, Kede Ma et al.

CVPR 2025
16
citations
#14

Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms

Joren Brunekreef, Eric Marcus, Ray Sheombarsing et al.

CVPR 2024
15
citations
#15

Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models

Gianni Franchi, Olivier Laurent, Maxence Leguéry et al.

CVPR 2024
15
citations
#16

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024
13
citations
#17

R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

ICLR 2024
12
citations
#18

Conformal Thresholded Intervals for Efficient Regression

Rui Luo, Zhixin Zhou

AAAI 2025
11
citations
#19

Confidence Estimation for Error Detection in Text-to-SQL Systems

Oleg Somov, Elena Tutubalina

AAAI 2025
10
citations
#20

Consistency Checks for Language Model Forecasters

Daniel Paleka, Abhimanyu Pallavi Sudhir, Alejandro Alvarez et al.

ICLR 2025, arXiv:2412.18544
language model forecasting, consistency checks, automated evaluation system, arbitrage-based metrics, +3 more
10
citations
#21

Reliable and Efficient Amortized Model-based Evaluation

Sang Truong, Yuheng Tu, Percy Liang et al.

ICML 2025
10
citations
#22

Towards Modeling Uncertainties of Self-explaining Neural Networks via Conformal Prediction

Wei Qian, Chenxu Zhao, Yangyi Li et al.

AAAI 2024, arXiv:2401.01549
self-explaining neural networks, conformal prediction, uncertainty quantification, interpretable machine learning, +4 more
10
citations
#23

ConfTuner: Training Large Language Models to Express Their Confidence Verbally

Yibo Li, Miao Xiong, Jiaying Wu et al.

NeurIPS 2025
10
citations
#24

On Temperature Scaling and Conformal Prediction of Deep Classifiers

Lahav Dabah, Tom Tirer

ICML 2025
9
citations
#25

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.

ICLR 2024
9
citations
#26

Unraveling Batch Normalization for Realistic Test-Time Adaptation

Zixian Su, Jingwei Guo, Kai Yao et al.

AAAI 2024, arXiv:2312.09486
batch normalization, test-time adaptation, domain shift, mini-batch degradation, +3 more
9
citations
#27

Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement

Hesong Li, Ziqi Wu, Ruiwen Shao et al.

CVPR 2025
8
citations
#28

Error-quantified Conformal Inference for Time Series

Junxi Wu, Dongjian Hu, Yajie Bao et al.

ICLR 2025, arXiv:2502.00818
conformal inference, uncertainty quantification, time series prediction, prediction sets, +3 more
8
citations
#29

On the Limitations of Temperature Scaling for Distributions with Overlaps

Muthu Chidambaram, Rong Ge

ICLR 2024
8
citations
#30

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity

Zhengping Jiang, Anqi Liu, Ben Van Durme

NeurIPS 2025, arXiv:2502.19110
linguistic calibration, uncertainty quantification, conformal prediction, answer set prediction, +3 more
7
citations
#31

Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks

Ziqing Wang, Yuetong Fang, Jiahang Cao et al.

AAAI 2025
7
citations
#32

Error Bounds for Gaussian Process Regression Under Bounded Support Noise with Applications to Safety Certification

Robert Reed, Luca Laurenti, Morteza Lahijanian

AAAI 2025
7
citations
#33

Robustness Auditing for Linear Regression: To Singularity and Beyond

Ittai Rubinstein, Samuel Hopkins

ICLR 2025, arXiv:2410.07916
robustness auditing, linear regression, ordinary least squares, sample removal, +3 more
7
citations
#34

SteerConf: Steering LLMs for Confidence Elicitation

Ziang Zhou, Tianyuan Jin, Jieming Shi et al.

NeurIPS 2025, arXiv:2503.02863
confidence elicitation, model calibration, steering prompt strategy, confidence consistency, +3 more
6
citations
#35

Epistemic Uncertainty Quantification For Pre-Trained Neural Networks

Hanjing Wang, Qiang Ji

CVPR 2024
6
citations
#36

The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing

Blaise Delattre, Alexandre Araujo, Quentin Barthélemy et al.

ICLR 2024
6
citations
#37

CSformer: Combining Channel Independence and Mixing for Robust Multivariate Time Series Forecasting

Haoxin Wang, Yipeng Mo, Kunlan Xiang et al.

AAAI 2025
6
citations
#38

Simultaneous Swap Regret Minimization via KL-Calibration

Haipeng Luo, Spandan Senapati, Vatsal Sharan

NeurIPS 2025, arXiv:2502.16387
swap regret minimization, kl-calibration, calibration measures, proper loss functions, +3 more
6
citations
#39

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation

Muhammed Yusuf Kocyigit, Eleftheria Briakou, Daniel Deutsch et al.

ICML 2025
6
citations
#40

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025, arXiv:2410.04315
certainty calibration, linguistic expressions, uncertainty distributions, post-hoc calibration, +3 more
5
citations
#41

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICLR 2025
5
citations
#42

On Volume Minimization in Conformal Regression

Batiste Le Bars, Pierre Humbert

ICML 2025
5
citations
#43

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025
5
citations
#44

Integral Imprecise Probability Metrics

Siu Lun (Alan) Chau, Michele Caprio, Krikamol Muandet

NeurIPS 2025
5
citations
#45

Learning with Calibration: Exploring Test-Time Computing of Spatio-Temporal Forecasting

Wei Chen, Yuxuan Liang

NeurIPS 2025
5
citations
#46

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Yuxin Wang, Maresa Schröder, Dennis Frauen et al.

ICLR 2025
5
citations
#47

Revisiting Calibration of Wide-Angle Radially Symmetric Cameras

Andrea Porfiri Dal Cin, Francesco Azzoni, Giacomo Boracchi et al.

ECCV 2024
camera calibration, wide-angle cameras, radially symmetric models, implicit camera representation, +4 more
5
citations
#48

QA-Calibration of Language Model Confidence Scores

Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.

ICLR 2025
5
citations
#49

Difficulty-aware Balancing Margin Loss for Long-tailed Recognition

Minseok Son, Inyong Koo, Jinyoung Park et al.

AAAI 2025
5
citations
#50

AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration

Javier Tirado-Garín, Javier Civera

ICCV 2025
5
citations
#51

Robust Self-calibration of Focal Lengths from the Fundamental Matrix

Viktor Kocur, Daniel Kyselica, Zuzana Kukelova

CVPR 2024
5
citations
#52

Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

Dongyan Huo, Yudong Chen, Qiaomin Xie

AAAI 2024, arXiv:2312.10894
linear stochastic approximation, markovian data, constant stepsize, statistical inference, +4 more
4
citations
#53

Conformal Inference of Individual Treatment Effects Using Conditional Density Estimates

Baozhen Wang, Xingye Qiao

AAAI 2025
4
citations
#54

Generalized Venn and Venn-Abers Calibration with Applications in Conformal Prediction

Lars van der Laan, Ahmed Alaa

ICML 2025
4
citations
#55

Kernel-based Optimally Weighted Conformal Time-Series Prediction

Jonghyeok Lee, Chen Xu, Yao Xie

ICLR 2025
4
citations
#56

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation

David Heineman, Valentin Hofmann, Ian Magnusson et al.

NeurIPS 2025
4
citations
#57

Quantifying Prediction Consistency Under Fine-tuning Multiplicity in Tabular LLMs

Faisal Hamman, Sachindra P Dissanayake, Saumitra Mishra et al.

ICML 2025
4
citations
#58

Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation

Yash Patel, Eduardo Ochoa Rivera, Ambuj Tewari

NeurIPS 2025
4
citations
#59

Introducing FOReCAst: The Future Outcome Reasoning and Confidence Assessment Benchmark

Zhangdie Yuan, Zifeng Ding, Andreas Vlachos

NeurIPS 2025
4
citations
#60

Backward Conformal Prediction

Etienne Gauthier, Francis Bach, Michael Jordan

NeurIPS 2025
4
citations
#61

$\texttt{BetaConform}$: Efficient MAP Estimation of LLM Ensemble Judgment Performance with Prior Transfer

Huaizhi Qu, Inyoung Choi, Zhen Tan et al.

NeurIPS 2025
4
citations
#62

Non-parametric Sensor Noise Modeling and Synthesis

Ali Mosleh, Luxi Zhao, Atin Vikram Singh et al.

ECCV 2024
sensor noise modeling, non-parametric modeling, noise synthesis, probability mass functions, +2 more
4
citations
#63

Towards Establishing Guaranteed Error for Learned Database Operations

Sepanta Zeighami, Cyrus Shahabi

ICLR 2024
4
citations
#64

Unlocking the Potential of Model Calibration in Federated Learning

Yun-Wei Chu, Dong-Jun Han, Seyyedali Hosseinalipour et al.

ICLR 2025
4
citations
#65

Simplification Is All You Need against Out-of-Distribution Overconfidence

Keke Tang, Chao Hou, Weilong Peng et al.

CVPR 2025
4
citations
#66

Towards Robust Influence Functions with Flat Validation Minima

Xichen Ye, Yifan Wu, Weizhong Zhang et al.

ICML 2025
3
citations
#67

Multi-Accurate CATE is Robust to Unknown Covariate Shifts

Angela Zhou, Christoph Kern, Michael Kim

ICLR 2025
heterogeneous treatment effects, conditional average treatment effects, covariate shift robustness, multi-accurate predictors, +4 more
3
citations
#68

Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification

Kaizheng Wang, Fabio Cuzzolin, Keivan Shariatmadar et al.

ICLR 2025, arXiv:2405.15047
uncertainty estimation, bayesian neural networks, deep ensembles, credal set representation, +3 more
3
citations
#69

How Benchmark Prediction from Fewer Data Misses the Mark

Guanhua Zhang, Florian E. Dorner, Moritz Hardt

NeurIPS 2025
3
citations
#70

Uncertainty Weighted Gradients for Model Calibration

Jinxu Lin, Linwei Tao, Minjing Dong et al.

CVPR 2025, arXiv:2503.22725
model calibration, uncertainty estimation, loss functions, gradient weighting, +4 more
3
citations
#71

Towards Calibrated Deep Clustering Network

Yuheng Jia, Jianhong Cheng, Hui LIU et al.

ICLR 2025
3
citations
#72

Calibrating LLMs with Information-Theoretic Evidential Deep Learning

Yawei Li, David Rügamer, Bernd Bischl et al.

ICLR 2025
3
citations
#73

High-Dimensional Calibration from Swap Regret

Maxwell Fishelson, Noah Golowich, Mehryar Mohri et al.

NeurIPS 2025
3
citations
#74

Probably Approximately Precision and Recall Learning

Lee Cohen, Yishay Mansour, Shay Moran et al.

NeurIPS 2025
3
citations
#75

Human-in-the-Loop Visual Re-ID for Population Size Estimation

Gustavo Perez, Daniel Sheldon, Grant Van Horn et al.

ECCV 2024
3
citations
#76

Multi-Dimensional Conformal Prediction

Yam Tawachi, Bracha Laufer-Goldshtein

ICLR 2025
3
citations
#77

Fractal Calibration for Long-tailed Object Detection

Konstantinos Alexandridis, Ismail Elezi, Jiankang Deng et al.

CVPR 2025
3
citations
#78

Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration

Wonjeong Choi, Jungwuk Park, Dong-Jun Han et al.

AAAI 2024, arXiv:2402.15019
temperature scaling, out-of-domain calibration, domain shift robustness, confidence calibration, +3 more
2
citations
#79

Discretization-free Multicalibration through Loss Minimization over Tree Ensembles

Hongyi Henry Jin, Zijun Ding, Dung Daniel Ngo et al.

NeurIPS 2025
2
citations
#80

From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers

Swaminathan Gurumurthy, Karnik Ram, Bingqing Chen et al.

CVPR 2024
2
citations
#81

Learning With Multi-Group Guarantees For Clusterable Subpopulations

Jessica Dai, Nika Haghtalab, Eric Zhao

ICML 2025
2
citations
#82

FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments

Aoru Xue, Yiming Ren, Zining Song et al.

AAAI 2025
2
citations
#83

Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning

Yan Scholten, Stephan Günnemann

ICLR 2025, arXiv:2410.09878
conformal prediction, uncertainty quantification, data poisoning attacks, prediction sets, +2 more
2
citations
#84

How Much is Unseen Depends Chiefly on Information About the Seen

Seongmin Lee, Marcel Boehme

ICLR 2025
2
citations
#85

Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance

Siyu Sun, Han Lu, Jiangtong Li et al.

ICLR 2025
2
citations
#86

CBMA: Improving Conformal Prediction through Bayesian Model Averaging

Pankaj Bhagwat, Linglong Kong, Bei Jiang

ICLR 2025
2
citations
#87

Stochastic Online Conformal Prediction with Semi-Bandit Feedback

Haosen Ge, Hamsa Bastani, Osbert Bastani

ICML 2025
2
citations
#88

Exact Recovery of Sparse Binary Vectors from Generalized Linear Measurements

Arya Mazumdar, Neha Sangwan

ICML 2025
2
citations
#89

Conformal Inference under High-Dimensional Covariate Shifts via Likelihood-Ratio Regularization

Sunay Joshi, Shayan Kiyani, George J. Pappas et al.

NeurIPS 2025
2
citations
#90

Learning multivariate Gaussians with imperfect advice

Arnab Bhattacharyya, Davin Choo, Philips George John et al.

ICML 2025
2
citations
#91

Beyond One-Hot Labels: Semantic Mixing for Model Calibration

Haoyang Luo, Linwei Tao, Minjing Dong et al.

ICML 2025
2
citations
#92

Credal Prediction based on Relative Likelihood

Timo Löhr, Paul Hofman, Felix Mohr et al.

NeurIPS 2025
2
citations
#93

Towards Certification of Uncertainty Calibration under Adversarial Attacks

Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz et al.

ICLR 2025, arXiv:2405.13922
uncertainty calibration, adversarial attacks, certification methods, model calibration, +3 more
2
citations
#94

Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models

Sima Noorani, Shayan Kiyani, George J. Pappas et al.

NeurIPS 2025
2
citations
#95

Calibrated Language Models and How to Find Them with Label Smoothing

Jerry Huang, Peng Lu, QIUHAO Zeng

ICML 2025
2
citations
#96

MC-PanDA: Mask Confidence for Panoptic Domain Adaptation

Ivan Martinovic, Josip Šarić, Siniša Šegvić

ECCV 2024, arXiv:2407.14110
panoptic segmentation, domain adaptation, mask transformers, prediction uncertainty, +3 more
2
citations
#97

T-CIL: Temperature Scaling using Adversarial Perturbation for Calibration in Class-Incremental Learning

Seong-Hyeon Hwang, Minsu Kim, Steven Euijong Whang

CVPR 2025
2
citations
#98

RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network

Van-Tin Luu, Yong-Lin Cai, Vu-Hoang Tran et al.

CVPR 2025
2
citations
#99

Uncertainty-Aware Self-Training for CTC-Based Automatic Speech Recognition

Eungbeom Kim, Kyogu Lee

AAAI 2025
1
citation
#100

Pushing the Limits of BFP on Narrow Precision LLM Inference

Hui Wang, Yuan Cheng, Xiaomeng Han et al.

AAAI 2025
1
citation