2025 "model reliability" Papers
4 papers found
AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions
Polina Kirichenko, Mark Ibrahim, Kamalika Chaudhuri et al.
NeurIPS 2025posterarXiv:2506.09038
26
citations
Is Your Multimodal Language Model Oversensitive to Safe Queries?
Xirui Li, Hengguang Zhou, Ruochen Wang et al.
ICLR 2025posterarXiv:2406.17806
20
citations
Reasoning Models Better Express Their Confidence
Dongkeun Yoon, Seungone Kim, Sohee Yang et al.
NeurIPS 2025posterarXiv:2505.14489
32
citations
Regretful Decisions under Label Noise
Sujay Nagaraj, Yang Liu, Flavio Calmon et al.
ICLR 2025poster
3
citations