Data Poisoning Attacks against Conformal Prediction

0citations

PDF

Citations

in ICML 2024

of 2635 papers

Authors

Data Points

Authors

Yangyi Li Aobo Chen Wei Qian Chenxu Zhao Divya Lidder Mengdi Huai

Topics

conformal prediction data poisoning attacks uncertainty quantification black-box attacks prediction set manipulation model-agnostic methods

Abstract

The efficient and theoretically sound uncertainty quantification is crucial for building trust in deep learning models. This has spurred a growing interest in conformal prediction (CP), a powerful technique that provides a model-agnostic and distribution-free method for obtaining conformal prediction sets with theoretical guarantees. However, the vulnerabilities of such CP methods with regard to dedicated data poisoning attacks have not been studied previously. To bridge this gap, for the first time, we in this paper propose a new class of black-box data poisoning attacks against CP, where the adversary aims to cause the desired manipulations of some specific examples' prediction uncertainty results (instead of misclassifications). Additionally, we design novel optimization frameworks for our proposed attacks. Further, we conduct extensive experiments to validate the effectiveness of our attacks on various settings (e.g., the full and split CP settings). Notably, our extensive experiments show that our attacks are more effective in manipulating uncertainty results than traditional poisoning attacks that aim at inducing misclassifications, and existing defenses against conventional attacks are ineffective against our proposed attacks.

Citation History

Jan 28, 2026