Et Tu Certifications: Robustness Certificates Yield Better Adversarial Examples

0citations

PDF Project

Citations

#10

in ICML 2024

of 2635 papers

Authors

Data Points

Authors

Andrew C. Cullen Shijie Liu Paul Montague Sarah Erfani Benjamin Rubinstein

Topics

adversarial robustness robustness certification adversarial examples certification aware attack norm-minimising perturbations security vulnerabilities

Abstract

In guaranteeing the absence of adversarial examples in an instance's neighbourhood, certification mechanisms play an important role in demonstrating neural net robustness. In this paper, we ask if these certifications can compromise the very models they help to protect? Our new *Certification Aware Attack* exploits certifications to produce computationally efficient norm-minimising adversarial examples $74$% more often than comparable attacks, while reducing the median perturbation norm by more than $10$%. While these attacks can be used to assess the tightness of certification bounds, they also highlight that releasing certifications can paradoxically reduce security.

Citation History

Jan 28, 2026