Certified Adversarial Robustness via Randomized Smoothing
Indexed inarxivdatacite
Abstract
We show how to turn any classifier that classifies well under Gaussian noise into a new classifier that is certifiably robust to adversarial perturbations under the $\ell_2$ norm. This "randomized smoothing" technique has been proposed recently in the literature, but existing guarantees are loose. We prove a tight robustness guarantee in $\ell_2$ norm for smoothing with Gaussian noise. We use randomized smoothing to obtain an ImageNet classifier with e.g. a certified top-1 accuracy of 49% under adversarial perturbations with $\ell_2$ norm less than 0.5 (=127/255). No certified defense has been shown feasible on ImageNet except for smoothing. On smaller-scale datasets where competing approaches to certified…
Citation impact
621
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Smoothing
- Computer science
- Gaussian
- Classifier (UML)
- Robustness (evolution)
- Certification
- Gaussian blur
- Gaussian noise
No related works found for this paper.