Feature Denoising for Improving Adversarial Robustness
Johns Hopkins University · Meta (Israel)
Abstract
Adversarial attacks to image classification systems present challenges to convolutional networks and opportunities for understanding them. This study suggests that adversarial perturbations on images lead to noise in the features constructed by these networks. Motivated by this observation, we develop new network architectures that increase adversarial robustness by performing feature denoising. Specifically, our networks contain blocks that denoise the features using non-local means or other filters; the entire networks are trained end-to-end. When combined with adversarial training, our feature denoising networks substantially improve the state-of-the-art in adversarial robustness in both white-box and…
Citation impact
- FWCI
- 75.30
- Percentile
- 100%
- References
- 44
Authors
5Topics & keywords
- Adversarial system
- Robustness (evolution)
- Computer science
- Artificial intelligence
- White box
- Noise reduction
- Deep learning
- Feature (linguistics)