Towards Deep Neural Network Architectures Robust to Adversarial Examples
Max Planck Innovation · Max Planck Society · +1 more institution
Abstract
Recent work has shown deep neural networks (DNNs) to be highly susceptible to well-designed, small perturbations at the input layer, or so-called adversarial examples. Taking images as an example, such distortions are often imperceptible, but can result in 100% mis-classification for a state of the art DNN. We study the structure of adversarial examples and explore network topology, pre-processing and training strategies to improve the robustness of DNNs. We perform various experiments to assess the removability of adversarial examples by corrupting with additional noise and pre-processing with denoising autoencoders (DAEs). We find that DAEs can remove substantial amounts of the adversarial noise. How- ever,…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 19
Authors
2Topics & keywords
- Adversarial system
- Robustness (evolution)
- Computer science
- Deep neural networks
- Artificial intelligence
- Artificial neural network
- Autoencoder
- Deep learning
- Peace, Justice and strong institutions