Practical Black-Box Attacks against Machine Learning
Pennsylvania State University · OpenAI (United States) · +2 more institutions
Abstract
Machine learning (ML) models, e.g., deep neural networks (DNNs), are vulnerable to adversarial examples: malicious inputs modified to yield erroneous model outputs, while appearing unmodified to human observers. Potential attacks include having malicious content like malware identified as legitimate or controlling vehicle behavior. Yet, all existing adversarial example attacks require knowledge of either the model internals or its training data. We introduce the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge. Indeed, the only capability of our black-box adversary is to observe labels given by the DNN to chosen inputs. Our attack strategy consists in…
Citation impact
- FWCI
- 282.60
- Percentile
- 100%
- References
- 18
Authors
6Topics & keywords
- Adversarial system
- Computer science
- Adversary
- Black box
- Malware
- Deep neural networks
- Artificial intelligence
- Deep learning
- Peace, Justice and strong institutions