Towards Deep Learning Models Resistant to Adversarial Attacks

Mądry, Aleksander; Makelov, Aleksandar; Schmidt, Ludwig; Tsipras, Dimitris; Vladu, Adrian

doi:10.48550/arxiv.1706.06083

preprintarXiv (Cornell University)Jun 19, 2017GREEN OA

Towards Deep Learning Models Resistant to Adversarial Attacks

AMAleksander Mądry AMAleksandar Makelov LSLudwig Schmidt DTDimitris Tsipras AVAdrian Vladu

Massachusetts Institute of Technology

Indexed inarxivdatacite

Abstract

Recent work has demonstrated that deep neural networks are vulnerable to adversarial examples---inputs that are almost indistinguishable from natural data and yet classified incorrectly by the network. In fact, some of the latest findings suggest that the existence of adversarial attacks may be an inherent weakness of deep learning models. To address this problem, we study the adversarial robustness of neural networks through the lens of robust optimization. This approach provides us with a broad and unifying view on much of the prior work on this topic. Its principled nature also enables us to identify methods for both training and attacking neural networks that are reliable and, in a certain sense,…

Citation impact

1,541

total citations

FWCI: —
Percentile: —
References: 30

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Adversarial system
Adversary
Computer science
Robustness (evolution)
Deep learning
Deep neural networks
Artificial intelligence
Artificial neural network

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.