Explaining nonlinear classification decisions with deep Taylor decomposition
Technische Universität Berlin · Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute · +2 more institutions
Abstract
Nonlinear methods such as Deep Neural Networks (DNNs) are the gold standard for various challenging machine learning problems such as image recognition. Although these methods perform impressively well, they have a significant disadvantage, the lack of transparency, limiting the interpretability of the solution and thus the scope of application in practice. Especially DNNs act as black boxes due to their multilayer nonlinear structure. In this paper we introduce a novel methodology for interpreting generic multilayer neural networks by decomposing the network classification decision into contributions of its input elements. Although our focus is on image classification, the method is applicable to a broad set…
Citation impact
- FWCI
- 130.31
- Percentile
- 100%
- References
- 78
Authors
5- GMGrégoire MontavonCorresponding
Technische Universität Berlin
- SLSebastian Lapuschkin
Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute
- ABAlexander Binder
Singapore University of Technology and Design
- WSWojciech SamekCorresponding
Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute
- KMKlaus‐Robert MüllerCorresponding
Korea University, Technische Universität Berlin
Topics & keywords
- Interpretability
- MNIST database
- Computer science
- Artificial intelligence
- Artificial neural network
- Nonlinear system
- Deep learning
- Focus (optics)
- Peace, Justice and strong institutions