On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
Technische Universität Berlin · Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute · +3 more institutions
Abstract
Understanding and interpreting classification decisions of automated image classification systems is of high value in many applications, as it allows to verify the reasoning of the system and provides additional information to the human expert. Although machine learning methods are solving very successfully a plethora of tasks, they have in most cases the disadvantage of acting as a black box, not providing any information about what made them arrive at a particular decision. This work proposes a general solution to the problem of understanding classification decisions by pixel-wise decomposition of nonlinear classifiers. We introduce a methodology that allows to visualize the contributions of single pixels to…
Citation impact
- FWCI
- 49.97
- Percentile
- 100%
- References
- 56
Authors
6- SBSebastian BachCorresponding
Technische Universität Berlin, Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute
- ABAlexander Binder
Technische Universität Berlin, Singapore University of Technology and Design
- GMGrégoire Montavon
Technische Universität Berlin
- FKFrederick Klauschen
Charité - Universitätsmedizin Berlin
- KMKlaus‐Robert MüllerCorresponding
Technische Universität Berlin, Korea University
Topics & keywords
- MNIST database
- Computer science
- Artificial intelligence
- Pixel
- Pascal (unit)
- Machine learning
- Pattern recognition (psychology)
- Classifier (UML)
- Peace, Justice and strong institutions
Funding
- NRNational Research Foundation
- DFDeutsche ForschungsgemeinschaftAward: 01GQ1115
- BFBundesministerium für Bildung und ForschungAwards: 01IS14013A, 01GQ1115
- BFBundesministerium für Wirtschaft und TechnologieAward: 01MQ07018
- NRNational Research Foundation of KoreaAwards: 01IS14013A, BK21 program
- TUTechnische Universität Berlin
- BABanting and Best Diabetes Centre, University of TorontoAward: 01IS14013A