Learning Important Features Through Propagating Activation Differences

Shrikumar, Avanti; Greenside, Peyton; Kundaje, Anshul

doi:10.48550/arxiv.1704.02685

preprintarXiv (Cornell University)Apr 10, 2017GREEN OA

Learning Important Features Through Propagating Activation Differences

ASAvanti Shrikumar PGPeyton Greenside AKAnshul Kundaje

Stanford University

Indexed inarxivdatacite

Abstract

The purported "black box" nature of neural networks is a barrier to adoption in applications where interpretability is essential. Here we present DeepLIFT (Deep Learning Important FeaTures), a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input. DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. By optionally giving separate consideration to positive and negative contributions, DeepLIFT can also reveal dependencies which are missed by other approaches. Scores can be computed efficiently in a…

Citation impact

2,372

total citations

FWCI: —
Percentile: —
References: 12

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Interpretability
MNIST database
Computer science
Feature (linguistics)
Code (set theory)
Artificial neural network
Artificial intelligence
Deep neural networks

No related works found for this paper.