preprintarXiv (Cornell University)Mar 4, 2017GREEN OA

Axiomatic Attribution for Deep Networks

Google (United States)

Indexed inarxivdatacite

Abstract

We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of…

Citation impact

2,631
total citations
FWCI
Percentile
References
29
Citations per year

Authors

3

Topics & keywords

Keywords
  • Axiom
  • Attribution
  • Computer science
  • Debugging
  • Operator (biology)
  • Simple (philosophy)
  • Theoretical computer science
  • Artificial intelligence
No related works found for this paper.