Axiomatic Attribution for Deep Networks

Sundararajan, Mukund; Taly, Ankur; Yan, Qiqi

doi:10.48550/arxiv.1703.01365

preprintarXiv (Cornell University)Mar 4, 2017GREEN OA

Axiomatic Attribution for Deep Networks

MSMukund Sundararajan ATAnkur Taly QYQiqi Yan

Google (United States)

Indexed inarxivdatacite

Abstract

We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of…

Citation impact

2,631

total citations

FWCI: —
Percentile: —
References: 29

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Axiom
Attribution
Computer science
Debugging
Operator (biology)
Simple (philosophy)
Theoretical computer science
Artificial intelligence

No related works found for this paper.