Captum: A unified and generic model interpretability library for PyTorch

Kokhlikyan, Narine; Miglani, Vivek; Martín, Miguel Vargas; Wang, Edward; Alsallakh, Bilal; Reynolds, Jonathan; Melnikov, Alexander; Kliushkina, Natalia; Araya, Carlos L.; Yan, Siqi; Reblitz-Richardson, Orion

doi:10.48550/arxiv.2009.07896

preprintarXiv (Cornell University)Sep 16, 2020GREEN OA

Captum: A unified and generic model interpretability library for PyTorch

NKNarine Kokhlikyan VMVivek Miglani MVMiguel Vargas Martín EWEdward Wang BABilal Alsallakh

Meta (Israel)

Indexed inarxivdatacite

Abstract

In this paper we introduce a novel, unified, open-source model interpretability library for PyTorch [12]. The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms, also known as feature, neuron and layer importance algorithms, as well as a set of evaluation metrics for these algorithms. It can be used for both classification and non-classification models including graph-structured models built on Neural Networks (NN). In this paper we give a high-level overview of supported attribution algorithms and show how to perform memory-efficient and scalable computations. We emphasize that the three main characteristics of the library are multimodality,…

Citation impact

637

total citations

FWCI: —
Percentile: —
References: 18

Citations per year

Authors

11

Topics & keywords

Topics

Keywords

Interpretability
Computer science
Debugging
Implementation
Extensibility
Visualization
Scalability
Artificial intelligence

UN Sustainable Development Goals

Quality Education

No related works found for this paper.