Captum: A unified and generic model interpretability library for PyTorch
Indexed inarxivdatacite
Abstract
In this paper we introduce a novel, unified, open-source model interpretability library for PyTorch [12]. The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms, also known as feature, neuron and layer importance algorithms, as well as a set of evaluation metrics for these algorithms. It can be used for both classification and non-classification models including graph-structured models built on Neural Networks (NN). In this paper we give a high-level overview of supported attribution algorithms and show how to perform memory-efficient and scalable computations. We emphasize that the three main characteristics of the library are multimodality,…
Citation impact
637
total citations
- FWCI
- —
- Percentile
- —
- References
- 18
Citations per year
Authors
11Topics & keywords
Topics
Keywords
- Interpretability
- Computer science
- Debugging
- Implementation
- Extensibility
- Visualization
- Scalability
- Artificial intelligence
UN Sustainable Development Goals
- Quality Education
No related works found for this paper.