Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization

Selvaraju, Ramprasaath R.; Cogswell, Michael; Das, Abhishek; Vedantam, Ramakrishna; Parikh, Devi; Batra, Dhruv

doi:10.1109/iccv.2017.74

Article · Oct 1, 2017 · Closed access

Georgia Institute of Technology · Meta (Israel)

Indexed in Crossref

Abstract

We propose a technique for producing 'visual explanations' for decisions from a large class of Convolutional Neural Network (CNN)-based models, making them more transparent. Our approach, Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept (say logits for 'dog' or even a caption), flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept. Unlike previous approaches, Grad-CAM is applicable to a wide variety of CNN model-families: (1) CNNs with fully-connected layers (e.g. VGG), (2) CNNs used for structured outputs (e.g. captioning), (3) CNNs used in tasks with…
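The abstract as shown here is truncated and does not spell out the computation. As a rough illustration only, the sketch below follows the commonly cited Grad-CAM formulation: gradients of the target score with respect to the final convolutional feature maps are averaged over spatial positions to give one weight per channel, and the weighted sum of feature maps is passed through a ReLU. The function name `grad_cam_map`, the nested-list layout, and the toy numbers are assumptions made for this example, not taken from the paper's code. In practice the activations and gradients would come from a deep learning framework's autograd.

```python
from typing import List

Grid = List[List[float]]  # one H x W feature map


def grad_cam_map(activations: List[Grid], gradients: List[Grid]) -> Grid:
    """Coarse localization map from K feature maps and their gradients.

    activations[k][i][j]: output of the final conv layer, channel k.
    gradients[k][i][j]:   d(target score) / d(activations[k][i][j]).
    (Hypothetical helper for illustration; not the authors' implementation.)
    """
    if len(activations) != len(gradients):
        raise ValueError("activations and gradients need the same channel count")
    height, width = len(activations[0]), len(activations[0][0])

    # Channel weights: global-average-pool each gradient map over space.
    weights = [
        sum(sum(row) for row in grad) / (height * width)
        for grad in gradients
    ]

    # Weighted sum of feature maps across channels.
    cam = [[0.0] * width for _ in range(height)]
    for w, fmap in zip(weights, activations):
        for i in range(height):
            for j in range(width):
                cam[i][j] += w * fmap[i][j]

    # ReLU: keep only positions with positive evidence for the target concept.
    return [[max(0.0, v) for v in row] for row in cam]


# Toy input (made-up numbers): 2 channels on a 2 x 2 spatial grid.
acts = [
    [[1.0, 0.0], [0.0, 2.0]],
    [[0.0, 3.0], [1.0, 0.0]],
]
grads = [
    [[1.0, 1.0], [1.0, 1.0]],      # channel weight  1.0
    [[-2.0, -2.0], [-2.0, -2.0]],  # channel weight -2.0
]
cam = grad_cam_map(acts, grads)
```

With these toy values the first channel gets weight 1.0 and the second gets -2.0, so positions dominated by the second channel are zeroed by the ReLU and `cam` is `[[1.0, 0.0], [0.0, 2.0]]`. The resulting map has the spatial size of the final convolutional layer, which is why the abstract calls it coarse; it is typically upsampled to the image size for display.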

Citation impact

Total citations: 21,187

FWCI: 245.84
Percentile: 100%
References: 65

Authors: 6

Topics & keywords

Keywords

Closed captioning
Computer science
Discriminative model
Convolutional neural network
Artificial intelligence
Visualization
Generalization
Question answering

UN Sustainable Development Goals

Reduced inequalities
