From Anecdotal Evidence to Quantitative Evaluation Methods: A Systematic Review on Evaluating Explainable AI

Nauta, Meike; Trienes, Jan; Pathak, Shreyasi; Nguyen, Elisa; Peters, Michelle; Schmitt, Yasmin; Schlötterer, Jörg; Keulen, Maurice van; Seifert, Christin

doi:10.1145/3583558

reviewACM Computing SurveysFeb 24, 2023HYBRID OA

From Anecdotal Evidence to Quantitative Evaluation Methods: A Systematic Review on Evaluating Explainable AI

MNMeike Nauta JTJan Trienes SPShreyasi Pathak ENElisa Nguyen MPMichelle Peters

University of Duisburg-Essen · University of Twente

Indexed incrossref

Abstract

The rising popularity of explainable artificial intelligence (XAI) to understand high-performing black boxes raised the question of how to evaluate explanations of machine learning (ML) models. While interpretability and explainability are often presented as a subjectively validated binary property, we consider it a multi-faceted concept. We identify 12 conceptual properties, such as Compactness and Correctness, that should be evaluated for comprehensively assessing the quality of an explanation. Our so-called Co-12 properties serve as categorization scheme for systematically reviewing the evaluation practices of more than 300 papers published in the past 7 years at major AI and ML conferences that introduce…

Citation impact

458

total citations

FWCI: 74.23
Percentile: 100%
References: 211

Citations per year

Authors

9

Topics & keywords

Topics

Keywords

Computer science
Interpretability
Popularity
Categorization
Benchmark (surveying)
Correctness
Artificial intelligence
Systematic review

No related works found for this paper.