From Anecdotal Evidence to Quantitative Evaluation Methods: A Systematic Review on Evaluating Explainable AI
University of Duisburg-Essen · University of Twente
Abstract
The rising popularity of explainable artificial intelligence (XAI) to understand high-performing black boxes raised the question of how to evaluate explanations of machine learning (ML) models. While interpretability and explainability are often presented as a subjectively validated binary property, we consider it a multi-faceted concept. We identify 12 conceptual properties, such as Compactness and Correctness, that should be evaluated for comprehensively assessing the quality of an explanation. Our so-called Co-12 properties serve as categorization scheme for systematically reviewing the evaluation practices of more than 300 papers published in the past 7 years at major AI and ML conferences that introduce…
Citation impact
- FWCI
- 74.23
- Percentile
- 100%
- References
- 211
Authors
9Topics & keywords
- Computer science
- Interpretability
- Popularity
- Categorization
- Benchmark (surveying)
- Correctness
- Artificial intelligence
- Systematic review