Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review
University of Washington · Seattle University · +1 more institution
Abstract
Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders. Explaining, in a human-understandable way, the relationship between the input and output of machine learning models is essential to the development of trustworthy machine learning based systems. A burgeoning body of research seeks to define the goals and methods of explainability in machine learning. In this article, we seek to review and categorize research on counterfactual explanations , a specific class of explanation that provides a link between what could have happened had input to a model been changed in a particular way. Modern approaches to counterfactual…
Citation impact
- FWCI
- 41.37
- Percentile
- 100%
- References
- 285
Authors
6Topics & keywords
- Computer science
- Counterfactual thinking
- Artificial intelligence
- Machine learning
- Psychology