Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review

Verma, Sahil; Boonsanong, Varich; Hoang, Minh; Hines, Keegan; Dickerson, John P.; Shah, Chirag

doi:10.1145/3677119

reviewACM Computing SurveysJul 9, 2024HYBRID OA

Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review

SVSahil Verma VBVarich Boonsanong MHMinh Hoang KHKeegan Hines JPJohn P. Dickerson

University of Washington · Seattle University · +1 more institution

Indexed incrossref

Abstract

Machine learning plays a role in many deployed decision systems, often in ways that are difficult or impossible to understand by human stakeholders. Explaining, in a human-understandable way, the relationship between the input and output of machine learning models is essential to the development of trustworthy machine learning based systems. A burgeoning body of research seeks to define the goals and methods of explainability in machine learning. In this article, we seek to review and categorize research on counterfactual explanations , a specific class of explanation that provides a link between what could have happened had input to a model been changed in a particular way. Modern approaches to counterfactual…

Citation impact

138

total citations

FWCI: 41.37
Percentile: 100%
References: 285

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Counterfactual thinking
Artificial intelligence
Machine learning
Psychology

No related works found for this paper.