articleJournal Of Big DataJan 24, 2025GOLD OA

A problem-agnostic approach to feature selection and analysis using SHAP

Florida Atlantic University

Indexed incrossrefdoaj

Abstract

Feature selection is an effective data reduction technique. SHapley Additive exPlanations (SHAP) can be used to provide a feature importance ranking for models built with labeled or unlabeled data. Thus, one may use the SHAP feature importance ranking in a feature selection technique by selecting the k highest ranking features. Furthermore, this SHAP-based feature selection technique is applicable regardless of the availability of labels for data. We use the Kaggle Credit Card Fraud detection dataset to simulate three label availability scenarios. When no labeled data is available, unsupervised learners should be used. We explore feature selection for data reduction with Isolation Forest and SHAP for this…

Citation impact

53
total citations
FWCI
100.63
Percentile
100%
References
22
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Computational Science and Engineering
  • Feature selection
  • Selection (genetic algorithm)
  • Feature (linguistics)
  • Artificial intelligence
  • Computational science
No related works found for this paper.

Funding