articleJournal Of Big DataMar 26, 2024GOLD OA

Feature selection strategies: a comparative analysis of SHAP-value and importance-based methods

Western Kentucky University · Florida Atlantic University

Indexed incrossrefdoaj

Abstract

Abstract In the context of high-dimensional credit card fraud data, researchers and practitioners commonly utilize feature selection techniques to enhance the performance of fraud detection models. This study presents a comparison in model performance using the most important features selected by SHAP (SHapley Additive exPlanations) values and the model’s built-in feature importance list. Both methods rank features and choose the most significant ones for model assessment. To evaluate the effectiveness of these feature selection techniques, classification models are built using five classifiers: XGBoost, Decision Tree, CatBoost, Extremely Randomized Trees, and Random Forest. The Area under the Precision-Recall…

Citation impact

482
total citations
FWCI
107.84
Percentile
100%
References
26
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Feature selection
  • Computational Science and Engineering
  • Feature (linguistics)
  • Selection (genetic algorithm)
  • Value (mathematics)
  • Data science
  • Artificial intelligence
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.

Funding