Benchmark for filter methods for feature selection in high-dimensional classification data

Bommert, Andrea; Sun, Xudong; Bischl, Bernd; Rahnenführer, Jörg; Lang, Michel

doi:10.1016/j.csda.2019.106839

articleComputational Statistics & Data AnalysisSep 19, 2019HYBRID OA

Benchmark for filter methods for feature selection in high-dimensional classification data

ABAndrea Bommert XSXudong Sun BBBernd Bischl JRJörg Rahnenführer MLMichel Lang

TU Dortmund University · Ludwig-Maximilians-Universität München

Indexed incrossref

Abstract

Feature selection is one of the most fundamental problems in machine learning and has drawn increasing attention due to high-dimensional data sets emerging from different fields like bioinformatics. For feature selection, filter methods play an important role, since they can be combined with any machine learning model and can heavily reduce run time of machine learning algorithms. The aim of the analyses is to review how different filter methods work, to compare their performance with respect to both run time and predictive accuracy, and to provide guidance for applications. Based on 16 high-dimensional classification data sets, 22 filter methods are analyzed with respect to run time and accuracy when combined…

Citation impact

719

total citations

FWCI: 26.66
Percentile: 100%
References: 101

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Filter (signal processing)
Computer science
Feature selection
Machine learning
Artificial intelligence
Benchmark (surveying)
Feature (linguistics)
Data mining

No related works found for this paper.

Funding

DF
Deutsche Forschungsgemeinschaft
Awards: RA870/7-1, BI1902/1-1