Selecting critical features for data classification based on machine learning methods

Chen, Rung-Ching; Dewi, Christine; Huang, Su-Wen; Caraka, Rezzy Eko

doi:10.1186/s40537-020-00327-4

articleJournal Of Big DataJul 23, 2020GOLD OA

Selecting critical features for data classification based on machine learning methods

RCRung-Ching Chen CDChristine Dewi SHSu-Wen Huang RERezzy Eko Caraka

Chaoyang University of Technology · Satya Wacana Christian University · +1 more institution

Indexed incrossrefdoaj

Abstract

Abstract Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection is essential. First, to simplify the model by reducing the number of parameters, next to decrease the training time,…

Citation impact

970

total citations

FWCI: 43.65
Percentile: 100%
References: 123

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Random forest
Feature selection
Computer science
Support vector machine
Artificial intelligence
Curse of dimensionality
Linear discriminant analysis
Machine learning

No related works found for this paper.