articleJournal Of Big DataJul 23, 2020GOLD OA

Selecting critical features for data classification based on machine learning methods

Chaoyang University of Technology · Satya Wacana Christian University · +1 more institution

Indexed incrossrefdoaj

Abstract

Abstract Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection is essential. First, to simplify the model by reducing the number of parameters, next to decrease the training time,…

Citation impact

970
total citations
FWCI
43.65
Percentile
100%
References
123
Citations per year

Authors

4

Topics & keywords

Keywords
  • Random forest
  • Feature selection
  • Computer science
  • Support vector machine
  • Artificial intelligence
  • Curse of dimensionality
  • Linear discriminant analysis
  • Machine learning
No related works found for this paper.

Funding