Selecting critical features for data classification based on machine learning methods
Chaoyang University of Technology · Satya Wacana Christian University · +1 more institution
Abstract
Abstract Feature selection becomes prominent, especially in the data sets with many variables and features. It will eliminate unimportant variables and improve the accuracy as well as the performance of classification. Random Forest has emerged as a quite useful algorithm that can handle the feature selection issue even with a higher number of variables. In this paper, we use three popular datasets with a higher number of variables (Bank Marketing, Car Evaluation Database, Human Activity Recognition Using Smartphones) to conduct the experiment. There are four main reasons why feature selection is essential. First, to simplify the model by reducing the number of parameters, next to decrease the training time,…
Citation impact
- FWCI
- 43.65
- Percentile
- 100%
- References
- 123
Authors
4Topics & keywords
- Random forest
- Feature selection
- Computer science
- Support vector machine
- Artificial intelligence
- Curse of dimensionality
- Linear discriminant analysis
- Machine learning