Feature Selection with the Boruta Package
Indexed incrossrefdoaj
Abstract
This article describes a R package Boruta, implementing a novel feature selection algorithm for finding emph{all relevant variables}. The algorithm is designed as a wrapper around a Random Forest classification algorithm. It iteratively removes the features which are proved by a statistical test to be less relevant than random probes. The Boruta package provides a convenient interface to the algorithm. The short description of the algorithm and examples of its application are presented.
Citation impact
5,132
total citations
- FWCI
- 22.37
- Percentile
- 100%
- References
- 12
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Feature (linguistics)
- Feature selection
- Selection (genetic algorithm)
- Computer science
- R package
- Algorithm
- Random forest
- Interface (matter)
UN Sustainable Development Goals
- Life in Land
No related works found for this paper.