articleJournal of Statistical SoftwareJan 1, 2010DIAMOND OA

Feature Selection with the Boruta Package

Indexed incrossrefdoaj

Abstract

This article describes a R package Boruta, implementing a novel feature selection algorithm for finding emph{all relevant variables}. The algorithm is designed as a wrapper around a Random Forest classification algorithm. It iteratively removes the features which are proved by a statistical test to be less relevant than random probes. The Boruta package provides a convenient interface to the algorithm. The short description of the algorithm and examples of its application are presented.

Citation impact

5,132
total citations
FWCI
22.37
Percentile
100%
References
12
Citations per year

Authors

2

Topics & keywords

Keywords
  • Feature (linguistics)
  • Feature selection
  • Selection (genetic algorithm)
  • Computer science
  • R package
  • Algorithm
  • Random forest
  • Interface (matter)
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.