Boruta – A System for Feature Selection

Kursa, Miron B.; Jankowski, Aleksander; Rudnicki, Witold R.

doi:10.3233/fi-2010-288

Article, Fundamenta Informaticae, Jan 1, 2010, Closed access

Miron B. Kursa, Aleksander Jankowski, Witold R. Rudnicki

University of Warsaw

Indexed in Crossref

Abstract

Machine learning methods are often used to classify objects described by hundreds of attributes; in many applications of this kind a large fraction of the attributes may be totally irrelevant to the classification problem. Moreover, one usually cannot decide a priori which attributes are relevant. In this paper we present an improved version of the algorithm for identifying the full set of truly important variables in an information system. It is an extension of the random forest method which utilises the importance measure generated by the original algorithm. It iteratively compares the importances of the original attributes with the importances of their randomised copies. We analyse performance of…
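
The comparison described in the abstract, original attributes against their randomised copies, can be illustrated with a minimal sketch. It is not the authors' implementation. To stay dependency-free it replaces the random forest importance with a stand-in measure (absolute Pearson correlation with the label), and it only counts how often each attribute beats the best shuffled copy, without any statistical test. The names `abs_correlation` and `shadow_hits`, the "beat the best shuffled copy" criterion, and the parameters `n_iter` and `seed` are assumptions made for illustration.

```python
import random


def abs_correlation(xs, ys):
    """Stand-in importance measure (an assumption, not random forest
    importance): absolute Pearson correlation between xs and ys."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no information
    return abs(cov) / (var_x * var_y) ** 0.5


def shadow_hits(columns, labels, n_iter=30, seed=0):
    """For each attribute, count the iterations in which its importance
    exceeds the best importance among the shuffled (randomised) copies."""
    rng = random.Random(seed)
    hits = {name: 0 for name in columns}
    for _ in range(n_iter):
        # Shuffling a column keeps its distribution but breaks any
        # relationship with the labels.
        shadow_scores = []
        for values in columns.values():
            shadow = list(values)
            rng.shuffle(shadow)
            shadow_scores.append(abs_correlation(shadow, labels))
        threshold = max(shadow_scores)
        for name, values in columns.items():
            if abs_correlation(values, labels) > threshold:
                hits[name] += 1
    return hits
```

Calling `shadow_hits({"a": [...], "b": [...]}, labels)` returns a dictionary of hit counts per attribute; attributes that repeatedly outscore every shuffled copy are candidates for being relevant, while those that rarely do are candidates for rejection.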

Citation impact

Total citations: 855

FWCI: 9.32
Percentile: 100%
References: 22

Authors: 3

Topics & keywords

Keywords

Selection (genetic algorithm)
Computer science
Feature (linguistics)
Feature selection
Artificial intelligence
Linguistics
Philosophy
