Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy
Lawrence Berkeley National Laboratory · University of California, Berkeley
Abstract
Feature selection is an important problem for pattern classification systems. We study how to select good features according to the maximal statistical dependency criterion based on mutual information. Because of the difficulty in directly implementing the maximal dependency condition, we first derive an equivalent form, called minimal-redundancy-maximal-relevance criterion (mRMR), for first-order incremental feature selection. Then, we present a two-stage feature selection algorithm by combining mRMR and other more sophisticated feature selectors (e.g., wrappers). This allows us to select a compact set of superior features at very low cost. We perform extensive experimental comparison of our algorithm and…
Citation impact
- FWCI
- 26.53
- Percentile
- 100%
- References
- 37
Authors
3- HPHanchuan PengCorresponding
Lawrence Berkeley National Laboratory, University of California, Berkeley
- FLFuhui Long
Lawrence Berkeley National Laboratory, University of California, Berkeley
- CDChen Ding
Lawrence Berkeley National Laboratory
Topics & keywords
- Feature selection
- Mutual information
- Pattern recognition (psychology)
- Redundancy (engineering)
- Minimum redundancy feature selection
- Artificial intelligence
- Computer science
- Support vector machine
- Reduced inequalities