articleJournal of Bioinformatics and Computational BiologyApr 1, 2005Closed access

MINIMUM REDUNDANCY FEATURE SELECTION FROM MICROARRAY GENE EXPRESSION DATA

Lawrence Berkeley National Laboratory · University of California, Berkeley

PubMed
Indexed incrossrefpubmed

Abstract

How to selecting a small subset out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. We propose a minimum redundancy - maximum relevance (MRMR) feature selection framework. Genes selected via MRMR provide a more balanced coverage of the space and capture broader characteristics of phenotypes. They lead to significantly improved class predictions in extensive experiments on 6 gene expression data sets: NCI, Lymphoma, Lung, Child…

Citation impact

2,783
total citations
FWCI
13.27
Percentile
100%
References
23
Citations per year

Authors

2

Topics & keywords

Keywords
  • Feature selection
  • Minimum redundancy feature selection
  • Redundancy (engineering)
  • Support vector machine
  • Bayes' theorem
  • Gene
  • Microarray analysis techniques
  • Computational biology
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.

Funding