Metagenes and molecular pattern discovery using matrix factorization
Broad Institute · Harvard University · +2 more institutions
Abstract
We describe here the use of nonnegative matrix factorization (NMF), an algorithm based on decomposition by parts that can reduce the dimension of expression data from thousands of genes to a handful of metagenes. Coupled with a model selection mechanism, adapted to work for any stochastic clustering algorithm, NMF is an efficient method for identification of distinct molecular patterns and provides a powerful method for class discovery. We demonstrate the ability of NMF to recover meaningful biological information from cancer-related microarray data. NMF appears to have advantages over other methods such as hierarchical clustering or self-organizing maps. We found it less sensitive to a priori selection of…
Citation impact
- FWCI
- 10.61
- Percentile
- 100%
- References
- 16
Authors
4- JBJean-Philippe Brunet
Broad Institute, Harvard University, Dana-Farber Cancer Institute, Massachusetts Institute of Technology
- PTPablo Tamayo
Broad Institute, Harvard University, Dana-Farber Cancer Institute, Massachusetts Institute of Technology
- TRTodd R. Golub
Broad Institute, Harvard University, Dana-Farber Cancer Institute, Massachusetts Institute of Technology
- JPJill P. MesirovCorresponding
Broad Institute, Harvard University, Dana-Farber Cancer Institute, Massachusetts Institute of Technology
Topics & keywords
- Non-negative matrix factorization
- Cluster analysis
- Matrix decomposition
- Computer science
- Context (archaeology)
- Computational biology
- Pattern recognition (psychology)
- A priori and a posteriori