An automated method for finding molecular complexes in large protein interaction networks
Lunenfeld-Tanenbaum Research Institute · Memorial Sloan Kettering Cancer Center · +2 more institutions
Abstract
Recent advances in proteomics technologies such as two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of biomolecular interaction networks. Initial mapping efforts have already produced a wealth of data. As the size of the interaction set increases, databases and computational methods will be required to store, visualize and analyze the information in order to effectively aid in knowledge discovery.
This paper describes a novel graph theoretic clustering algorithm, "Molecular Complex Detection" (MCODE), that detects densely connected regions in large protein-protein interaction networks that may represent molecular complexes. The method is based on vertex weighting by local neighborhood density and outward traversal from a locally dense seed protein to isolate the dense regions according to given parameters. The algorithm has the advantage over other graph clustering methods of having a directed mode that allows fine-tuning of clusters of interest without considering the rest of the network and allows examination of cluster interconnectivity, which is relevant for protein networks. Protein interaction and complex information from the yeast Saccharomyces cerevisiae was used for evaluation.
Citation impact
- FWCI
- 14.89
- Percentile
- 100%
- References
- 47
Authors
2Topics & keywords
- Computer science
- Tree traversal
- Cluster analysis
- Protein Interaction Networks
- Protein–protein interaction
- Interaction network
- Data mining
- False positive paradox