Protein complex prediction via cost-based clustering
University of Toronto · Ontario Institute for Cancer Research
Abstract
We have developed the Restricted Neighborhood Search Clustering Algorithm (RNSC) to efficiently partition networks into clusters using a cost function. We applied this cost-based clustering algorithm to PPI networks of Saccharomyces cerevisiae, Drosophila melanogaster and Caenorhabditis elegans to identify and predict protein complexes. We have determined functional and graph-theoretic properties of true protein complexes from the MIPS database. Based on these properties, we defined filters to distinguish between identified network clusters and true protein complexes.
Our application of the cost-based clustering algorithm provides an accurate and scalable method of detecting and predicting protein complexes within a PPI network.
Citation impact
- FWCI
- 5.54
- Percentile
- 100%
- References
- 36
Authors
3Topics & keywords
- Cluster analysis
- Computer science
- Scalability
- Data mining
- Partition (number theory)
- Identification (biology)
- Biological network
- Protein function prediction