An efficient algorithm for large-scale detection of protein families
European Bioinformatics Institute
Abstract
Detection of protein families in large databases is one of the principal research objectives in structural and functional genomics. Protein family classification can significantly contribute to the delineation of functional diversity of homologous proteins, the prediction of function based on domain architecture or the presence of sequence motifs as well as comparative genomics, providing valuable evolutionary insights. We present a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families. The method relies on the Markov cluster (MCL) algorithm for the assignment of proteins into families based on precomputed sequence similarity information. This novel approach does…
Citation impact
- FWCI
- 13.92
- Percentile
- 100%
- References
- 56
Authors
1Topics & keywords
- Biology
- Cluster analysis
- Protein family
- Protein domain
- Computational biology
- Genomics
- Structural Classification of Proteins database
- Structural genomics