Expanded microbial genome coverage and improved protein family annotation in the COG database
National Institutes of Health · National Center for Biotechnology Information
Abstract
Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) orthology-based approach, which used the function(s) of the characterized member(s) of the…
Citation impact
- FWCI
- 33.86
- Percentile
- 100%
- References
- 77
Authors
4- MYMichael Y. GalperinCorresponding
National Institutes of Health, National Center for Biotechnology Information
- KSKira S. Makarova
National Institutes of Health, National Center for Biotechnology Information
- YIYuri I. Wolf
National Institutes of Health, National Center for Biotechnology Information
- EVEugene V. Koonin
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- Cog
- Annotation
- Genome
- Biology
- Computational biology
- Function (biology)
- Genome project
- Genetics