articleNucleic Acids ResearchNov 26, 2014GOLD OA

Expanded microbial genome coverage and improved protein family annotation in the COG database

National Institutes of Health · National Center for Biotechnology Information

PubMed
Indexed incrossrefdoajpubmed

Abstract

Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) orthology-based approach, which used the function(s) of the characterized member(s) of the…

No related works found for this paper.