COG database update: focus on microbial diversity, model organisms, and widespread pathogens
National Institutes of Health · National Center for Biotechnology Information
Abstract
The Clusters of Orthologous Genes (COG) database, also referred to as the Clusters of Orthologous Groups of proteins, was created in 1997 and went through several rounds of updates, most recently, in 2014. The current update, available at https://www.ncbi.nlm.nih.gov/research/COG, substantially expands the scope of the database to include complete genomes of 1187 bacteria and 122 archaea, typically, with a single genome per genus. In addition, the current version of the COGs includes the following new features: (i) the recently deprecated NCBI's gene index (gi) numbers for the encoded proteins are replaced with stable RefSeq or GenBank\ENA\DDBJ coding sequence (CDS) accession numbers; (ii) COG annotations are…
Citation impact
- FWCI
- 41.84
- Percentile
- 100%
- References
- 73
Authors
6- MYMichael Y. GalperinCorresponding
National Institutes of Health, National Center for Biotechnology Information
- YIYuri I. Wolf
National Institutes of Health, National Center for Biotechnology Information
- KSKira S. Makarova
National Institutes of Health, National Center for Biotechnology Information
- RVRoberto Vera Alvarez
National Institutes of Health, National Center for Biotechnology Information
- DLDavid Landsman
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- RefSeq
- GenBank
- Biology
- Cog
- Sequence database
- Genome
- Archaea
- UniProt