The conserved domain database in 2023

Wang, Jiyao; Chitsaz, Farideh; Derbyshire, Myra K.; Gonzales, Noreen R.; Gwadz, Marc; Lu, Shennan; Marchler, Gabriele H.; Song, James S.; Thanki, Narmada; Yamashita, Roxanne A.; Yang, Mingzhang; Zhang, Dachuan; Zheng, Chanjuan; Lanczycki, Christopher J.; Marchler‐Bauer, Aron

doi:10.1093/nar/gkac1096

articleNucleic Acids ResearchDec 8, 2022GOLD OA

The conserved domain database in 2023

JWJiyao Wang FCFarideh Chitsaz MKMyra K. Derbyshire NRNoreen R. Gonzales MGMarc Gwadz

National Institutes of Health · National Center for Biotechnology Information

PubMed

Indexed incrossrefdoajpubmed

Abstract

NLM's conserved domain database (CDD) is a collection of protein domain and protein family models constructed as multiple sequence alignments. Its main purpose is to provide annotation for protein and translated nucleotide sequences with the location of domain footprints and associated functional sites, and to define protein domain architecture as a basis for assigning gene product names and putative/predicted function. CDD has been available publicly for over 20 years and has grown substantially during that time. Maintaining an archive of pre-computed annotation continues to be a challenge and has slowed down the cadence of CDD releases. CDD curation staff builds hierarchical classifications of large protein…

Citation impact

1,145

total citations

FWCI: 88.79
Percentile: 100%
References: 15

Citations per year

Authors

15

Topics & keywords

Topics

Keywords

Biology
Conserved sequence
Domain (mathematical analysis)
Computational biology
Database
Genetics
Base sequence
DNA

No related works found for this paper.

Funding

NI
National Institutes of Health