The conserved domain database in 2023
National Institutes of Health · National Center for Biotechnology Information
Abstract
NLM's conserved domain database (CDD) is a collection of protein domain and protein family models constructed as multiple sequence alignments. Its main purpose is to provide annotation for protein and translated nucleotide sequences with the location of domain footprints and associated functional sites, and to define protein domain architecture as a basis for assigning gene product names and putative/predicted function. CDD has been available publicly for over 20 years and has grown substantially during that time. Maintaining an archive of pre-computed annotation continues to be a challenge and has slowed down the cadence of CDD releases. CDD curation staff builds hierarchical classifications of large protein…
Citation impact
- FWCI: 88.79
- Percentile: 100%
- References: 15
Authors (15)
- Jiyao Wang
National Institutes of Health, National Center for Biotechnology Information
- Farideh Chitsaz
National Institutes of Health, National Center for Biotechnology Information
- Myra K. Derbyshire
National Institutes of Health, National Center for Biotechnology Information
- Noreen R. Gonzales
National Institutes of Health, National Center for Biotechnology Information
- Marc Gwadz
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- Biology
- Conserved sequence
- Domain (mathematical analysis)
- Computational biology
- Database
- Genetics
- Base sequence
- DNA