RefSeq: an update on prokaryotic genome annotation and curation
National Institutes of Health · National Center for Biotechnology Information
Abstract
The Reference Sequence (RefSeq) project at the National Center for Biotechnology Information (NCBI) provides annotation for over 95 000 prokaryotic genomes that meet standards for sequence quality, completeness, and freedom from contamination. Genomes are annotated by a single Prokaryotic Genome Annotation Pipeline (PGAP) to provide users with a resource that is as consistent and accurate as possible. Notable recent changes include the development of a hierarchical evidence scheme, a new focus on curating annotation evidence sources, the addition and curation of protein profile hidden Markov models (HMMs), release of an updated pipeline (PGAP-4), and comprehensive re-annotation of RefSeq prokaryotic genomes.…
Citation impact
- FWCI: 35.54
- Percentile: 100%
- References: 30
Authors (21)
- Daniel H. Haft (corresponding author)
National Institutes of Health, National Center for Biotechnology Information
- Michael DiCuccio
National Institutes of Health, National Center for Biotechnology Information
- Azat Badretdin
National Institutes of Health, National Center for Biotechnology Information
- Vyacheslav Brover
National Institutes of Health, National Center for Biotechnology Information
- Vyacheslav Chetvernin
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- RefSeq
- Annotation
- Ensembl
- Genome project
- Genome
- Biology
- Data curation
- Computational biology