NCBI Reference Sequences: current status, policy and new initiatives
National Institutes of Health · National Center for Biotechnology Information
Abstract
NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 x 10(6) proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for…
Citation impact
- FWCI
- 26.53
- Percentile
- 100%
- References
- 15
Authors
4- KDKim D. PruittCorresponding
National Institutes of Health, National Center for Biotechnology Information
- TTTatiana Tatusova
National Institutes of Health, National Center for Biotechnology Information
- WKWilliam Klimke
National Institutes of Health, National Center for Biotechnology Information
- DMDonna Maglott
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- RefSeq
- Annotation
- Biology
- Genome
- Ensembl
- Sequence (biology)
- Gene Annotation
- Computational biology
- Life in Land