articleNucleic Acids ResearchDec 17, 2004HYBRID OA

NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

National Institutes of Health · National Center for Biotechnology Information

PubMed
Indexed incrossrefdoajpubmed

Abstract

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked,…

Citation impact

1,663
total citations
FWCI
38.45
Percentile
100%
References
19
Citations per year

Authors

1

Topics & keywords

Keywords
  • RefSeq
  • GenBank
  • Biology
  • Sequence database
  • Ensembl
  • Annotation
  • Genome
  • dbSNP
No related works found for this paper.