RefSeq: an update on mammalian reference sequences
National Institutes of Health · National Center for Biotechnology Information
Abstract
The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.nih.gov/refseq/). We report here on growth of the mammalian and human subsets, changes to NCBI's eukaryotic annotation pipeline and modifications affecting transcript and protein records. Recent changes to NCBI's eukaryotic genome annotation pipeline provide higher throughput, and the addition of RNAseq data to the pipeline results in a significant expansion of the number of transcripts and novel exons annotated on mammalian…
Citation impact
- FWCI
- 66.71
- Percentile
- 100%
- References
- 28
Authors
29- KDKim D. PruittCorresponding
National Institutes of Health, National Center for Biotechnology Information
- GBGarth Brown
National Center for Biotechnology Information, National Institutes of Health
- SMSusan M. Hiatt
National Center for Biotechnology Information, National Institutes of Health
- FTFrançoise Thibaud‐Nissen
National Center for Biotechnology Information, National Institutes of Health
- AAAlexander Astashyn
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- RefSeq
- Annotation
- Biology
- Ensembl
- Pipeline (software)
- Computational biology
- Genome project
- Genome
- Partnerships for the goals