UniProt: the universal protein knowledgebase
?
Indexed incrossrefdoajpubmed
Abstract
The UniProt knowledgebase is a large resource of protein sequences and associated detailed annotation. The database contains over 60 million sequences, of which over half a million sequences have been curated by experts who critically review experimental and predicted data for each protein. The remainder are automatically annotated based on rule systems that rely on the expert curated knowledge. Since our last update in 2014, we have more than doubled the number of reference proteomes to 5631, giving a greater coverage of taxonomic diversity. We implemented a pipeline to remove redundant highly similar proteomes that were causing excessive redundancy in UniProt. The initial run of this pipeline reduced the…
Citation impact
4,663
total citations
- FWCI
- 398.82
- Percentile
- 100%
- References
- 30
Citations per year
Authors
1- ?Corresponding
Topics & keywords
Topics
Keywords
- UniProt
- Proteome
- Biology
- SPARQL
- Annotation
- Pipeline (software)
- Human proteome project
- Computational biology
No related works found for this paper.