articleNucleic Acids ResearchOct 6, 2020GOLD OA

Pfam: The protein families database in 2021

European Bioinformatics Institute · Stockholm University · +2 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

The Pfam database is a widely used resource for classifying protein sequences into families and domains. Since Pfam was last described in this journal, over 350 new families have been added in Pfam 33.1 and numerous improvements have been made to existing entries. To facilitate research on COVID-19, we have revised the Pfam entries that cover the SARS-CoV-2 proteome, and built new entries for regions that were not covered by Pfam. We have reintroduced Pfam-B which provides an automatically generated supplement to Pfam and contains 136 730 novel clusters of sequences that are not yet matched by a Pfam family. The new Pfam-B is based on a clustering by the MMseqs2 software. We have compared all of the regions in…

Citation impact

7,711
total citations
FWCI
313.15
Percentile
100%
References
29
Citations per year

Authors

12

Topics & keywords

Keywords
  • Biology
  • UniProt
  • Computational biology
  • Proteome
  • Database
  • Bioinformatics
  • Genetics
  • Computer science
UN Sustainable Development Goals
  • Partnerships for the goals
No related works found for this paper.

Funding