Article · PubMed Central · Jan 1, 2002 · Green OA

The Pfam Protein Families Database

Bateman, Alex; Birney, Ewan; Cerruti, Lorenzo; Durbin, Richard; Etwiller, Laurence

Wellcome Sanger Institute

Abstract

Pfam is a large collection of protein multiple sequence alignments and profile hidden Markov models. Pfam is available on the World Wide Web in the UK at http://www.sanger.ac.uk/Software/Pfam/, in Sweden at http://www.cgb.ki.se/Pfam/, in France at http://pfam.jouy.inra.fr/ and in the US at http://pfam.wustl.edu/. The latest version (6.6) of Pfam contains 3071 families, which match 69% of proteins in SWISS-PROT 39 and TrEMBL 14. Structural data, where available, have been utilised to ensure that Pfam families correspond with structural domains, and to improve domain-based annotation. Predictions of non-domain regions are now also included. In addition to secondary structure, Pfam multiple sequence alignments…

Citation impact

Total citations: 770
FWCI: 77.90
Percentile: 100%
References: 14

Authors

10 authors
  • Bateman, Alex (corresponding author), Wellcome Sanger Institute
  • Birney, Ewan
  • Cerruti, Lorenzo
  • Durbin, Richard
  • Etwiller, Laurence

Topics & keywords

Keywords
  • UniProt
  • Biology
  • Computational biology
  • Annotation
  • Protein domain
  • Sequence alignment
  • Protein family
  • Sequence database
UN Sustainable Development Goals
  • Partnerships for the goals