Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions
European Bioinformatics Institute · Wellcome Sanger Institute
Abstract
Detection of protein homology via sequence similarity has important applications in biology, from protein structure and function prediction to reconstruction of phylogenies. Although current methods for aligning protein sequences are powerful, challenges remain, including problems with homologous overextension of alignments and with regions under convergent evolution. Here, we test the ability of the profile hidden Markov model method HMMER3 to correctly assign homologous sequences to >13,000 manually curated families from the Pfam database. We identify problem families using protein regions that match two or more Pfam families not currently annotated as related in Pfam. We find that HMMER3 E-value estimates…
Citation impact
- FWCI
- 9.12
- Percentile
- 100%
- References
- 67
Authors
5- JMJaina Mistry
European Bioinformatics Institute, Wellcome Sanger Institute
- RFROBERT FINN
Wellcome Sanger Institute, European Bioinformatics Institute
- SRSean R. Eddy
Wellcome Sanger Institute, European Bioinformatics Institute
- ABAlex Bateman
European Bioinformatics Institute, Wellcome Sanger Institute
- MPMarco PuntaCorresponding
Wellcome Sanger Institute, European Bioinformatics Institute
Topics & keywords
- Biology
- Computational biology
- Homology (biology)
- Protein function prediction
- Protein family
- Sequence alignment
- Convergent evolution
- Protein sequencing
- Reduced inequalities