articleBMC BioinformaticsAug 18, 2010GOLD OA

Hidden Markov model speed heuristic and iterative HMM search procedure

Washington University in St. Louis · Janelia Research Campus · +2 more institutions

PubMed
Indexed incrossrefdoajpubmed

Abstract

Background

Profile hidden Markov models (profile-HMMs) are sensitive tools for remote protein homology detection, but the main scoring algorithms, Viterbi or Forward, require considerable time to search large sequence databases.

Results

We have designed a series of database filtering steps, HMMERHEAD, that are applied prior to the scoring algorithms, as implemented in the HMMER package, in an effort to reduce search time. Using this heuristic, we obtain a 20-fold decrease in Forward and a 6-fold decrease in Viterbi search time with a minimal loss in sensitivity relative to the unfiltered approaches. We then implemented an iterative profile-HMM search method, JackHMMER, which employs the HMMERHEAD heuristic. Due to our search heuristic, we eliminated the subdatabase creation that is common in current iterative profile-HMM approaches. On our benchmark, JackHMMER detects 14% more remote protein homologs than SAM's iterative method T2K.

Citation impact

1,501
total citations
FWCI
10.07
Percentile
100%
References
17
Citations per year

Authors

3

Topics & keywords

Keywords
  • Hidden Markov model
  • Viterbi algorithm
  • Computer science
  • Beam search
  • Heuristic
  • Benchmark (surveying)
  • Search algorithm
  • Pattern recognition (psychology)
No related works found for this paper.

Funding