Gene prediction with a hidden Markov model and a new intron submodel
Universitätsmedizin Göttingen · University of Göttingen
Abstract
MOTIVATION: The problem of finding the genes in eukaryotic DNA sequences by computational methods is still not satisfactorily solved. Gene finding programs have achieved relatively high accuracy on short genomic sequences but do not perform well on longer sequences with an unknown number of genes in them. Here existing programs tend to predict many false exons. RESULTS: We have developed a new program, AUGUSTUS, for the ab initio prediction of protein coding genes in eukaryotic genomes. The program is based on a Hidden Markov Model and integrates a number of known methods and submodels. It employs a new way of modeling intron lengths. We use a new donor splice site model, a new model for a short region…
Citation impact
- FWCI
- 4.58
- Percentile
- 100%
- References
- 24
Authors
2Topics & keywords
- Executable
- Computer science
- Gene prediction
- Hidden Markov model
- splice
- RNA splicing
- Markov chain
- Computational biology
- Quality Education