Phylogeny-Aware Gap Placement Prevents Errors in Sequence Alignment and Evolutionary Analysis
European Bioinformatics Institute
Abstract
Genetic sequence alignment is the basis of many evolutionary and comparative studies, and errors in alignments lead to errors in the interpretation of evolutionary information in genomes. Traditional multiple sequence alignment methods disregard the phylogenetic implications of gap patterns that they create and infer systematically biased alignments with excess deletions and substitutions, too few insertions, and implausible insertion-deletion-event histories. We present a method that prevents these systematic errors by recognizing insertions and deletions as distinct evolutionary events. We show theoretically and practically that this improves the quality of sequence alignments and downstream analyses over a…
Citation impact
- FWCI: 17.69
- Percentile: 100%
- References: 19
Authors (2)
Topics & keywords
- Sequence (biology)
- Alignment-free sequence analysis
- Phylogenetic tree
- Multiple sequence alignment
- Sequence alignment
- Phylogenetics
- Biology
- Computational biology