MACSE: Multiple Alignment of Coding SEquences Accounting for Frameshifts and Stop Codons
Institut des Sciences de l'Evolution de Montpellier · Centre National de la Recherche Scientifique · Université de Montpellier · IMT Mines Alès
Abstract
Until now, the most efficient solution for aligning nucleotide sequences containing open reading frames was to use indirect procedures that align the amino acid translations before reporting the inferred gap positions at the codon level. There are two important pitfalls with this approach. Firstly, any premature stop codon impedes using such a strategy. Secondly, each sequence is translated with the same reading frame from beginning to end, so that the presence of a single additional nucleotide leads to both aberrant translation and aberrant alignment. We present an algorithm that has the same space and time complexity as the classical Needleman-Wunsch algorithm while accommodating sequencing errors and other biological…
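The second pitfall can be illustrated with a minimal Python sketch. It is not part of MACSE and does not reproduce its algorithm; it only shows, on a made-up 18-nucleotide sequence, how translating in a single fixed reading frame reacts to one inserted nucleotide. The names `translate`, `reference`, and `shifted` are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate seq in frame 1, ignoring any trailing incomplete codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical coding sequence: ATG GCT GAA CGT CTG AAA
reference = "ATGGCTGAACGTCTGAAA"

# Same sequence with one extra "T" inserted after the fourth nucleotide,
# mimicking a sequencing error or a genuine frameshift.
shifted = reference[:4] + "T" + reference[4:]

print(translate(reference))  # MAERLK
print(translate(shifted))    # MV*TSE
```

After the insertion, every downstream codon is read out of frame and a premature stop codon (`*`) appears, so a protein-level alignment of the two translations would no longer reflect the near-identity of the underlying nucleotide sequences.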
Citation impact
- FWCI: 7.18
- Percentile: 100%
- References: 57
Authors
- Vincent Ranwez (corresponding author)
Institut des Sciences de l'Evolution de Montpellier, Centre National de la Recherche Scientifique, Université de Montpellier
- Sébastien Harispe
IMT Mines Alès, Institut des Sciences de l'Evolution de Montpellier, Centre National de la Recherche Scientifique, Université de Montpellier
- Frédéric Delsuc
Institut des Sciences de l'Evolution de Montpellier, Université de Montpellier, Centre National de la Recherche Scientifique
- Emmanuel Douzery
Université de Montpellier, Institut des Sciences de l'Evolution de Montpellier, Centre National de la Recherche Scientifique
Topics & keywords
- Reading frame
- Stop codon
- Executable
- Genetics
- Coding region
- Open reading frame
- Computer science
- Computational biology
- Quality Education