Article · Nature Communications · Nov 15, 2022 · Gold OA

Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny

Oldham Council

Indexed in: Crossref, DOAJ, PubMed

Abstract

Multiple sequence alignments are widely used to infer evolutionary relationships, enabling inferences of structure, function, and phylogeny. Standard practice is to construct one alignment by some preferred method and use it in further analysis; however, undetected alignment bias can be problematic. I describe Muscle5, a novel algorithm which constructs an ensemble of high-accuracy alignments with diverse biases by perturbing a hidden Markov model and permuting its guide tree. Confidence in an inference is assessed as the fraction of the ensemble which supports it. Applied to phylogenetic tree estimation, I show that ensembles can confidently resolve topologies with low bootstrap according to standard methods,…
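The confidence measure stated in the abstract (the fraction of the ensemble that supports an inference) can be illustrated with a minimal sketch. This is not Muscle5's implementation: the function name `ensemble_confidence`, the representation of each replicate as a set of clades, and the toy data are all assumptions made for illustration only.

```python
from typing import FrozenSet, Iterable, Set

# Hypothetical representation: a clade is the set of leaf names it contains.
Clade = FrozenSet[str]


def ensemble_confidence(clade: Clade, replicate_clades: Iterable[Set[Clade]]) -> float:
    """Return the fraction of ensemble replicates whose tree contains `clade`."""
    replicates = list(replicate_clades)
    if not replicates:
        raise ValueError("ensemble is empty")
    # Count the replicates in which the clade of interest appears.
    supporting = sum(1 for clades in replicates if clade in clades)
    return supporting / len(replicates)


# Toy ensemble (made-up data): each entry lists the clades of a tree
# estimated from one alignment in the ensemble.
ensemble = [
    {frozenset({"A", "B"}), frozenset({"A", "B", "C"})},
    {frozenset({"A", "B"}), frozenset({"C", "D"})},
    {frozenset({"A", "C"}), frozenset({"A", "B", "C"})},
    {frozenset({"A", "B"}), frozenset({"A", "B", "C"})},
]

confidence_ab = ensemble_confidence(frozenset({"A", "B"}), ensemble)
print(confidence_ab)
```

In this toy example the clade {A, B} appears in three of four replicates, so its confidence under this definition is 0.75.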

Citation impact

Total citations: 1,028
FWCI: 75.62
Percentile: 100%
References: 34

Authors

1

Topics & keywords

Keywords
  • Phylogenetic tree
  • Phylogenetics
  • Multiple sequence alignment
  • Inference
  • Computer science
  • Computational biology
  • Sequence alignment
  • Tree (set theory)