Modeling Site Heterogeneity with Posterior Mean Site Frequency Profiles Accelerates Accurate Phylogenomic Estimation
Dalhousie University · Max Perutz Labs · +1 more institution
Abstract
Proteins have distinct structural and functional constraints at different sites that lead to site-specific preferences for particular amino acid residues as the sequences evolve. Heterogeneity in the amino acid substitution process between sites is not modeled by commonly used empirical amino acid exchange matrices. Such model misspecification can lead to artefacts in phylogenetic estimation such as long-branch attraction. Although sophisticated site-heterogeneous mixture models have been developed to address this problem in both Bayesian and maximum likelihood (ML) frameworks, their formidable computational time and memory usage severely limits their use in large phylogenomic analyses. Here we propose a…
Citation impact
- FWCI
- 15.47
- Percentile
- 100%
- References
- 58
Authors
4Topics & keywords
- PMSF
- Tree (set theory)
- Mixture model
- Speedup
- Biological system
- Computer science
- Mathematics
- Biology