A Bayesian Mixture Model for Across-Site Heterogeneities in the Amino-Acid Replacement Process
Canadian Institute for Advanced Research · Université de Montréal
Abstract
Most current models of sequence evolution assume that all sites of a protein evolve under the same substitution process, characterized by a 20 x 20 substitution matrix. Here, we propose to relax this assumption by developing a Bayesian mixture model that allows the amino-acid replacement pattern at different sites of a protein alignment to be described by distinct substitution processes. Our model, named CAT, assumes the existence of distinct processes (or classes) differing by their equilibrium frequencies over the 20 residues. Through the use of a Dirichlet process prior, the total number of classes and their respective amino-acid profiles, as well as the affiliations of each site to a given class, are all…
Citation impact
- FWCI
- 25.90
- Percentile
- 100%
- References
- 60
Authors
2Topics & keywords
- Biology
- Process (computing)
- Bayesian probability
- Computational biology
- Evolutionary biology
- Artificial intelligence
- Computer science