Selecting optimal partitioning schemes for phylogenomic datasets
Australian National University · National Evolutionary Synthesis Center · +3 more institutions
Abstract
Partitioning involves estimating independent models of molecular evolution for different subsets of sites in a sequence alignment, and has been shown to improve phylogenetic inference. Current methods for estimating best-fit partitioning schemes, however, are only computationally feasible with datasets of fewer than 100 loci. This is a problem because datasets with thousands of loci are increasingly common in phylogenetics.
We develop two novel methods for estimating best-fit partitioning schemes on large phylogenomic datasets: strict and relaxed hierarchical clustering. These methods use information from the underlying data to cluster together similar subsets of sites in an alignment, and build on clustering approaches that have been proposed elsewhere.
Citation impact
- FWCI
- 26.46
- Percentile
- 100%
- References
- 49
Authors
5- RLRobert LanfearCorresponding
Australian National University, National Evolutionary Synthesis Center
- BCBrett Calcott
Australian National University
- DKDavid Kainer
Australian National University
- CMChristoph Mayer
Zoological Research Museum Alexander Koenig
- ASAlexandros Stamatakis
Heidelberg Institute for Theoretical Studies, Karlsruhe Institute of Technology
Topics & keywords
- Cluster analysis
- Computer science
- Scalability
- Inference
- Hierarchical clustering
- Data mining
- Machine learning
- Artificial intelligence