Selecting optimal partitioning schemes for phylogenomic datasets

Lanfear, Robert; Calcott, Brett; Kainer, David; Mayer, Christoph; Stamatakis, Alexandros

doi:10.1186/1471-2148-14-82

Article · BMC Evolutionary Biology · Jan 1, 2014 · Gold OA

Australian National University · National Evolutionary Synthesis Center · +3 more institutions

Indexed in: Crossref, DOAJ, PubMed

Abstract

Background

Partitioning involves estimating independent models of molecular evolution for different subsets of sites in a sequence alignment, and has been shown to improve phylogenetic inference. Current methods for estimating best-fit partitioning schemes, however, are only computationally feasible with datasets of fewer than 100 loci. This is a problem because datasets with thousands of loci are increasingly common in phylogenetics.

Methods

We develop two novel methods for estimating best-fit partitioning schemes on large phylogenomic datasets: strict and relaxed hierarchical clustering. These methods use information from the underlying data to cluster together similar subsets of sites in an alignment, and build on clustering approaches that have been proposed elsewhere.
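The general idea of clustering similar subsets can be illustrated with a minimal sketch of a greedy agglomerative loop. This is not the authors' implementation: the per-locus parameter vectors, the Euclidean distance, the site-weighted averaging used when merging (a stand-in for whatever model re-estimation a real analysis would perform), and the `toy_score` function (a stand-in for an information criterion, lower is better) are all assumptions made for illustration.

```python
from itertools import combinations
from math import dist


def merge_closest(scheme):
    """Merge the two subsets whose parameter vectors are closest.

    `scheme` maps a frozenset of locus names to (n_sites, params).
    Euclidean distance and site-weighted averaging are assumptions.
    """
    a, b = min(combinations(scheme, 2),
               key=lambda pair: dist(scheme[pair[0]][1], scheme[pair[1]][1]))
    (na, pa), (nb, pb) = scheme[a], scheme[b]
    n = na + nb
    merged = tuple((na * x + nb * y) / n for x, y in zip(pa, pb))
    out = {k: v for k, v in scheme.items() if k not in (a, b)}
    out[a | b] = (n, merged)
    return out


def greedy_cluster(scheme, score):
    """Merge the closest pair until one subset remains; return the
    best-scoring scheme seen along the way (lower score is better)."""
    best, best_score = scheme, score(scheme)
    current = scheme
    while len(current) > 1:
        current = merge_closest(current)
        s = score(current)
        if s < best_score:
            best, best_score = current, s
    return best


def toy_score(loci, penalty=1.0):
    """Hypothetical score: site-weighted misfit of each locus to its
    cluster's parameters, plus a fixed penalty per subset."""
    def score(scheme):
        misfit = sum(
            loci[frozenset([name])][0]
            * dist(loci[frozenset([name])][1], params) ** 2
            for cluster, (_, params) in scheme.items()
            for name in cluster
        )
        return misfit + penalty * len(scheme)
    return score


# Made-up example data: three loci, two of which evolve similarly.
loci = {
    frozenset(["gene1"]): (100, (1.0, 0.5)),
    frozenset(["gene2"]): (100, (1.1, 0.5)),
    frozenset(["gene3"]): (100, (3.0, 2.0)),
}
best = greedy_cluster(loci, toy_score(loci))
print(sorted(sorted(cluster) for cluster in best))
```

Each pass considers only the single closest pair rather than every possible merge, so the number of schemes scored grows linearly with the number of starting subsets. That trade of exhaustiveness for speed is the kind of saving a clustering approach aims for on large datasets.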

Citation impact

Total citations: 785

FWCI: 26.46
Percentile: 100%
References: 49

Authors: 5

Topics & keywords

Keywords

Cluster analysis
Computer science
Scalability
Inference
Hierarchical clustering
Data mining
Machine learning
Artificial intelligence

Funding

National Evolutionary Synthesis Center