EnsemblCompara GeneTrees: Complete, duplication-aware phylogenetic trees in vertebrates
European Bioinformatics Institute · Wellcome Trust · +1 more institution
Abstract
We have developed a comprehensive gene orientated phylogenetic resource, EnsemblCompara GeneTrees, based on a computational pipeline to handle clustering, multiple alignment, and tree generation, including the handling of large gene families. We developed two novel non-sequence-based metrics of gene tree correctness and benchmarked a number of tree methods. The TreeBeST method from TreeFam shows the best performance in our hands. We also compared this phylogenetic approach to clustering approaches for ortholog prediction, showing a large increase in coverage using the phylogenetic approach. All data are made available in a number of formats and will be kept up to date with the Ensembl project.
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 28
Authors
6Topics & keywords
- Phylogenetic tree
- Biology
- Phylogenetic network
- Computational phylogenetics
- Tree (set theory)
- Cluster analysis
- Pipeline (software)
- Computational biology