Estimating Gene Gain and Loss Rates in the Presence of Error in Genome Assembly and Annotation Using CAFE 3
National Evolutionary Synthesis Center · Indiana University
Abstract
Current sequencing methods produce large amounts of data, but genome assemblies constructed from these data are often fragmented and incomplete. Incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. This means that methods attempting to estimate rates of gene duplication and loss often will be misled by such errors and that rates of gene family evolution will be consistently overestimated. Here, we present a method that takes these errors into account, allowing one to accurately infer rates of gene gain and loss among genomes even with low assembly and annotation quality. The method is implemented in the newest version of the software…
Citation impact
- FWCI
- 7.78
- Percentile
- 100%
- References
- 44
Authors
4Topics & keywords
- Annotation
- Genome
- Biology
- Gene Annotation
- Gene duplication
- Gene
- Computational biology
- Gene family