Quality assessment of gene repertoire annotations with OMArk
SIB Swiss Institute of Bioinformatics · University of Lausanne · +1 more institution
Abstract
In the era of biodiversity genomics, it is crucial to ensure that annotations of protein-coding gene repertoires are accurate. State-of-the-art tools to assess genome annotations measure the completeness of a gene repertoire but are blind to other errors, such as gene overprediction or contamination. We introduce OMArk, a software package that relies on fast, alignment-free sequence comparisons between a query proteome and precomputed gene families across the tree of life. OMArk assesses not only the completeness but also the consistency of the gene repertoire as a whole relative to closely related species and reports likely contamination events. Analysis of 1,805 UniProt Eukaryotic Reference Proteomes with…
Citation impact
- FWCI
- 34.40
- Percentile
- 100%
- References
- 30
Authors
7- YNYannis NeversCorresponding
SIB Swiss Institute of Bioinformatics, University of Lausanne
- AWAlex Warwick Vesztrocy
SIB Swiss Institute of Bioinformatics, University of Lausanne
- VRVictor Rossier
SIB Swiss Institute of Bioinformatics, University of Lausanne
- CTClément-Marie Train
University of Lausanne
- AAAdrian Altenhoff
SIB Swiss Institute of Bioinformatics, ETH Zurich
Topics & keywords
- UniProt
- Proteome
- Gene Annotation
- Repertoire
- Computational biology
- Genome
- Gene
- Biology
- Life in Land