articleNature BiotechnologyFeb 21, 2024HYBRID OA

Quality assessment of gene repertoire annotations with OMArk

SIB Swiss Institute of Bioinformatics · University of Lausanne · +1 more institution

PubMed
Indexed incrossrefdatacitepubmed

Abstract

In the era of biodiversity genomics, it is crucial to ensure that annotations of protein-coding gene repertoires are accurate. State-of-the-art tools to assess genome annotations measure the completeness of a gene repertoire but are blind to other errors, such as gene overprediction or contamination. We introduce OMArk, a software package that relies on fast, alignment-free sequence comparisons between a query proteome and precomputed gene families across the tree of life. OMArk assesses not only the completeness but also the consistency of the gene repertoire as a whole relative to closely related species and reports likely contamination events. Analysis of 1,805 UniProt Eukaryotic Reference Proteomes with…

No related works found for this paper.

Funding