articleNature BiotechnologySep 21, 2023HYBRID OA

Identification of mobile genetic elements with geNomad

Lawrence Berkeley National Laboratory · Joint Genome Institute · +1 more institution

PubMed
Indexed incrossrefpubmed

Abstract

Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for…

No related works found for this paper.

Funding