ContEst16S: an algorithm that identifies contaminated prokaryotic genomes using 16S RNA gene sequences

Seoul National University

PubMed
Indexed incrossrefpubmed

Abstract

Thanks to the recent advancement of DNA sequencing technology, the cost and time of prokaryotic genome sequencing have been dramatically decreased. It has repeatedly been reported that genome sequencing using high-throughput next-generation sequencing is prone to contaminations due to its high depth of sequencing coverage. Although a few bioinformatics tools are available to detect potential contaminations, these have inherited limitations as they only use protein-coding genes. Here we introduce a new algorithm, called ContEst16S, to detect potential contaminations using 16S rRNA genes from genome assemblies. We screened 69 745 prokaryotic genomes from the NCBI Assembly Database using ContEst16S and found that…

Citation impact

558
total citations
FWCI
14.13
Percentile
100%
References
21
Citations per year

Authors

6

Topics & keywords

Keywords
  • Biology
  • Genome
  • DNA sequencing
  • Gene
  • Computational biology
  • Sequence assembly
  • Bacterial genome size
  • Genetics
No related works found for this paper.