articlePLoS ONEMar 9, 2011GOLD OA

Fast Identification and Removal of Sequence Contamination from Genomic and Metagenomic Datasets

San Diego State University · Southern States University · +1 more institution

PubMed
Indexed incrossrefdoajpubmed

Abstract

High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and cost-effective way of generating draft genomes and exploring microbial diversity. However, sequences obtained from impure nucleic acid preparations may contain DNA from sources other than the sample. Those sequence contaminations are a serious concern to the quality of the data used for downstream analysis, causing misassembly of sequence contigs and erroneous conclusions. Therefore, the removal of sequence contaminants is a necessary and required step for all sequencing projects. We developed DeconSeq, a robust framework for the rapid, automated identification and removal of sequence contamination in longer-read…

Citation impact

781
total citations
FWCI
11.04
Percentile
100%
References
57
Citations per year

Authors

2

Topics & keywords

Keywords
  • Metagenomics
  • Contig
  • Identification (biology)
  • Computer science
  • Contamination
  • DNA sequencing
  • Computational biology
  • Data mining
No related works found for this paper.

Funding