articleBioinformaticsJul 2, 2015BRONZE OA

Error filtering, pair assembly and error correction for next-generation sequencing reads

Tiburon Associates (United States) · Technical University of Denmark

PubMed
Indexed incrossrefdoajpubmed

Abstract

MOTIVATION: Next-generation sequencing produces vast amounts of data with errors that are difficult to distinguish from true biological variation when coverage is low. RESULTS: We demonstrate large reductions in error frequencies, especially for high-error-rate reads, by three independent means: (i) filtering reads according to their expected number of errors, (ii) assembling overlapping read pairs and (iii) for amplicon reads, by exploiting unique sequence abundances to perform error correction. We also show that most published paired read assemblers calculate incorrect posterior quality scores. AVAILABILITY AND IMPLEMENTATION: These methods are implemented in the USEARCH package. Binaries are freely…

Citation impact

1,337
total citations
FWCI
34.63
Percentile
100%
References
22
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Word error rate
  • Error detection and correction
  • Sequence (biology)
  • Amplicon sequencing
  • Amplicon
  • Software
  • Algorithm
No related works found for this paper.