articleBMC GenomicsJan 1, 2012GOLD OA

Optimizing illumina next-generation sequencing library preparation for extremely at-biased genomes

Wellcome Sanger Institute · Oxford Nanopore Technologies (United Kingdom)

PubMed
Indexed incrossrefdoajpubmed

Abstract

Background

Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences.

Results

We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates.

Citation impact

667
total citations
FWCI
17.26
Percentile
100%
References
15
Citations per year

Authors

11

Topics & keywords

Keywords
  • Biology
  • Computational biology
  • Genome
  • Massive parallel sequencing
  • DNA sequencing
  • GC-content
  • Illumina dye sequencing
  • DNA microarray
No related works found for this paper.

Funding