Article · Genome Research · Jan 5, 2011 · Bronze OA

Adaptive seeds tame genomic sequence comparison

Max Planck Institute for Molecular Genetics · The University of Tokyo

Indexed in: Crossref, PubMed

Abstract

The main way of analyzing biological sequences is by comparing and aligning them to each other. It remains difficult, however, to compare modern multi-billion-base DNA data sets. The difficulty is caused by the nonuniform (oligo)nucleotide composition of these sequences, rather than their size per se. To solve this problem, we modified the standard seed-and-extend approach (e.g., BLAST) to use adaptive seeds. Adaptive seeds are matches chosen for their rareness, rather than matches of a fixed length. This method guarantees that the number of matches, and thus the running time, increases linearly, instead of quadratically, with sequence length. LAST, our open source implementation of adaptive…
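The idea of choosing seeds by rareness can be illustrated with a minimal sketch. It is not the LAST implementation: it assumes a naive substring count in place of an index, and the names `count_occurrences`, `adaptive_seeds`, and the threshold `max_freq` are hypothetical, introduced only for this example. From each start position in the query, the match is lengthened until it occurs at most `max_freq` times in the reference.

```python
def count_occurrences(text, pattern):
    """Count occurrences of pattern in text, including overlapping ones."""
    count = 0
    start = text.find(pattern)
    while start != -1:
        count += 1
        start = text.find(pattern, start + 1)
    return count


def adaptive_seeds(reference, query, max_freq=2):
    """Return (query_start, length, occurrences) for each adaptive seed.

    Assumption for illustration: a seed is the shortest match starting
    at a query position that occurs at least once and at most max_freq
    times in the reference. Positions with no such match yield no seed.
    """
    seeds = []
    for i in range(len(query)):
        for length in range(1, len(query) - i + 1):
            word = query[i:i + length]
            n = count_occurrences(reference, word)
            if n == 0:
                break  # longer words cannot match either
            if n <= max_freq:
                seeds.append((i, length, n))  # rare enough: stop extending
                break
    return seeds


if __name__ == "__main__":
    reference = "ACGTACGTTT"
    query = "ACGTT"
    for start, length, n in adaptive_seeds(reference, query, max_freq=1):
        print(start, query[start:start + length], n)
```

In this sketch each query position contributes at most `max_freq` matches, so the total number of matches is bounded by `max_freq` times the query length. A fixed-length seed has no such bound, because a common word can match at very many reference positions.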

Citation impact

Total citations: 1,440
References: 34

Authors

5

Topics & keywords

Keywords
  • Biology
  • Sequence (biology)
  • Quadratic growth
  • Composition (language)
  • Computational biology
  • DNA sequencing
  • Algorithm
  • Genetics