articleBioinformaticsAug 12, 2010BRONZE OA

Search and clustering orders of magnitude faster than BLAST

Tiburon Associates (United States)

PubMed
Indexed incrossrefdoajpubmed

Abstract

MOTIVATION: Biological sequence data is accumulating rapidly, motivating the development of improved high-throughput methods for sequence classification. RESULTS: UBLAST and USEARCH are new algorithms enabling sensitive local and global search of large sequence databases at exceptionally high speeds. They are often orders of magnitude faster than BLAST in practical applications, though sensitivity to distant protein relationships is lower. UCLUST is a new clustering method that exploits USEARCH to assign sequences to clusters. UCLUST offers several advantages over the widely used program CD-HIT, including higher speed, lower memory use, improved sensitivity, clustering at lower identities and classification of…

Citation impact

21,648
total citations
FWCI
100.44
Percentile
100%
References
9
Citations per year

Authors

1

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • Sequence (biology)
  • Sensitivity (control systems)
  • Data mining
  • Exploit
  • Pattern recognition (psychology)
  • Machine learning
No related works found for this paper.