Swarm v2: highly-scalable and high-resolution amplicon clustering
University of Kaiserslautern · Oslo University Hospital
Abstract
Previously we presented Swarm v1, a novel and open source amplicon clustering program that produced fine-scale molecular operational taxonomic units (OTUs), free of arbitrary global clustering thresholds and input-order dependency. Swarm v1 worked with an initial phase that used iterative single-linkage with a local clustering threshold (d), followed by a phase that used the internal abundance structures of clusters to break chained OTUs. Here we present Swarm v2, which has two important novel features: (1) a new algorithm for d = 1 that allows the computation time of the program to scale linearly with increasing amounts of data; and (2) the new fastidious option that reduces under-grouping by grafting low…
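To make the first phase concrete, here is a minimal Python sketch of iterative single-linkage clustering with a local threshold of d = 1. It is an illustration under stated assumptions, not Swarm's actual implementation. It assumes that d counts substitutions, insertions, and deletions, that seeds are visited from most to least abundant, and that neighbours are found by generating every one-difference variant of a sequence and looking each one up in a hash set. The names `microvariants` and `cluster_d1` are hypothetical. Because a sequence of length L has at most 8L + 4 such variants regardless of how many amplicons are in the dataset, this kind of lookup is one plausible way for run time to grow linearly with the data. The second phase (breaking chained OTUs) and the fastidious option are not shown.

```python
from collections import deque

ALPHABET = "ACGT"


def microvariants(seq):
    """Return every sequence exactly one edit away from seq.

    One edit = one substitution, one deletion, or one insertion.
    Hypothetical helper, written for illustration only.
    """
    variants = set()
    for i in range(len(seq)):
        # Deletion of the base at position i.
        variants.add(seq[:i] + seq[i + 1:])
        # Substitution of the base at position i.
        for base in ALPHABET:
            if base != seq[i]:
                variants.add(seq[:i] + base + seq[i + 1:])
    for i in range(len(seq) + 1):
        # Insertion of a base before position i.
        for base in ALPHABET:
            variants.add(seq[:i] + base + seq[i:])
    return variants


def cluster_d1(abundances):
    """Single-linkage clustering with a local threshold of one difference.

    abundances: dict mapping each unique sequence to its abundance.
    Returns a list of clusters, each a list of sequences (seed first).
    """
    unassigned = set(abundances)
    clusters = []
    # Assumption: seeds are taken by decreasing abundance, ties broken
    # alphabetically, so the result does not depend on input order.
    for seed in sorted(abundances, key=lambda s: (-abundances[s], s)):
        if seed not in unassigned:
            continue
        unassigned.remove(seed)
        cluster = [seed]
        queue = deque([seed])
        # Grow the cluster iteratively: each newly added member becomes
        # a sub-seed whose own one-difference neighbours are recruited.
        while queue:
            current = queue.popleft()
            for variant in sorted(microvariants(current)):
                if variant in unassigned:
                    unassigned.remove(variant)
                    cluster.append(variant)
                    queue.append(variant)
        clusters.append(cluster)
    return clusters


example = {"ACGT": 10, "ACGA": 3, "ACGAA": 1, "TTTT": 5}
result = cluster_d1(example)
```

In this toy input, `ACGAA` is two differences away from the seed `ACGT` but only one away from `ACGA`, so it joins the same cluster through the chain. That chaining is the behaviour the second phase is meant to inspect using the cluster's internal abundance structure.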
Citation impact
- FWCI: 15.51
- Percentile: 100%
- References: 24
Authors
- Frédéric Mahé (corresponding author)
University of Kaiserslautern
- Torbjørn Rognes
Oslo University Hospital, University of Oslo
- Christopher Quince
University of Warwick
- Colomban de Vargas
Centre National de la Recherche Scientifique, Station Biologique de Roscoff, Sorbonne Université, Adaptation et Diversité en Milieu Marin
- Micah Dunthorn
University of Kaiserslautern
Topics & keywords
- Swarm behaviour
- Cluster analysis
- Computer science
- Scalability
- Amplicon
- Computation
- Data mining
- Biology