VSEARCH: a versatile open source tool for metagenomics
Oslo University Hospital · University of Oslo · +7 more institutions
Abstract
VSEARCH is an open source and free of charge multithreaded 64-bit tool for processing and preparing metagenomics, genomics and population genomics nucleotide sequence data. It is designed as an alternative to the widely used USEARCH tool (Edgar, 2010) for which the source code is not publicly available, algorithm details are only rudimentarily described, and only a memory-confined 32-bit version is freely available for academic use.
When searching nucleotide sequences, VSEARCH uses a fast heuristic based on words shared by the query and target sequences in order to quickly identify similar sequences, a similar strategy is probably used in USEARCH. VSEARCH then performs optimal global sequence alignment of the query against potential target sequences, using full dynamic programming instead of the seed-and-extend heuristic used by USEARCH. Pairwise alignments are computed in parallel using vectorisation and multiple threads.
Citation impact
- FWCI
- 250.50
- Percentile
- 100%
- References
- 43
Authors
5- TRTorbjørn RognesCorresponding
Oslo University Hospital, University of Oslo
- TFTomáš Flouri
Karlsruhe Institute of Technology, Heidelberg Institute for Theoretical Studies
- BNBen Nichols
University of Glasgow
- CQChristopher Quince
University of Warwick, University of Glasgow
- FMFrédéric Mahé
Centre de Coopération Internationale en Recherche Agronomique pour le Développement, University of Kaiserslautern, Laboratoire des Symbioses Tropicales et Méditerranéennes
Topics & keywords
- Computer science
- Pairwise comparison
- Source code
- Metagenomics
- Shuffling
- Cluster analysis
- Memory footprint
- Data mining