MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph
National Institute of Informatics · University of Hong Kong
Abstract
Abstract Summary: MEGAHIT is a NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252 Gbps in 44.1 and 99.6 h on a single computing node with and without a graphics processing unit, respectively. MEGAHIT assembles the data as a whole, i.e. no pre-processing like partitioning and normalization was needed. When compared with previous methods on assembling the soil data, MEGAHIT generated a three-time larger assembly, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a fourfold improvement. Availability and implementation: The…
Citation impact
- FWCI
- 57.37
- Percentile
- 100%
- References
- 13
Authors
5- DLDinghua LiCorresponding
National Institute of Informatics, University of Hong Kong
- CLChi-Man Liu
National Institute of Informatics, University of Hong Kong
- RLRuibang Luo
National Institute of Informatics, University of Hong Kong
- KSKunihiko Sadakane
National Institute of Informatics, University of Hong Kong
- TLTak‐Wah Lam
National Institute of Informatics, University of Hong Kong
Topics & keywords
- Contig
- Metagenomics
- De Bruijn graph
- Sequence assembly
- Computer science
- De Bruijn sequence
- Software
- k-mer
- Life in Land