Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
European Bioinformatics Institute · Wellcome Trust
Abstract
We have developed a new set of algorithms, collectively called "Velvet," to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25-50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of approximately 8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 39
Authors
2Topics & keywords
- Contig
- De Bruijn sequence
- Velvet
- De Bruijn graph
- Biology
- k-mer
- Sequence assembly
- Algorithm