HybPiper: Extracting coding sequence and introns for phylogenetics from high‐throughput sequencing reads using target enrichment
Brooklyn Botanic Garden · Chicago Botanic Garden · +3 more institutions
Abstract
PREMISE OF THE STUDY: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). METHODS AND RESULTS: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly…
Citation impact
- FWCI
- 12.71
- Percentile
- 100%
- References
- 38
Authors
8Topics & keywords
- Biology
- DNA sequencing
- Intron
- Sequence (biology)
- Computational biology
- Phylogenetics
- Evolutionary biology
- Coding region