Fast and SNP-tolerant detection of complex variants and splicing in short reads
Indexed incrossrefdoajpubmed
Abstract
MOTIVATION: Next-generation sequencing captures sequence differences in reads relative to a reference genome or transcriptome, including splicing events and complex variants involving multiple mismatches and long indels. We present computational methods for fast detection of complex variants and splicing in short reads, based on a successively constrained search process of merging and filtering position lists from a genomic index. Our methods are implemented in GSNAP (Genomic Short-read Nucleotide Alignment Program), which can align both single- and paired-end reads as short as 14 nt and of arbitrarily long length. It can detect short- and long-distance splicing, including interchromosomal splicing, in…
Citation impact
1,995
total citations
- FWCI
- 34.71
- Percentile
- 100%
- References
- 39
Citations per year
Authors
2Topics & keywords
Keywords
- Indel
- Genetics
- Biology
- Computational biology
- Perl
- Reference genome
- Genome
- RNA splicing
No related works found for this paper.