STAR: ultrafast universal RNA-seq aligner
Cold Spring Harbor Laboratory · Pacific Biosciences (United States)
Abstract
MOTIVATION: Accurate alignment of high-throughput RNA-seq data is a challenging and yet unsolved problem because of the non-contiguous transcript structure, relatively short read lengths and constantly increasing throughput of the sequencing technologies. Currently available RNA-seq aligners suffer from high mapping error rates, low mapping speed, read length limitations and mapping biases.
RESULTS: To align our large (>80 billion reads) ENCODE Transcriptome RNA-seq dataset, we developed the Spliced Transcripts Alignment to a Reference (STAR) software based on a previously undescribed RNA-seq alignment algorithm that uses sequential maximum mappable seed search in uncompressed suffix arrays followed by seed…
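The "sequential maximum mappable seed search in uncompressed suffix arrays" mentioned in the abstract can be illustrated with a toy sketch. The Python below is not STAR's implementation: the function names, the naive suffix-array construction, the exact-match-only rule, and the skip-one-base fallback are all assumptions made for illustration. It only shows the general idea of repeatedly finding the longest prefix of the unmapped part of a read that occurs exactly in a reference, which lets a read that spans a splice junction break into one seed per exon. The steps that follow the seed search are not covered.

```python
def build_suffix_array(text):
    # Naive construction by sorting suffixes; acceptable for a toy reference only.
    return sorted(range(len(text)), key=lambda i: text[i:])


def _char_at(text, pos):
    # Characters past the end of the text compare lower than any real base.
    return text[pos] if pos < len(text) else ""


def longest_prefix_match(text, sa, query):
    """Return (length, positions) for the longest prefix of `query` found in `text`."""
    lo, hi = 0, len(sa)  # suffix-array interval [lo, hi) matching query[:depth]
    depth = 0
    while depth < len(query):
        c = query[depth]
        # Lower bound: first suffix in the interval whose next character is >= c.
        a, b = lo, hi
        while a < b:
            m = (a + b) // 2
            if _char_at(text, sa[m] + depth) < c:
                a = m + 1
            else:
                b = m
        new_lo = a
        # Upper bound: first suffix in the interval whose next character is > c.
        b = hi
        while a < b:
            m = (a + b) // 2
            if _char_at(text, sa[m] + depth) <= c:
                a = m + 1
            else:
                b = m
        new_hi = a
        if new_lo == new_hi:
            break  # no suffix extends the match by one more character
        lo, hi = new_lo, new_hi
        depth += 1
    positions = sorted(sa[lo:hi]) if depth > 0 else []
    return depth, positions


def sequential_seeds(text, sa, read):
    """Greedily split `read` into maximal exactly-matching seeds, left to right."""
    seeds = []
    start = 0
    while start < len(read):
        length, positions = longest_prefix_match(text, sa, read[start:])
        if length == 0:
            start += 1  # assumption: skip a base that matches nowhere
            continue
        seeds.append((start, length, positions))
        start += length  # resume the search in the still-unmapped portion
    return seeds


# Toy reference: exon "ACGTACGG", intron "TTTTTTTT", exon "CCATGCA".
genome = "ACGTACGG" + "TTTTTTTT" + "CCATGCA"
sa = build_suffix_array(genome)
read = "TACGG" + "CCATG"  # hypothetical read spanning the exon-exon junction
seeds = sequential_seeds(genome, sa, read)
for read_start, length, positions in seeds:
    print(read_start, length, positions)
```

Each tuple holds the seed's offset in the read, its length, and the reference positions where it matches. In this toy case the gap between the two seeds' reference positions corresponds to the intron.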
Citation impact
- FWCI: 89.47
- Percentile: 100%
- References: 25
Authors (9)
- Alexander Dobin (corresponding author)
Cold Spring Harbor Laboratory, Pacific Biosciences (United States)
- Carrie Davis
Cold Spring Harbor Laboratory, Pacific Biosciences (United States)
- Felix Schlesinger
Cold Spring Harbor Laboratory, Pacific Biosciences (United States)
- Jörg Drenkow
Cold Spring Harbor Laboratory, Pacific Biosciences (United States)
- Chris Zaleski
Cold Spring Harbor Laboratory, Pacific Biosciences (United States)
Topics & keywords
- Computer science
- JSON
- Computational biology
- Software
- RNA-Seq
- Adapter (computing)
- Algorithm
- Biology