Software for Computing and Annotating Genomic Ranges
Gene Therapy Laboratory · European Molecular Biology Laboratory · +4 more institutions
Abstract
We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those…
Citation impact
- FWCI
- 44.27
- Percentile
- 100%
- References
- 11
Authors
8Topics & keywords
- Bioconductor
- Computer science
- Scalability
- Software
- Visualization
- Computational statistics
- Sequence alignment
- Data mining
- Industry, innovation and infrastructure