A faster circular binary segmentation algorithm for the analysis of array CGH data
Memorial Sloan Kettering Cancer Center
Abstract
MOTIVATION: Array CGH technologies enable the simultaneous measurement of DNA copy number for thousands of sites on a genome. We developed the circular binary segmentation (CBS) algorithm to divide the genome into regions of equal copy number. The algorithm tests for change-points using a maximal t-statistic with a permutation reference distribution to obtain the corresponding P-value. The number of computations required for the maximal test statistic is O(N^2), where N is the number of markers. This makes the full permutation approach computationally prohibitive for the newer arrays that contain tens of thousands of markers and highlights the need for a faster algorithm. RESULTS: We present a hybrid approach to…
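To make the O(N^2) cost concrete, the following is a minimal sketch of a maximal t-statistic over all arcs with a permutation P-value, as described in the MOTIVATION above. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the pooled-variance two-sample t-statistic is an assumed form of the test statistic, and the hybrid approach from RESULTS is not shown because the abstract is truncated before describing it.

```python
import math
import random


def max_arc_t_statistic(x):
    """Return (max |t|, (i, j)) over all arcs x[i:j] compared with the rest.

    Assumption: a pooled-variance two-sample t-statistic. The double loop
    visits every pair i < j, which is the O(N^2) cost noted in the abstract.
    """
    n = len(x)
    # Prefix sums of values and of squares give each arc's sums in O(1).
    s, q = [0.0], [0.0]
    for v in x:
        s.append(s[-1] + v)
        q.append(q[-1] + v * v)
    total, total_sq = s[n], q[n]

    best, best_arc = 0.0, None
    for i in range(n):
        for j in range(i + 1, n + 1):
            k = j - i          # markers inside the arc
            m = n - k          # markers in the complement
            if m == 0 or n <= 2:
                continue
            arc_sum, arc_sq = s[j] - s[i], q[j] - q[i]
            rest_sum, rest_sq = total - arc_sum, total_sq - arc_sq
            # Pooled within-group sum of squares.
            ss = (arc_sq - arc_sum ** 2 / k) + (rest_sq - rest_sum ** 2 / m)
            if ss <= 1e-12:
                continue       # skip degenerate (zero-variance) splits
            var = ss / (n - 2)
            t = abs(arc_sum / k - rest_sum / m) / math.sqrt(var * (1 / k + 1 / m))
            if t > best:
                best, best_arc = t, (i, j)
    return best, best_arc


def permutation_p_value(x, n_perm=200, seed=0):
    """Full permutation reference distribution: O(n_perm * N^2) work."""
    rng = random.Random(seed)
    observed, arc = max_arc_t_statistic(x)
    y = list(x)
    exceed = 0
    for _ in range(n_perm):
        rng.shuffle(y)
        if max_arc_t_statistic(y)[0] >= observed:
            exceed += 1
    # Add-one correction keeps the estimated P-value strictly positive.
    return observed, arc, (exceed + 1) / (n_perm + 1)
```

Each permutation repeats the full quadratic scan, so the total work grows as the number of permutations times N^2. This is the cost that becomes prohibitive for arrays with tens of thousands of markers.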
Citation impact
- FWCI: 30.91
- Percentile: 100%
- References: 34
Authors
- 2
Topics & keywords
- Permutation (music)
- Algorithm
- Computer science
- Statistic
- Binary number
- Computation
- Bioconductor
- Test statistic