Ordered quantile normalization: a semiparametric transformation built for the cross-validation era
University of Iowa · Colorado School of Public Health · +1 more institution
Abstract
Normalization transformations have recently experienced a resurgence in popularity in the era of machine learning, particularly in data preprocessing. However, the classical methods that can be adapted to cross-validation are not always effective. We introduce Ordered Quantile (ORQ) normalization, a one-to-one transformation that is designed to consistently and effectively transform a vector of arbitrary distribution into a vector that follows a normal (Gaussian) distribution. In the absence of ties, ORQ normalization is guaranteed to produce normally distributed transformed data. Once trained, an ORQ transformation can be readily and effectively applied to new data. We compare the effectiveness of the ORQ…
Citation impact
- FWCI
- 69.63
- Percentile
- 100%
- References
- 20
Authors
2Topics & keywords
- Normalization (sociology)
- Computer science
- Preprocessor
- Quantile
- Data pre-processing
- Data mining
- Transformation (genetics)
- Database normalization