Improving your data transformations: Applying the Box-Cox transformation

Indexed indatacite

Abstract

Many of us in the social sciences deal with data that do not conform to assumptions of normality and/or homoscedasticity/homogeneity of variance. Some research has shown that parametric tests (e.g., multiple regression, ANOVA) can be robust to modest violations of these assumptions. Yet the reality is that almost all analyses (even nonparametric tests) benefit from improved the normality of variables, particularly where substantial non-normality is present. While many are familiar with select traditional transformations (e.g., square root, log, inverse) for improving normality, the Box-Cox transformation (Box & Cox, 1964) represents a family of power transformations that incorporates and extends the…

Citation impact

885
total citations
FWCI
56.51
Percentile
100%
References
19
Citations per year

Authors

1

Topics & keywords

Keywords
  • Transformation (genetics)
  • Power transform
  • Computer science
  • Data transformation
  • Data science
  • Data mining
  • Artificial intelligence
  • Data warehouse
No related works found for this paper.