Sparse data bias: a problem hiding in plain sight
University of California, Los Angeles · Tehran University of Medical Sciences · +2 more institutions
Abstract
Effects of treatment or other exposure on outcome events are commonly measured by ratios of risks, rates, or odds. Adjusted versions of these measures are usually estimated by maximum likelihood regression (eg, logistic, Poisson, or Cox modelling). But resulting estimates of effect measures can have serious bias when the data lack adequate case numbers for some combination of exposure and outcome levels. This bias can occur even in quite large datasets and is hence often termed sparse data bias. The bias can arise or be worsened by regression adjustment for potentially confounding variables; in the extreme, the resulting estimates could be impossibly huge or even infinite values that are meaningless artefacts…
Citation impact
- FWCI
- 102.54
- Percentile
- 100%
- References
- 38
Authors
3Topics & keywords
- Confounding
- Statistics
- Poisson regression
- Logistic regression
- Poisson distribution
- Econometrics
- Covariate
- Computer science