articleBMJApr 27, 2016BRONZE OA

Sparse data bias: a problem hiding in plain sight

University of California, Los Angeles · Tehran University of Medical Sciences · +2 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Effects of treatment or other exposure on outcome events are commonly measured by ratios of risks, rates, or odds. Adjusted versions of these measures are usually estimated by maximum likelihood regression (eg, logistic, Poisson, or Cox modelling). But resulting estimates of effect measures can have serious bias when the data lack adequate case numbers for some combination of exposure and outcome levels. This bias can occur even in quite large datasets and is hence often termed sparse data bias. The bias can arise or be worsened by regression adjustment for potentially confounding variables; in the extreme, the resulting estimates could be impossibly huge or even infinite values that are meaningless artefacts…

Citation impact

869
total citations
FWCI
102.54
Percentile
100%
References
38
Citations per year

Authors

3

Topics & keywords

Keywords
  • Confounding
  • Statistics
  • Poisson regression
  • Logistic regression
  • Poisson distribution
  • Econometrics
  • Covariate
  • Computer science
No related works found for this paper.