articleJournal of Clinical EpidemiologyMar 13, 2019HYBRID OA

The proportion of missing data should not be used to guide decisions on multiple imputation

University of Bristol

PubMed
Indexed incrossrefpubmed

Abstract

Objectives

Researchers are concerned whether multiple imputation (MI) or complete case analysis should be used when a large proportion of data are missing. We aimed to provide guidance for drawing conclusions from data with a large proportion of missingness. STUDY DESIGN AND SETTING: Via simulations, we investigated how the proportion of missing data, the fraction of missing information (FMI), and availability of auxiliary variables affected MI performance. Outcome data were missing completely at random or missing at random (MAR).

Results

Provided sufficient auxiliary information was available; MI was beneficial in terms of bias and never detrimental in terms of efficiency. Models with similar FMI values, but differing proportions of missing data, also had similar precision for effect estimates. In the absence of bias, the FMI was a better guide to the efficiency gains using MI than the proportion of missing data.

Citation impact

1,034
total citations
FWCI
93.96
Percentile
100%
References
39
Citations per year

Authors

4

Topics & keywords

Keywords
  • Imputation (statistics)
  • Missing data
  • Statistics
  • Computer science
  • Data mining
  • Econometrics
  • Medicine
  • Data science
No related works found for this paper.

Funding