articleStatistics in MedicineNov 30, 2010Closed access

Multiple imputation using chained equations: Issues and guidance for practice

MRC Biostatistics Unit · MRC Clinical Trials Unit at UCL · +2 more institutions

PubMed
Indexed incrossrefpubmed

Abstract

Multiple imputation by chained equations is a flexible and practical approach to handling missing data. We describe the principles of the method and show how to impute categorical and quantitative variables, including skewed variables. We give guidance on how to specify the imputation model and how many imputations are needed. We describe the practical analysis of multiply imputed data, including model building and model checking. We stress the limitations of the method and discuss the possible pitfalls. We illustrate the ideas using a data set in mental health, giving Stata code fragments.

Citation impact

9,493
total citations
FWCI
85.31
Percentile
100%
References
66
Citations per year

Authors

3

Topics & keywords

Keywords
  • Imputation (statistics)
  • Categorical variable
  • Computer science
  • Missing data
  • Data mining
  • Machine learning
No related works found for this paper.

Funding