Tuning multiple imputation by predictive mean matching and local residual draws
MRC Clinical Trials Unit at UCL · University College London · +1 more institution
Abstract
Multiple imputation is a commonly used method for handling incomplete covariates as it can provide valid inference when data are missing at random. This depends on being able to correctly specify the parametric model used to impute missing values, which may be difficult in many realistic settings. Imputation by predictive mean matching (PMM) borrows an observed value from a donor with a similar predictive mean; imputation by local residual draws (LRD) instead borrows the donor's residual. Both methods relax some assumptions of parametric imputation, promising greater robustness when the imputation model is misspecified.
We review development of PMM and LRD and outline the various forms available, and aim to clarify some choices about how and when they should be used. We compare performance to fully parametric imputation in simulation studies, first when the imputation model is correctly specified and then when it is misspecified.
Citation impact
- FWCI
- 14.44
- Percentile
- 100%
- References
- 42
Authors
3Topics & keywords
- Residual
- Imputation (statistics)
- Statistics
- Matching (statistics)
- Computer science
- Econometrics
- Mathematics
- Missing data