articleSociological MethodologyMay 18, 2007GREEN OA

4. Regression with Missing Ys: An Improved Strategy for Analyzing Multiply Imputed Data

The Ohio State University

Indexed inarxivcrossref

Abstract

When fitting a generalized linear model—such as linear regression, logistic regression, or hierarchical linear modeling—analysts often wonder how to handle missing values of the dependent variable Y. If missing values have been filled in using multiple imputation, the usual advice is to use the imputed Y values in analysis. We show, however, that using imputed Ys can add needless noise to the estimates. Better estimates can usually be obtained using a modified strategy that we call multiple imputation, then deletion (MID). Under MID, all cases are used for imputation but, following imputation, cases with imputed Y values are excluded from the analysis. When there is something wrong with the imputed Y values,…

Citation impact

1,491
total citations
FWCI
16.60
Percentile
100%
References
31
Citations per year

Authors

1

Topics & keywords

Keywords
  • Imputation (statistics)
  • Missing data
  • Statistics
  • Linear regression
  • Logistic regression
  • Regression
  • Mathematics
  • Econometrics
No related works found for this paper.