Imputing missing covariate values for the Cox model
MRC Biostatistics Unit · MRC Clinical Trials Unit at UCL
Abstract
Multiple imputation is commonly used to impute missing data, and is typically more efficient than complete cases analysis in regression analysis when covariates have missing values. Imputation may be performed using a regression model for the incomplete covariates on other covariates and, importantly, on the outcome. With a survival outcome, it is a common practice to use the event indicator D and the log of the observed event or censoring time T in the imputation model, but the rationale is not clear.We assume that the survival outcome follows a proportional hazards model given covariates X and Z. We show that a suitable model for imputing binary or Normal X is a logistic or linear regression on the event…
Citation impact
- FWCI
- 15.33
- Percentile
- 100%
- References
- 19
Authors
2Topics & keywords
- Covariate
- Proportional hazards model
- Statistics
- Imputation (statistics)
- Missing data
- Estimator
- Censoring (clinical trials)
- Logistic regression
- Peace, Justice and strong institutions