Estimating predicted probabilities from logistic regression: different methods correspond to different target populations

Muller, Clemma; MacLehose, Richard F.

doi:10.1093/ije/dyu029

articleInternational Journal of EpidemiologyMar 4, 2014BRONZE OA

Estimating predicted probabilities from logistic regression: different methods correspond to different target populations

CMClemma Muller RFRichard F. MacLehose

University of Minnesota

PubMed

Indexed incrossrefpubmed

Abstract

Background

We review three common methods to estimate predicted probabilities following confounder-adjusted logistic regression: marginal standardization (predicted probabilities summed to a weighted average reflecting the confounder distribution in the target population); prediction at the modes (conditional predicted probabilities calculated by setting each confounder to its modal value); and prediction at the means (predicted probabilities calculated by setting each confounder to its mean value). That each method corresponds to a different target population is underappreciated in practice. Specifically, prediction at the means is often incorrectly interpreted as estimating average probabilities for the overall study population, and furthermore yields nonsensical estimates in the presence of dichotomous confounders. Default commands in popular statistical software packages often lead to inadvertent misapplication of prediction at the means.

Methods

Using an applied example, we demonstrate discrepancies in predicted probabilities across these methods, discuss implications for interpretation and provide syntax for SAS and Stata.

Citation impact

733

total citations

FWCI: 29.89
Percentile: 100%
References: 47

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Statistics
Logistic regression
Econometrics
Confounding
Inference
Population
Causal inference
Regression analysis

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.

Funding

NI
National Institutes of Health