articleSource Code for Biology and MedicineDec 1, 2008HYBRID OA

Purposeful selection of variables in logistic regression

University of Arkansas for Medical Sciences · University of Massachusetts Amherst

PubMed
Indexed incrossrefpubmed

Abstract

Background

The main problem in many model-building situations is to choose from a large set of covariates those that should be included in the "best" model. A decision to keep a variable in the model might be based on the clinical or statistical significance. There are several variable selection algorithms in existence. Those methods are mechanical and as such carry some limitations. Hosmer and Lemeshow describe a purposeful selection of covariates within which an analyst makes a variable selection decision at each step of the modeling process.

Methods

In this paper we introduce an algorithm which automates that process. We conduct a simulation study to compare the performance of this algorithm with three well documented variable selection procedures in SAS PROC LOGISTIC: FORWARD, BACKWARD, and STEPWISE.

Citation impact

3,887
total citations
FWCI
7.74
Percentile
100%
References
19
Citations per year

Authors

4

Topics & keywords

Keywords
  • Covariate
  • Logistic regression
  • Computer science
  • Confounding
  • Feature selection
  • Selection (genetic algorithm)
  • Variable (mathematics)
  • Variables
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.