articleThe American StatisticianOct 28, 2009Closed access

Variable Importance Assessment in Regression: Linear Regression versus Random Forest

Rosa Luxemburg Foundation

Indexed incrossref

Abstract

Relative importance of regressor variables is an old topic that still awaits a satisfactory solution. When interest is in attributing importance in linear regression, averaging over orderings methods for decomposing R2 are among the state-of-the-art methods, although the mechanism behind their behavior is not (yet) completely understood. Random forests—a machine-learning tool for classification and regression proposed a few years ago—have an inherent procedure of producing variable importances. This article compares the two approaches (linear model on the one hand and two versions of random forests on the other hand) and finds both striking similarities and differences, some of which can be explained whereas…

Citation impact

1,116
total citations
FWCI
6.53
Percentile
100%
References
45
Citations per year

Authors

1

Topics & keywords

Keywords
  • Random forest
  • Regression
  • Linear regression
  • Variable (mathematics)
  • Regression analysis
  • Econometrics
  • Statistics
  • Variables
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.