articleEnvironmental Science & TechnologyMay 23, 2017Closed access

Estimating PM 2.5 Concentrations in the Conterminous United States Using the Random Forest Approach

University of Nevada, Reno

PubMed
Indexed incrossrefpubmed

Abstract

To estimate PM2.5 concentrations, many parametric regression models have been developed, while nonparametric machine learning algorithms are used less often and national-scale models are rare. In this paper, we develop a random forest model incorporating aerosol optical depth (AOD) data, meteorological fields, and land use variables to estimate daily 24 h averaged ground-level PM2.5 concentrations over the conterminous United States in 2011. Random forests are an ensemble learning method that provides predictions with high accuracy and interpretability. Our results achieve an overall cross-validation (CV) R2 value of 0.80. Mean prediction error (MPE) and root mean squared prediction error (RMSPE) for daily…

Citation impact

627
total citations
FWCI
26.20
Percentile
100%
References
60
Citations per year

Authors

7

Topics & keywords

Keywords
  • Random forest
  • Interpretability
  • Regression
  • Statistics
  • Environmental science
  • Mean squared error
  • Regression analysis
  • Scale (ratio)
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.

Funding