articleeScholarship (California Digital Library)Apr 14, 2004GREEN OA

Machine Learning Benchmarks and Random Forest Regression

University of California, San Francisco

Abstract

Breiman (2001a,b) has recently developed an ensemble classification and regression approach that displayed outstanding performance with regard prediction error on a suite of benchmark datasets. As the base constituents of the ensemble are tree-structured predictors, and since each of these is constructed using an injection of randomness, the method is called ‘random forests’. That the exceptional performance is attained with seemingly only a single tuning parameter, to which sensitivity is minimal, makes the methodology all the more remarkable. The individual trees comprising the forest are all grown to maximal depth. While this helps with regard bias, there is the familiar tradeoff with variance. However,…

Citation impact

682
total citations
FWCI
1.85
Percentile
100%
References
7
Citations per year

Authors

1

Topics & keywords

Keywords
  • Overfitting
  • Random forest
  • Computer science
  • Randomness
  • Machine learning
  • Benchmark (surveying)
  • Boosting (machine learning)
  • Benchmarking
UN Sustainable Development Goals
  • Life in Land
No related works found for this paper.