Machine Learning Benchmarks and Random Forest Regression

Segal, Mark R.

articleeScholarship (California Digital Library)Apr 14, 2004GREEN OA

Machine Learning Benchmarks and Random Forest Regression

Abstract

Breiman (2001a,b) has recently developed an ensemble classification and regression approach that displayed outstanding performance with regard prediction error on a suite of benchmark datasets. As the base constituents of the ensemble are tree-structured predictors, and since each of these is constructed using an injection of randomness, the method is called ‘random forests’. That the exceptional performance is attained with seemingly only a single tuning parameter, to which sensitivity is minimal, makes the methodology all the more remarkable. The individual trees comprising the forest are all grown to maximal depth. While this helps with regard bias, there is the familiar tradeoff with variance. However,…

Citation impact

682

total citations

FWCI: 1.85
Percentile: 100%
References: 7

Citations per year

Authors

1

MR
Mark R. SegalCorresponding
University of California, San Francisco

Topics & keywords

Topics

Keywords

Overfitting
Random forest
Computer science
Randomness
Machine learning
Benchmark (surveying)
Boosting (machine learning)
Benchmarking

UN Sustainable Development Goals

Life in Land

No related works found for this paper.