articleJan 1, 2005Closed access

Predicting good probabilities with supervised learning

Cornell University

Indexed incrossref

Abstract

We examine the relationship between the predictions made by different learning algorithms and true posterior probabilities. We show that maximum margin methods such as boosted trees and boosted stumps push probability mass away from 0 and 1 yielding a characteristic sigmoid shaped distortion in the predicted probabilities. Models such as Naive Bayes, which make unrealistic independence assumptions, push probabilities toward 0 and 1. Other models such as neural nets and bagged trees do not have these biases and predict well calibrated probabilities. We experiment with two ways of correcting the biased probabilities predicted by some learning methods: Platt Scaling and Isotonic Regression. We qualitatively…

Citation impact

1,563
total citations
FWCI
21.13
Percentile
100%
References
11
Citations per year

Authors

2

Topics & keywords

Keywords
  • Margin (machine learning)
  • Artificial intelligence
  • Calibration
  • Machine learning
  • Computer science
  • Independence (probability theory)
  • Isotonic regression
  • Random forest
No related works found for this paper.