Predicting good probabilities with supervised learning

Niculescu-Mizil, Alexandru; Caruana, Rich

doi:10.1145/1102351.1102430

Article · Jan 1, 2005 · Closed access

Cornell University

Indexed in Crossref

Abstract

We examine the relationship between the predictions made by different learning algorithms and true posterior probabilities. We show that maximum margin methods such as boosted trees and boosted stumps push probability mass away from 0 and 1, yielding a characteristic sigmoid-shaped distortion in the predicted probabilities. Models such as Naive Bayes, which make unrealistic independence assumptions, push probabilities toward 0 and 1. Other models such as neural nets and bagged trees do not have these biases and predict well-calibrated probabilities. We experiment with two ways of correcting the biased probabilities predicted by some learning methods: Platt Scaling and Isotonic Regression. We qualitatively…
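The two correction methods named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy scores and labels, and the optimizer settings are all hypothetical, and plain gradient descent stands in for whatever fitting procedure the paper uses. Platt Scaling is sketched as fitting a sigmoid to the raw scores, and Isotonic Regression as a pool-adjacent-violators fit that yields a non-decreasing step function.

```python
import bisect
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_platt(scores, labels, lr=1.0, n_iter=5000):
    """Fit p = sigmoid(a * score + b) by gradient descent on average log loss.

    Assumption: scores are roughly in [0, 1], so lr=1.0 is a stable step size.
    """
    a, b = 0.0, 0.0
    n = len(scores)
    for _ in range(n_iter):
        grad_a = grad_b = 0.0
        for s, y in zip(scores, labels):
            err = sigmoid(a * s + b) - y
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def fit_isotonic(scores, labels):
    """Pool-adjacent-violators: non-decreasing fit of labels against scores."""
    pairs = sorted(zip(scores, labels))
    blocks = []  # each block holds [sum_of_labels, count]
    for _, y in pairs:
        blocks.append([float(y), 1])
        # Merge backwards while the previous block's mean exceeds the last one's.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    fitted = []
    for total, count in blocks:
        fitted.extend([total / count] * count)
    xs = [s for s, _ in pairs]
    return xs, fitted


def predict_isotonic(xs, fitted, score):
    """Step-function lookup, clamped to the range seen during fitting."""
    i = bisect.bisect_right(xs, score) - 1
    i = max(0, min(i, len(fitted) - 1))
    return fitted[i]


# Hypothetical held-out scores squeezed toward the middle, as the abstract
# describes for boosted models, with their true 0/1 labels.
toy_scores = [0.30, 0.35, 0.40, 0.45, 0.55, 0.60, 0.65, 0.70]
toy_labels = [0, 0, 1, 0, 1, 0, 1, 1]

a, b = fit_platt(toy_scores, toy_labels)
platt_probs = [sigmoid(a * s + b) for s in toy_scores]

xs, iso_fitted = fit_isotonic(toy_scores, toy_labels)
iso_probs = [predict_isotonic(xs, iso_fitted, s) for s in toy_scores]
```

In practice both maps would be fitted on a calibration set held out from model training, then applied to new scores. The sigmoid has only two parameters, while the isotonic fit is a free-form monotone step function and so generally needs more calibration data to avoid overfitting.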

Citation impact

Total citations: 1,563

FWCI: 21.13
Percentile: 100%
References: 11

Authors: 2

Topics & keywords

Keywords

Margin (machine learning)
Artificial intelligence
Calibration
Machine learning
Computer science
Independence (probability theory)
Isotonic regression
Random forest