Abstract
Many of the classification algorithms developed in the machine learning literature, including the support vector machine and boosting, can be viewed as minimum contrast methods that minimize a convex surrogate of the 0–1 loss function. The convexity makes these algorithms computationally efficient. The use of a surrogate, however, has statistical consequences that must be balanced against the computational virtues of convexity. To study these issues, we provide a general quantitative relationship between the risk as assessed using the 0–1 loss and the risk as assessed using any nonnegative surrogate loss function. We show that this relationship gives nontrivial upper bounds on excess risk under the weakest…
Citation impact
1,053
total citations
- FWCI
- 59.45
- Percentile
- 100%
- References
- 60
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Convexity
- Hinge loss
- Pointwise
- Mathematics
- Mathematical optimization
- Function (biology)
- Convex function
- Algorithm
No related works found for this paper.