Abstract
This contribution develops a theoretical framework that takes into account the effect of approximate optimization on learning algorithms. The analysis shows distinct tradeoffs for the case of small-scale and large-scale learning problems. Small-scale learning problems are subject to the usual approximation–estimation tradeoff. Largescale learning problems are subject to a qualitatively different tradeoff involving the computational complexity of the underlying optimization algorithm in non-trivial ways. For instance, a mediocre optimization algorithms, stochastic gradient descent, is shown to perform very well on large-scale learning problems. 1
Citation impact
1,193
total citations
- FWCI
- 100.06
- Percentile
- 100%
- References
- 31
Citations per year
Authors
2Topics & keywords
Topics
Keywords
- Scale (ratio)
- Computer science
- Geography
- Cartography
No related works found for this paper.