book chapterJan 1, 2010Closed access

Large-Scale Machine Learning with Stochastic Gradient Descent

Princeton University

Indexed incrossref

Abstract

During the last decade, the data sizes have grown faster than the speed of processors. In this context, the capabilities of statistical machine learning methods is limited by the computing time rather than the sample size. A more precise analysis uncovers qualitatively different tradeoffs for the case of small-scale and large-scale learning problems. The large-scale case involves the computational complexity of the underlying optimization algorithm in non-trivial ways. Unlikely optimization algorithms such as stochastic gradient descent show amazing performance for large-scale problems. In particular, second order stochastic gradient and averaged stochastic gradient are asymptotically efficient after a single…

Citation impact

5,613
total citations
FWCI
33.28
Percentile
100%
References
26
Citations per year

Authors

1

Topics & keywords

Keywords
  • Stochastic gradient descent
  • Computer science
  • Scale (ratio)
  • Stochastic optimization
  • Gradient descent
  • Set (abstract data type)
  • Online machine learning
  • Context (archaeology)
No related works found for this paper.