Accelerating Stochastic Gradient Descent using Predictive Variance Reduction

Johnson, Rie; Zhang, Tong

Article · Dec 5, 2013 · Closed access

Rutgers, The State University of New Jersey · Baidu (China)

Abstract

Stochastic gradient descent is popular for large-scale optimization but converges slowly asymptotically because of the inherent variance of its gradient estimates. To remedy this problem, we introduce an explicit variance reduction method for stochastic gradient descent, which we call stochastic variance reduced gradient (SVRG). For smooth and strongly convex functions, we prove that this method enjoys the same fast convergence rate as stochastic dual coordinate ascent (SDCA) and stochastic average gradient (SAG). However, our analysis is significantly simpler and more intuitive. Moreover, unlike SDCA or SAG, our method does not require the storage of gradients, and thus is more easily applicable to complex problems such as…
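
The abstract does not spell out the update rule, so the following is only a minimal sketch of SVRG as it is commonly described: each outer pass stores a snapshot of the weights and the full gradient at that snapshot, and each inner step corrects a single-example gradient with the difference between the snapshot's full gradient and the same example's gradient at the snapshot. The least-squares objective, the synthetic data, the step size, the inner-loop length of 2n, and the names `grad_i`, `full_gradient`, and `svrg` are all illustrative assumptions, not taken from the paper.

```python
import random


def grad_i(w, x, y):
    """Gradient of the single-example loss 0.5 * (w.x - y)^2 with respect to w."""
    residual = sum(wj * xj for wj, xj in zip(w, x)) - y
    return [residual * xj for xj in x]


def full_gradient(w, X, Y):
    """Average of the per-example gradients over the whole data set."""
    total = [0.0] * len(w)
    for x, y in zip(X, Y):
        g = grad_i(w, x, y)
        total = [tj + gj for tj, gj in zip(total, g)]
    return [tj / len(X) for tj in total]


def svrg(X, Y, step=0.05, epochs=40, inner_steps=None, seed=0):
    """Sketch of SVRG for least squares (hyperparameters are assumptions)."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    m = inner_steps if inner_steps is not None else 2 * n
    w_snap = [0.0] * d
    for _ in range(epochs):
        # One full pass: gradient of the whole objective at the snapshot.
        mu = full_gradient(w_snap, X, Y)
        w = list(w_snap)
        for _ in range(m):
            i = rng.randrange(n)
            g = grad_i(w, X[i], Y[i])            # stochastic gradient at w
            g_snap = grad_i(w_snap, X[i], Y[i])  # same example at the snapshot
            # Variance-reduced step: g - g_snap + mu is an unbiased estimate
            # of the full gradient at w.
            w = [wj - step * (gj - sj + mj)
                 for wj, gj, sj, mj in zip(w, g, g_snap, mu)]
        # Use the last inner iterate as the next snapshot.
        w_snap = w
    return w_snap


# Synthetic noisy linear data (an assumption made for this illustration).
data_rng = random.Random(1)
X = [[data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)] for _ in range(50)]
Y = [2.0 * x[0] - 1.0 * x[1] + data_rng.gauss(0, 0.1) for x in X]
w_hat = svrg(X, Y)
```

Note that the sketch keeps only the snapshot `w_snap` and the single vector `mu` between steps, rather than one stored gradient per example, which is the memory property the abstract contrasts with SDCA and SAG.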

Citation impact

Total citations: 1,933

FWCI: 134.92
Percentile: 100%
References: 8

Authors: 2

Topics & keywords

Keywords

Variance reduction
Stochastic gradient descent
Convergence (economics)
Variance (accounting)
Computer science
Mathematical optimization
Gradient descent
Dual (grammatical number)
