Solving large scale linear prediction problems using stochastic gradient descent algorithms

Zhang, Tong

doi:10.1145/1015330.1015332

articleJan 1, 2004Closed access

Solving large scale linear prediction problems using stochastic gradient descent algorithms

TZTong Zhang

IBM Research - Thomas J. Watson Research Center

Indexed incrossref

Abstract

Linear prediction methods, such as least squares for regression, logistic regression and support vector machines for classification, have been extensively used in statistics and machine learning. In this paper, we study stochastic gradient descent (SGD) algorithms on regularized forms of linear prediction methods. This class of methods, related to online algorithms such as perceptron, are both efficient and very simple to implement. We obtain numerical rate of convergence for such algorithms, and discuss its implications. Experiments on text data will be provided to demonstrate numerical and statistical consequences of our theoretical findings.

Citation impact

1,148

total citations

FWCI: 8.33
Percentile: 100%
References: 10

Citations per year

Authors

1

TZ
Tong ZhangCorresponding
IBM Research - Thomas J. Watson Research Center

Topics & keywords

Topics

Keywords

Stochastic gradient descent
Computer science
Perceptron
Support vector machine
Convergence (economics)
Algorithm
Rate of convergence
Gradient descent

No related works found for this paper.