articleDec 6, 2010Closed access

Online Learning for Latent Dirichlet Allocation

Princeton University · Institut national de recherche en sciences et technologies du numérique

Abstract

We develop an online variational Bayes (VB) algorithm for Latent Dirichlet Al-location (LDA). Online LDA is based on online stochastic optimization with a natural gradient step, which we show converges to a local optimum of the VB objective function. It can handily analyze massive document collections, includ-ing those arriving in a stream. We study the performance of online LDA in several ways, including by fitting a 100-topic topic model to 3.3M articles from Wikipedia in a single pass. We demonstrate that online LDA finds topic models as good or better than those found with batch VB, and in a fraction of the time. 1

Citation impact

1,296
total citations
FWCI
94.12
Percentile
100%
References
26
Citations per year

Authors

3

Topics & keywords

Keywords
  • Latent Dirichlet allocation
  • Topic model
  • Computer science
  • Function (biology)
  • Online algorithm
  • Dirichlet distribution
  • Bayes' theorem
  • Fraction (chemistry)
No related works found for this paper.