articleDec 6, 2010Closed access
Online Learning for Latent Dirichlet Allocation
Princeton University · Institut national de recherche en sciences et technologies du numérique
Abstract
We develop an online variational Bayes (VB) algorithm for Latent Dirichlet Al-location (LDA). Online LDA is based on online stochastic optimization with a natural gradient step, which we show converges to a local optimum of the VB objective function. It can handily analyze massive document collections, includ-ing those arriving in a stream. We study the performance of online LDA in several ways, including by fitting a 100-topic topic model to 3.3M articles from Wikipedia in a single pass. We demonstrate that online LDA finds topic models as good or better than those found with batch VB, and in a fraction of the time. 1
Citation impact
1,296
total citations
- FWCI
- 94.12
- Percentile
- 100%
- References
- 26
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Latent Dirichlet allocation
- Topic model
- Computer science
- Function (biology)
- Online algorithm
- Dirichlet distribution
- Bayes' theorem
- Fraction (chemistry)
No related works found for this paper.