articleJournal of Statistical SoftwareJan 1, 2011DIAMOND OA

topicmodels : An R Package for Fitting Topic Models

Indexed incrossrefdoaj

Abstract

Topic models allow the probabilistic modeling of term frequency occurrences in documents. The fitted model can be used to estimate the similarity between documents as well as between a set of specified keywords using an additional layer of latent variables which are referred to as topics. The R package topicmodels provides basic infrastructure for fitting topic models based on data structures from the text mining package tm. The package includes interfaces to two algorithms for fitting topic models: the variational expectation-maximization algorithm provided by David M. Blei and co-authors and an algorithm using Gibbs sampling by Xuan-Hieu Phan and co-authors.

Citation impact

1,077
total citations
FWCI
27.69
Percentile
100%
References
41
Citations per year

Authors

2

Topics & keywords

Keywords
  • Gibbs sampling
  • Computer science
  • Similarity (geometry)
  • Expectation–maximization algorithm
  • Set (abstract data type)
  • Topic model
  • Probabilistic logic
  • Data set
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.