articleJun 14, 2009GREEN OA
Evaluation methods for topic models
University of Massachusetts Amherst · University of Toronto
Indexed incrossref
Abstract
A natural evaluation metric for statistical topic models is the probability of held-out documents given a trained model. While exact computation of this probability is intractable, several estimators for this probability have been used in the topic modeling literature, including the harmonic mean method and empirical likelihood method. In this paper, we demonstrate experimentally that commonly-used methods are unlikely to accurately estimate the probability of heldout documents, and propose two alternative methods that are both accurate and efficient. 1.
Citation impact
876
total citations
- FWCI
- 40.75
- Percentile
- 100%
- References
- 17
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- Estimator
- Metric (unit)
- Statistical model
- Computation
- Probability model
- Empirical probability
- Artificial intelligence
No related works found for this paper.