Evaluation methods for topic models

Wallach, Hanna; Murray, Iain; Salakhutdinov, Ruslan; Mimno, David

doi:10.1145/1553374.1553515

Article · Jun 14, 2009 · Green OA

University of Massachusetts Amherst · University of Toronto

Indexed in Crossref

Abstract

A natural evaluation metric for statistical topic models is the probability of held-out documents given a trained model. While exact computation of this probability is intractable, several estimators for this probability have been used in the topic modeling literature, including the harmonic mean method and the empirical likelihood method. In this paper, we demonstrate experimentally that commonly used methods are unlikely to accurately estimate the probability of held-out documents, and propose two alternative methods that are both accurate and efficient.
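
For readers unfamiliar with the harmonic mean method named in the abstract, the following is a minimal sketch of its general form, not the paper's code. It assumes the caller already has log-likelihoods log P(w | z_s) for S topic assignments z_s sampled from the posterior given a held-out document w. The function name `log_harmonic_mean` and its input format are hypothetical, chosen for illustration only.

```python
import math

def log_harmonic_mean(log_likelihoods):
    """Return log of the harmonic mean of likelihoods supplied as logs.

    Assumption: each entry is log P(w | z_s) for one posterior sample z_s.
    The estimate is log S - log(sum_s 1 / P(w | z_s)), computed in log
    space so that very small likelihoods do not underflow.
    """
    if not log_likelihoods:
        raise ValueError("need at least one sample")
    neg = [-ll for ll in log_likelihoods]
    m = max(neg)  # shift by the maximum for numerical stability
    log_sum = m + math.log(sum(math.exp(v - m) for v in neg))
    return math.log(len(neg)) - log_sum

# Usage with two made-up likelihoods, 0.5 and 0.25:
estimate = log_harmonic_mean([math.log(0.5), math.log(0.25)])
```

The sketch shows only how such an estimate is formed; the abstract reports that commonly used estimators of this kind are unlikely to be accurate.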

Citation impact

Total citations: 876

FWCI: 40.75
Percentile: 100%
References: 17

Keywords

Computer science
Estimator
Metric (unit)
Statistical model
Computation
Probability model
Empirical probability
Artificial intelligence