Reading Tea Leaves: How Humans Interpret Topic Models

Chang, Jonathan; Gerrish, Sean; Wang, Chong; Boyd‐Graber, Jordan; Blei, David M.

articleDec 7, 2009Closed access

Reading Tea Leaves: How Humans Interpret Topic Models

JCJonathan Chang SGSean Gerrish CWChong Wang JBJordan Boyd‐Graber DMDavid M. Blei

Meta (United States) · Princeton University · +1 more institution

Abstract

Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both a predictive model of future text and a latent topic representation of the corpus. Practitioners typically assume that the latent space is semantically meaningful. It is used to check models, summarize the corpus, and guide explo-ration of its contents. However, whether the latent space is interpretable is in need of quantitative evaluation. In this paper, we present new quantitative methods for measuring semantic meaning in inferred topics. We back these measures with large-scale user studies, showing that they capture aspects of the model that are undetected by previous measures of model quality based on…

Citation impact

1,877

total citations

FWCI: 63.70
Percentile: 100%
References: 23

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Topic model
Computer science
Latent semantic analysis
Natural language processing
Artificial intelligence
Probabilistic latent semantic analysis
Representation (politics)
Meaning (existential)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.