Context dependent recurrent neural network language model

Mikolov, Tomáš; Zweig, Geoffrey

doi:10.1109/slt.2012.6424228

articleDec 1, 2012Closed access

Context dependent recurrent neural network language model

TMTomáš Mikolov GZGeoffrey Zweig

Microsoft (United States) · Brno University of Technology

Indexed incrossref

Abstract

Recurrent neural network language models (RNNLMs) have recently demonstrated state-of-the-art performance across a variety of tasks. In this paper, we improve their performance by providing a contextual real-valued input vector in association with each word. This vector is used to convey contextual information about the sentence being modeled. By performing Latent Dirichlet Allocation using a block of preceding text, we achieve a topic-conditioned RNNLM. This approach has the key advantage of avoiding the data fragmentation associated with building multiple topic models on different data subsets. We report perplexity results on the Penn Treebank data, where we achieve a new state-of-the-art. We further apply…

Citation impact

576

total citations

FWCI: 38.01
Percentile: 100%
References: 45

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Perplexity
Treebank
Computer science
Language model
Recurrent neural network
Artificial intelligence
Variety (cybernetics)
Sentence

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.