Context dependent recurrent neural network language model
Microsoft (United States) · Brno University of Technology
Abstract
Recurrent neural network language models (RNNLMs) have recently demonstrated state-of-the-art performance across a variety of tasks. In this paper, we improve their performance by providing a contextual real-valued input vector in association with each word. This vector is used to convey contextual information about the sentence being modeled. By performing Latent Dirichlet Allocation using a block of preceding text, we achieve a topic-conditioned RNNLM. This approach has the key advantage of avoiding the data fragmentation associated with building multiple topic models on different data subsets. We report perplexity results on the Penn Treebank data, where we achieve a new state-of-the-art. We further apply…
Citation impact
- FWCI
- 38.01
- Percentile
- 100%
- References
- 45
Authors
2Topics & keywords
- Perplexity
- Treebank
- Computer science
- Language model
- Recurrent neural network
- Artificial intelligence
- Variety (cybernetics)
- Sentence
- Peace, Justice and strong institutions