articleDec 8, 2008Closed access

A Scalable Hierarchical Distributed Language Model

University of Toronto

Abstract

Neural probabilistic language models (NPLMs) have been shown to be competitive with and occasionally superior to the widely-used n-gram language models. The main drawback of NPLMs is their extremely long training and testing times. Morin and Bengio have proposed a hierarchical language model built around a binary tree of words, which was two orders of magnitude faster than the nonhierarchical model it was based on. However, it performed considerably worse than its non-hierarchical counterpart in spite of using a word tree created using expert knowledge. We introduce a fast hierarchical language model along with a simple feature-based algorithm for automatic construction of word trees from the data. We then…

Citation impact

849
total citations
FWCI
15.28
Percentile
100%
References
12
Citations per year

Authors

2

Topics & keywords

Keywords
  • Language model
  • Computer science
  • Hierarchical database model
  • Cache language model
  • Artificial intelligence
  • Scalability
  • Probabilistic logic
  • Simple (philosophy)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.