articleSep 9, 2012Closed access

LSTM neural networks for language modeling

RWTH Aachen University

Indexed incrossref

Abstract

Neural networks have become increasingly popular for the task of language modeling. Whereas feed-forward networks only exploit a fixed context length to predict the next word of a sequence, conceptually, standard recurrent neural networks can take into account all of the predecessor words. On the other hand, it is well known that recurrent networks are difficult to train and therefore are unlikely to show the full potential of recurrent models. These problems are addressed by a the Long Short-Term Memory neural network architecture. In this work, we analyze this type of network on an English and a large French language modeling task. Experiments show improvements of about 8 % relative in perplexity over…

Citation impact

1,986
total citations
FWCI
32.28
Percentile
100%
References
22
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Artificial neural network
  • Artificial intelligence
  • Language model
  • Natural language processing
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.