articleJun 28, 2011Closed access

Generating Text with Recurrent Neural Networks

University of Toronto · University of New Brunswick

Abstract

Recurrent Neural Networks (RNNs) are very powerful sequence models that do not enjoy widespread use because it is extremely difficult to train them properly. Fortunately, recent advances in Hessian-free optimization have been able to overcome the difficulties associated with training RNNs, making it possible to apply them successfully to challenging sequence problems. In this paper we demonstrate the power of RNNs trained with the new Hessian-Free optimizer (HF) by applying them to character-level language modeling tasks. The standard RNN architecture, while effective, is not ideally suited for such tasks, so we introduce a new RNN variant that uses multiplicative (or “gated”) connections which allow the…

Citation impact

1,171
total citations
FWCI
41.82
Percentile
100%
References
24
Citations per year

Authors

3

Topics & keywords

Keywords
  • Recurrent neural network
  • Computer science
  • Language model
  • Sequence (biology)
  • Artificial intelligence
  • Hessian matrix
  • Artificial neural network
  • Multiplicative function
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.