Generating Text with Recurrent Neural Networks
University of Toronto · University of New Brunswick
Abstract
Recurrent Neural Networks (RNNs) are very powerful sequence models that do not enjoy widespread use because it is extremely difficult to train them properly. Fortunately, recent advances in Hessian-free optimization have been able to overcome the difficulties associated with training RNNs, making it possible to apply them successfully to challenging sequence problems. In this paper we demonstrate the power of RNNs trained with the new Hessian-Free optimizer (HF) by applying them to character-level language modeling tasks. The standard RNN architecture, while effective, is not ideally suited for such tasks, so we introduce a new RNN variant that uses multiplicative (or “gated”) connections which allow the…
Citation impact
- FWCI
- 41.82
- Percentile
- 100%
- References
- 24
Authors
3Topics & keywords
- Recurrent neural network
- Computer science
- Language model
- Sequence (biology)
- Artificial intelligence
- Hessian matrix
- Artificial neural network
- Multiplicative function
- Quality Education