How to Construct Deep Recurrent Neural Networks

Pascanu, Razvan; Gülçehre, Çaǧlar; Cho, Kyunghyun; Bengio, Yoshua

articleInternational Conference on Learning RepresentationsJan 1, 2014Closed access

How to Construct Deep Recurrent Neural Networks

RPRazvan Pascanu ÇGÇaǧlar Gülçehre KCKyunghyun Cho YBYoshua Bengio

Département d'Informatique · Université de Montréal · +1 more institution

Abstract

Abstract: In this paper, we explore different ways to extend a recurrent neural network (RNN) to a \textit{deep} RNN. We start by arguing that the concept of depth in an RNN is not as clear as it is in feedforward neural networks. By carefully analyzing and understanding the architecture of an RNN, however, we find three points of an RNN which may be made deeper; (1) input-to-hidden function, (2) hidden-to-hidden transition and (3) hidden-to-output function. Based on this observation, we propose two novel architectures of a deep RNN which are orthogonal to an earlier attempt of stacking multiple recurrent layers to build a deep RNN (Schmidhuber, 1992; El Hihi and Bengio, 1996). We provide an alternative…

Citation impact

581

total citations

FWCI: 36.67
Percentile: 100%
References: 44

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Recurrent neural network
Computer science
Deep learning
Artificial intelligence
Construct (python library)
Feedforward neural network
Feed forward
Function (biology)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.