Performance Evaluation of Deep Neural Networks Applied to Speech Recognition: RNN, LSTM and GRU

North Dakota State University

Indexed incrossref

Abstract

Abstract Deep Neural Networks (DNN) are nothing but neural networks with many hidden layers. DNNs are becoming popular in automatic speech recognition tasks which combines a good acoustic with a language model. Standard feedforward neural networks cannot handle speech data well since they do not have a way to feed information from a later layer back to an earlier layer. Thus, Recurrent Neural Networks (RNNs) have been introduced to take temporal dependencies into account. However, the shortcoming of RNNs is that long-term dependencies due to the vanishing/exploding gradient problem cannot be handled. Therefore, Long Short-Term Memory (LSTM) networks were introduced, which are a special case of RNNs, that takes…

Citation impact

561
total citations
FWCI
24.95
Percentile
100%
References
30
Citations per year

Authors

3

Topics & keywords

Keywords
  • Recurrent neural network
  • Computer science
  • Speech recognition
  • Artificial neural network
  • Artificial intelligence
  • Long short term memory
  • Word error rate
  • Word (group theory)
No related works found for this paper.

Funding