articleSep 14, 2014Closed access
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
Indexed incrossref
Abstract
Long Short-Term Memory (LSTM) is a specific recurrent neural network (RNN) architecture that was designed to model temporal sequences and their long-range dependencies more accurately than conventional RNNs. In this paper, we explore LSTM RNN architectures for large scale acoustic modeling in speech recognition. We recently showed that LSTM RNNs are more effective than DNNs and conventional RNNs for acoustic modeling, considering moderately-sized models trained on a single machine. Here, we introduce the first distributed training of LSTM RNNs using asynchronous stochastic gradient descent optimization on a large cluster of machines. We show that a two-layer deep LSTM RNN where each LSTM layer has a linear…
Citation impact
2,958
total citations
- FWCI
- 200.40
- Percentile
- 100%
- References
- 22
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer science
- Term (time)
- Long short term memory
- Artificial neural network
- Scale (ratio)
- Recurrent neural network
- Speech recognition
- Artificial intelligence
No related works found for this paper.