Improving deep neural networks for LVCSR using rectified linear units and dropout
University of Toronto · IBM (United States) · +1 more institution
Abstract
Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech recognition benchmarks. Deep neural nets have also achieved excellent results on various computer vision tasks using a random “dropout” procedure that drastically improves generalization error by randomly omitting a fraction of the hidden units in all layers. Since dropout helps avoid over-fitting, it has also been successful on a small-scale phone recognition task using larger neural nets. However, training deep neural net acoustic models for large vocabulary speech recognition takes a very long time and dropout is likely to only…
Citation impact
- FWCI
- 117.12
- Percentile
- 100%
- References
- 21
Authors
3Topics & keywords
- Computer science
- Dropout (neural networks)
- Artificial neural network
- Speech recognition
- Artificial intelligence
- Sigmoid function
- Discriminative model
- Deep neural networks
- Reduced inequalities