Improving deep neural networks for LVCSR using rectified linear units and dropout

Dahl, George E.; Sainath, Tara N.; Hinton, Geoffrey E.

doi:10.1109/icassp.2013.6639346

articleMay 1, 2013Closed access

Improving deep neural networks for LVCSR using rectified linear units and dropout

GEGeorge E. Dahl TNTara N. Sainath GEGeoffrey E. Hinton

University of Toronto · IBM (United States) · +1 more institution

Indexed incrossref

Abstract

Recently, pre-trained deep neural networks (DNNs) have outperformed traditional acoustic models based on Gaussian mixture models (GMMs) on a variety of large vocabulary speech recognition benchmarks. Deep neural nets have also achieved excellent results on various computer vision tasks using a random “dropout” procedure that drastically improves generalization error by randomly omitting a fraction of the hidden units in all layers. Since dropout helps avoid over-fitting, it has also been successful on a small-scale phone recognition task using larger neural nets. However, training deep neural net acoustic models for large vocabulary speech recognition takes a very long time and dropout is likely to only…

Citation impact

1,280

total citations

FWCI: 117.12
Percentile: 100%
References: 21

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Computer science
Dropout (neural networks)
Artificial neural network
Speech recognition
Artificial intelligence
Sigmoid function
Discriminative model
Deep neural networks

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.