Do Deep Nets Really Need to be Deep?

Ba, Jimmy; Caruana, Rich

doi:10.48550/arxiv.1312.6184

preprintarXiv (Cornell University)Dec 21, 2013GREEN OA

Do Deep Nets Really Need to be Deep?

JBJimmy Ba RCRich Caruana

University of Toronto · Microsoft (United States)

Indexed inarxivdatacite

Abstract

Currently, deep neural networks are the state of the art on problems such as speech recognition and computer vision. In this extended abstract, we show that shallow feed-forward networks can learn the complex functions previously learned by deep nets and achieve accuracies previously only achievable with deep models. Moreover, in some cases the shallow neural nets can learn these deep functions using a total number of parameters similar to the original deep model. We evaluate our method on the TIMIT phoneme recognition task and are able to train shallow fully-connected nets that perform similarly to complex, well-engineered, deep convolutional architectures. Our success in training shallow neural nets to mimic…

Citation impact

1,482

total citations

FWCI: —
Percentile: —
References: 21

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Deep learning
Computer science
Deep neural networks
Artificial intelligence
Task (project management)
Convolutional neural network
TIMIT
Deep water

No related works found for this paper.