preprintarXiv (Cornell University)Dec 21, 2013GREEN OA

Do Deep Nets Really Need to be Deep?

University of Toronto · Microsoft (United States)

Indexed inarxivdatacite

Abstract

Currently, deep neural networks are the state of the art on problems such as speech recognition and computer vision. In this extended abstract, we show that shallow feed-forward networks can learn the complex functions previously learned by deep nets and achieve accuracies previously only achievable with deep models. Moreover, in some cases the shallow neural nets can learn these deep functions using a total number of parameters similar to the original deep model. We evaluate our method on the TIMIT phoneme recognition task and are able to train shallow fully-connected nets that perform similarly to complex, well-engineered, deep convolutional architectures. Our success in training shallow neural nets to mimic…

Citation impact

1,482
total citations
FWCI
Percentile
References
21
Citations per year

Authors

2

Topics & keywords

Keywords
  • Deep learning
  • Computer science
  • Deep neural networks
  • Artificial intelligence
  • Task (project management)
  • Convolutional neural network
  • TIMIT
  • Deep water
No related works found for this paper.