preprintarXiv (Cornell University)Oct 23, 2017GREEN OA

A Survey of Model Compression and Acceleration for Deep Neural Networks

Indexed inarxivdatacite

Abstract

Deep neural networks (DNNs) have recently achieved great success in many visual recognition tasks. However, existing deep neural network models are computationally expensive and memory intensive, hindering their deployment in devices with low memory resources or in applications with strict latency requirements. Therefore, a natural thought is to perform model compression and acceleration in deep networks without significantly decreasing the model performance. During the past five years, tremendous progress has been made in this area. In this paper, we review the recent techniques for compacting and accelerating DNN models. In general, these techniques are divided into four categories: parameter pruning and…

Citation impact

884
total citations
FWCI
Percentile
References
81
Citations per year

Authors

4

Topics & keywords

Keywords
  • Acceleration
  • Artificial neural network
  • Compression (physics)
  • Computer science
  • Deep neural networks
  • Artificial intelligence
  • Physics
No related works found for this paper.