A Survey of Model Compression and Acceleration for Deep Neural Networks

Cheng, Yu; Wang, Duo; Zhou, Pan; Zhang, Tao

doi:10.48550/arxiv.1710.09282

Preprint; arXiv (Cornell University); Oct 23, 2017; Green OA

Indexed in: arXiv; DataCite

Abstract

Deep neural networks (DNNs) have recently achieved great success in many visual recognition tasks. However, existing deep neural network models are computationally expensive and memory intensive, hindering their deployment in devices with low memory resources or in applications with strict latency requirements. Therefore, a natural thought is to perform model compression and acceleration in deep networks without significantly decreasing the model performance. During the past five years, tremendous progress has been made in this area. In this paper, we review the recent techniques for compacting and accelerating DNN models. In general, these techniques are divided into four categories: parameter pruning and…

Citation impact

Total citations: 884

FWCI: —
Percentile: —
References: 81

Authors: 4

Topics & keywords

Keywords

Acceleration
Artificial neural network
Compression (physics)
Computer science
Deep neural networks
Artificial intelligence
Physics
