A Survey of Model Compression and Acceleration for Deep Neural Networks
Indexed inarxivdatacite
Abstract
Deep neural networks (DNNs) have recently achieved great success in many visual recognition tasks. However, existing deep neural network models are computationally expensive and memory intensive, hindering their deployment in devices with low memory resources or in applications with strict latency requirements. Therefore, a natural thought is to perform model compression and acceleration in deep networks without significantly decreasing the model performance. During the past five years, tremendous progress has been made in this area. In this paper, we review the recent techniques for compacting and accelerating DNN models. In general, these techniques are divided into four categories: parameter pruning and…
Citation impact
884
total citations
- FWCI
- —
- Percentile
- —
- References
- 81
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Acceleration
- Artificial neural network
- Compression (physics)
- Computer science
- Deep neural networks
- Artificial intelligence
- Physics
No related works found for this paper.