Structured Pruning of Deep Convolutional Neural Networks
Indexed incrossref
Abstract
Real-time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular network connections that not only demand extra representation efforts but also do not fit well on parallel computation. We introduce structured sparsity at various scales for convolutional neural networks: feature map-wise, kernel-wise, and intra-kernel strided sparsity. This structured sparsity is very advantageous for direct computational resource savings on embedded computers, in parallel computing environments, and in hardware-based systems. To decide the importance of…
Citation impact
715
total citations
- FWCI
- 26.99
- Percentile
- 100%
- References
- 34
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Pruning
- Computer science
- Kernel (algebra)
- Feature (linguistics)
- Convolutional neural network
- Computation
- Artificial intelligence
- Simple (philosophy)
UN Sustainable Development Goals
- Decent work and economic growth
No related works found for this paper.