Structured Pruning of Deep Convolutional Neural Networks

Seoul National University

Indexed incrossref

Abstract

Real-time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular network connections that not only demand extra representation efforts but also do not fit well on parallel computation. We introduce structured sparsity at various scales for convolutional neural networks: feature map-wise, kernel-wise, and intra-kernel strided sparsity. This structured sparsity is very advantageous for direct computational resource savings on embedded computers, in parallel computing environments, and in hardware-based systems. To decide the importance of…

Citation impact

715
total citations
FWCI
26.99
Percentile
100%
References
34
Citations per year

Authors

3

Topics & keywords

Keywords
  • Pruning
  • Computer science
  • Kernel (algebra)
  • Feature (linguistics)
  • Convolutional neural network
  • Computation
  • Artificial intelligence
  • Simple (philosophy)
UN Sustainable Development Goals
  • Decent work and economic growth
No related works found for this paper.

Funding