Structured Pruning of Deep Convolutional Neural Networks

Anwar, Sajid; Hwang, Kyuyeon; Sung, Wonyong

doi:10.1145/3005348

articleACM Journal on Emerging Technologies in Computing SystemsFeb 9, 2017Closed access

Structured Pruning of Deep Convolutional Neural Networks

SASajid Anwar KHKyuyeon Hwang WSWonyong Sung

Seoul National University

Indexed incrossref

Abstract

Real-time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular network connections that not only demand extra representation efforts but also do not fit well on parallel computation. We introduce structured sparsity at various scales for convolutional neural networks: feature map-wise, kernel-wise, and intra-kernel strided sparsity. This structured sparsity is very advantageous for direct computational resource savings on embedded computers, in parallel computing environments, and in hardware-based systems. To decide the importance of…

Citation impact

715

total citations

FWCI: 26.99
Percentile: 100%
References: 34

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Pruning
Computer science
Kernel (algebra)
Feature (linguistics)
Convolutional neural network
Computation
Artificial intelligence
Simple (philosophy)

UN Sustainable Development Goals

Decent work and economic growth

No related works found for this paper.

Funding

NR
National Research Foundation of Korea
Award: 2015R1A2A1A10056051