Pruning Convolutional Neural Networks for Resource Efficient Inference

Molchanov, Pavlo; Tyree, Stephen; Karras, Tero; Aila, Timo; Kautz, Jan

doi:10.48550/arxiv.1611.06440

preprintarXiv (Cornell University)Nov 19, 2016GREEN OA

Pruning Convolutional Neural Networks for Resource Efficient Inference

PMPavlo Molchanov STStephen Tyree TKTero Karras TATimo Aila JKJan Kautz

Indexed inarxivdatacite

Abstract

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that maintains good generalization in the pruned network. We propose a new criterion based on Taylor expansion that approximates the change in the cost function induced by pruning network parameters. We focus on transfer learning, where large pretrained networks are adapted to specialized tasks. The proposed criterion demonstrates superior performance compared to other criteria, e.g. the norm of kernel weights or feature map activation, for pruning large CNNs after adaptation to…

Citation impact

1,205

total citations

FWCI: —
Percentile: —
References: 23

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Convolutional neural network
Artificial intelligence
Inference
Pruning
Backpropagation
Classifier (UML)
Machine learning

UN Sustainable Development Goals

Decent work and economic growth

No related works found for this paper.