Pruning Filters for Efficient ConvNets
University of Maryland, College Park · Princeton University
Abstract
The success of CNNs in various applications is accompanied by a significant increase in the computation and parameter storage costs. Recent efforts toward reducing these overheads involve pruning and compressing the weights of various layers without hurting original accuracy. However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks. We present an acceleration method for CNNs, where we prune filters from CNNs that are identified as having a small effect on the output accuracy. By removing whole filters in the network…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 28
Authors
5Topics & keywords
- Pruning
- Computation
- Computer science
- Convolution (computer science)
- Inference
- Feature (linguistics)
- Filter (signal processing)
- Algorithm