ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Nanjing University · Shanghai Jiao Tong University
Abstract
We propose an efficient and unified framework, namely ThiNet, to simultaneously accelerate and compress CNN models in both training and inference stages. We focus on the filter level pruning, i.e., the whole filter would be discarded if it is less important. Our method does not change the original network structure, thus it can be perfectly supported by any off-the-shelf deep learning libraries. We formally establish filter pruning as an optimization problem, and reveal that we need to prune filters based on statistics information computed from its next layer, not the current layer, which differentiates ThiNet from existing methods. Experimental results demonstrate the effectiveness of this strategy, which has…
Citation impact
- FWCI
- 70.05
- Percentile
- 100%
- References
- 50
Authors
3Topics & keywords
- FLOPS
- Computer science
- Pruning
- Benchmark (surveying)
- Filter (signal processing)
- Generalization
- Inference
- Reduction (mathematics)