Incremental Network Quantization: Towards Lossless CNNs with\n Low-Precision Weights
Indexed inarxiv
Abstract
This paper presents incremental network quantization (INQ), a novel method,\ntargeting to efficiently convert any pre-trained full-precision convolutional\nneural network (CNN) model into a low-precision version whose weights are\nconstrained to be either powers of two or zero. Unlike existing methods which\nare struggled in noticeable accuracy loss, our INQ has the potential to resolve\nthis issue, as benefiting from two innovations. On one hand, we introduce three\ninterdependent operations, namely weight partition, group-wise quantization and\nre-training. A well-proven measure is employed to divide the weights in each\nlayer of a pre-trained CNN model into two disjoint groups. The weights in the\nfirst…
Citation impact
594
total citations
- FWCI
- —
- Percentile
- —
- References
- 0
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Quantization (signal processing)
- Computer science
- Convolutional neural network
- Disjoint sets
- Deep learning
- Residual neural network
- Artificial intelligence
- Lossless compression
UN Sustainable Development Goals
- Industry, innovation and infrastructure
No related works found for this paper.