preprintarXiv (Cornell University)Feb 9, 2017GREEN OA

Incremental Network Quantization: Towards Lossless CNNs with\n Low-Precision Weights

Indexed inarxiv

Abstract

This paper presents incremental network quantization (INQ), a novel method,\ntargeting to efficiently convert any pre-trained full-precision convolutional\nneural network (CNN) model into a low-precision version whose weights are\nconstrained to be either powers of two or zero. Unlike existing methods which\nare struggled in noticeable accuracy loss, our INQ has the potential to resolve\nthis issue, as benefiting from two innovations. On one hand, we introduce three\ninterdependent operations, namely weight partition, group-wise quantization and\nre-training. A well-proven measure is employed to divide the weights in each\nlayer of a pre-trained CNN model into two disjoint groups. The weights in the\nfirst…

Citation impact

594
total citations
FWCI
Percentile
References
0
Citations per year

Authors

5

Topics & keywords

Keywords
  • Quantization (signal processing)
  • Computer science
  • Convolutional neural network
  • Disjoint sets
  • Deep learning
  • Residual neural network
  • Artificial intelligence
  • Lossless compression
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.