Incremental Network Quantization: Towards Lossless CNNs with\n Low-Precision Weights

Zhou, Aojun; Yao, Anbang; Guo, Yiwen; Xu, Lin; Chen, Yurong

doi:10.48550/arxiv.1702.03044

preprintarXiv (Cornell University)Feb 9, 2017GREEN OA

Incremental Network Quantization: Towards Lossless CNNs with\n Low-Precision Weights

AZAojun Zhou AYAnbang Yao YGYiwen Guo LXLin Xu YCYurong Chen

Indexed inarxiv

Abstract

This paper presents incremental network quantization (INQ), a novel method,\ntargeting to efficiently convert any pre-trained full-precision convolutional\nneural network (CNN) model into a low-precision version whose weights are\nconstrained to be either powers of two or zero. Unlike existing methods which\nare struggled in noticeable accuracy loss, our INQ has the potential to resolve\nthis issue, as benefiting from two innovations. On one hand, we introduce three\ninterdependent operations, namely weight partition, group-wise quantization and\nre-training. A well-proven measure is employed to divide the weights in each\nlayer of a pre-trained CNN model into two disjoint groups. The weights in the\nfirst…

Citation impact

594

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Quantization (signal processing)
Computer science
Convolutional neural network
Disjoint sets
Deep learning
Residual neural network
Artificial intelligence
Lossless compression

UN Sustainable Development Goals

Industry, innovation and infrastructure

No related works found for this paper.