Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

Gong, Ruihao; Liu, Xianglong; Jiang, Shenghu; Li, Tianxiang; Hu, Peng; Lin, Jiazhen; Yu, Fengwei; Yan, Junjie

doi:10.1109/iccv.2019.00495

articleOct 1, 2019Closed access

Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks

RGRuihao Gong XLXianglong Liu SJShenghu Jiang TLTianxiang Li PHPeng Hu

Beihang University · Beijing Institute of Technology · +1 more institution

Indexed incrossref

Abstract

Hardware-friendly network quantization (e.g., binary/uniform quantization) can efficiently accelerate the inference and meanwhile reduce memory consumption of the deep neural networks, which is crucial for model deployment on resource-limited devices like mobile phones. However, due to the discreteness of low-bit quantization, existing quantization methods often face the unstable training process and severe performance degradation. To address this problem, in this paper we propose Differentiable Soft Quantization (DSQ) to bridge the gap between the full-precision and low-bit networks. DSQ can automatically evolve during training to gradually approximate the standard quantization. Owing to its differentiable…

Citation impact

457

total citations

FWCI: 22.96
Percentile: 100%
References: 76

Citations per year

Authors

8

Topics & keywords

Topics

Keywords

Quantization (signal processing)
Computer science
Differentiable function
Inference
Artificial neural network
Algorithm
Artificial intelligence
Mathematics

No related works found for this paper.