Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations

Hubara, Itay; Courbariaux, Matthieu; Soudry, Daniel; El‐Yaniv, Ran; Bengio, Yoshua

doi:10.48550/arxiv.1609.07061

preprintarXiv (Cornell University)Sep 22, 2016GREEN OA

Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations

IHItay Hubara MCMatthieu Courbariaux DSDaniel Soudry RERan El‐Yaniv YBYoshua Bengio

Technion – Israel Institute of Technology · Université de Montréal · +1 more institution

Indexed inarxivdatacite

Abstract

We introduce a method to train Quantized Neural Networks (QNNs) --- neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train-time the quantized weights and activations are used for computing the parameter gradients. During the forward pass, QNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations. As a result, power consumption is expected to be drastically reduced. We trained QNNs over the MNIST, CIFAR-10, SVHN and ImageNet datasets. The resulting QNNs achieve prediction accuracy comparable to their 32-bit counterparts. For example, our quantized version of AlexNet with 1-bit weights and 2-bit activations…

Citation impact

1,424

total citations

FWCI: —
Percentile: —
References: 53

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Artificial neural network
Computer science
Artificial intelligence
Training (meteorology)
Deep neural networks
Machine learning
Pattern recognition (psychology)

UN Sustainable Development Goals

Affordable and clean energy

No related works found for this paper.