Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
Technion – Israel Institute of Technology · Université de Montréal · +1 more institution
Abstract
We introduce a method to train Quantized Neural Networks (QNNs) --- neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train-time the quantized weights and activations are used for computing the parameter gradients. During the forward pass, QNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations. As a result, power consumption is expected to be drastically reduced. We trained QNNs over the MNIST, CIFAR-10, SVHN and ImageNet datasets. The resulting QNNs achieve prediction accuracy comparable to their 32-bit counterparts. For example, our quantized version of AlexNet with 1-bit weights and 2-bit activations…
Citation impact
- FWCI
- —
- Percentile
- —
- References
- 53
Authors
5Topics & keywords
- Artificial neural network
- Computer science
- Artificial intelligence
- Training (meteorology)
- Deep neural networks
- Machine learning
- Pattern recognition (psychology)
- Affordable and clean energy