Trained Ternary Quantization

Zhu, Chenzhuo; Han, Song; Mao, Huizi; Dally, William J.

doi:10.48550/arxiv.1612.01064

Preprint · arXiv (Cornell University) · Dec 4, 2016 · Green OA

Stanford Health Care · Stanford Medicine · +2 more institutions

Indexed in: arXiv, DataCite

Abstract

Deep neural networks are widely used in machine learning applications. However, large neural network models can be difficult to deploy on mobile devices with limited power budgets. To solve this problem, we propose Trained Ternary Quantization (TTQ), a method that reduces the precision of weights in neural networks to ternary values. This method causes very little accuracy degradation and can even improve the accuracy of some models (32-, 44-, and 56-layer ResNets on CIFAR-10, and AlexNet on ImageNet). Our AlexNet model is trained from scratch, which means it is as easy to train as a normal full-precision model. We highlight our trained quantization method that can learn both ternary values and…
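
To make "ternary values" concrete, here is a minimal sketch of mapping a layer's full-precision weights onto three values. The abstract shown here does not spell out the quantization rule, so everything specific is an assumption for illustration: the function name `ternarize`, the magnitude threshold `t * max(|w|)`, and the two scale values `w_pos` and `w_neg`, which are passed in as fixed numbers here rather than learned during training.

```python
def ternarize(weights, w_pos, w_neg, t=0.05):
    """Map each weight to one of three values: -w_neg, 0.0, or +w_pos.

    Hypothetical helper for illustration only. The threshold rule
    (t times the largest absolute weight) and the fixed scales are
    assumptions, not details taken from the abstract above.
    """
    if not weights:
        return []
    # Weights whose magnitude is at or below this threshold become zero.
    delta = t * max(abs(w) for w in weights)
    quantized = []
    for w in weights:
        if w > delta:
            quantized.append(w_pos)      # large positive weight
        elif w < -delta:
            quantized.append(-w_neg)     # large negative weight
        else:
            quantized.append(0.0)        # small weight is pruned to zero
    return quantized


layer = [0.9, -0.02, 0.3, -0.7, 0.01]
print(ternarize(layer, w_pos=1.2, w_neg=0.8, t=0.1))
```

After this mapping, each weight needs only two bits to record which of the three values it takes, plus the two scale values per layer, and that is where the reduction in model size comes from.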

Citation impact

Total citations: 746

References: 15

Authors: 4

Topics & keywords

Keywords

Ternary operation
Computer science
Quantization (signal processing)
Artificial neural network
Inference
Algorithm
Binary number
Artificial intelligence
