preprintarXiv (Cornell University)Jun 20, 2016GREEN OA

DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients

Indexed inarxivdatacite

Abstract

We propose DoReFa-Net, a method to train convolutional neural networks that have low bitwidth weights and activations using low bitwidth parameter gradients. In particular, during backward pass, parameter gradients are stochastically quantized to low bitwidth numbers before being propagated to convolutional layers. As convolutions during forward/backward passes can now operate on low bitwidth weights and activations/gradients respectively, DoReFa-Net can use bit convolution kernels to accelerate both training and inference. Moreover, as bit convolutions can be efficiently implemented on CPU, FPGA, ASIC and GPU, DoReFa-Net opens the way to accelerate training of low bitwidth neural network on these hardware.…

Citation impact

1,804
total citations
FWCI
Percentile
References
26
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Convolutional neural network
  • Net (polyhedron)
  • Convolution (computer science)
  • Inference
  • Set (abstract data type)
  • Algorithm
  • Parallel computing
No related works found for this paper.