A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
University of Zanjan · University Medical Center Freiburg
Abstract
Recent advancements in machine learning achieved by Deep Neural Networks (DNNs) have been significant. While demonstrating high accuracy, DNNs are associated with a huge number of parameters and computations, which leads to high memory usage and energy consumption. As a result, deploying DNNs on devices with constrained hardware resources poses significant challenges. To overcome this, various compression techniques have been widely employed to optimize DNN accelerators. A promising approach is quantization, in which the full-precision values are stored in low bit-width precision. Quantization not only reduces memory requirements but also replaces high-cost operations with low-cost ones. DNN quantization…
Citation impact
- FWCI
- 21.37
- Percentile
- 100%
- References
- 104
Authors
3Topics & keywords
- Computer science
- Quantization (signal processing)
- Floating point
- Linde–Buzo–Gray algorithm
- Computer engineering
- Deep learning
- Artificial neural network
- Artificial intelligence
- Affordable and clean energy