articleACM Transactions on Intelligent Systems and TechnologySep 11, 2023Closed access

A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification

University of Zanjan · University Medical Center Freiburg

Indexed incrossref

Abstract

Recent advancements in machine learning achieved by Deep Neural Networks (DNNs) have been significant. While demonstrating high accuracy, DNNs are associated with a huge number of parameters and computations, which leads to high memory usage and energy consumption. As a result, deploying DNNs on devices with constrained hardware resources poses significant challenges. To overcome this, various compression techniques have been widely employed to optimize DNN accelerators. A promising approach is quantization, in which the full-precision values are stored in low bit-width precision. Quantization not only reduces memory requirements but also replaces high-cost operations with low-cost ones. DNN quantization…

Citation impact

188
total citations
FWCI
21.37
Percentile
100%
References
104
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Quantization (signal processing)
  • Floating point
  • Linde–Buzo–Gray algorithm
  • Computer engineering
  • Deep learning
  • Artificial neural network
  • Artificial intelligence
UN Sustainable Development Goals
  • Affordable and clean energy
No related works found for this paper.