A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification

Rokh, Babak; Azarpeyvand, Ali; Khanteymoori, Alireza

doi:10.1145/3623402

articleACM Transactions on Intelligent Systems and TechnologySep 11, 2023Closed access

A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification

BRBabak Rokh AAAli Azarpeyvand AKAlireza Khanteymoori

University of Zanjan · University Medical Center Freiburg

Indexed incrossref

Abstract

Recent advancements in machine learning achieved by Deep Neural Networks (DNNs) have been significant. While demonstrating high accuracy, DNNs are associated with a huge number of parameters and computations, which leads to high memory usage and energy consumption. As a result, deploying DNNs on devices with constrained hardware resources poses significant challenges. To overcome this, various compression techniques have been widely employed to optimize DNN accelerators. A promising approach is quantization, in which the full-precision values are stored in low bit-width precision. Quantization not only reduces memory requirements but also replaces high-cost operations with low-cost ones. DNN quantization…

Citation impact

188

total citations

FWCI: 21.37
Percentile: 100%
References: 104

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Computer science
Quantization (signal processing)
Floating point
Linde–Buzo–Gray algorithm
Computer engineering
Deep learning
Artificial neural network
Artificial intelligence

UN Sustainable Development Goals

Affordable and clean energy

No related works found for this paper.