Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review

Shuvo, Md Maruf Hossain; Islam, Syed K.; Cheng, Jianlin; Morshed, Bashir I.

doi:10.1109/jproc.2022.3226481

reviewProceedings of the IEEEDec 14, 2022HYBRID OA

Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review

MMMd Maruf Hossain Shuvo SKSyed K. Islam JCJianlin Cheng BIBashir I. Morshed

Analog Devices (United States) · University of Missouri · +1 more institution

Indexed incrossref

Abstract

Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted in breakthroughs in many areas. However, deploying these highly accurate models for data-driven, learned, automatic, and practical machine learning (ML) solutions to end-user applications remains challenging. DL algorithms are often computationally expensive, power-hungry, and require large memory to process complex and iterative operations of millions of parameters. Hence, training and inference of DL models are typically performed on high-performance computing (HPC) clusters in the cloud. Data transmission to the cloud results in high latency, round-trip delay, security and privacy concerns, and the inability of…

Citation impact

325

total citations

FWCI: 39.03
Percentile: 100%
References: 475

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Edge device
Cloud computing
Edge computing
Software deployment
Deep learning
Inference
Artificial intelligence

No related works found for this paper.

Funding

UO
University of Missouri