articleIEEE Transactions on Wireless CommunicationsOct 18, 2019Closed access

Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing

Sun Yat-sen University

Indexed incrossref

Abstract

As a key technology of enabling Artificial Intelligence (AI) applications in 5G era, Deep Neural Networks (DNNs) have quickly attracted widespread attention. However, it is challenging to run computation-intensive DNN-based tasks on mobile devices due to the limited computation resources. What’s worse, traditional cloud-assisted DNN inference is heavily hindered by the significant wide-area network latency, leading to poor real-time performance as well as low quality of user experience. To address these challenges, in this paper, we propose Edgent , a framework that leverages edge computing for DNN collaborative inference through device-edge synergy. Edgent exploits two design knobs: (1) DNN partitioning that…

Citation impact

849
total citations
FWCI
60.20
Percentile
100%
References
65
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Inference
  • Edge computing
  • Artificial neural network
  • Enhanced Data Rates for GSM Evolution
  • Artificial intelligence
No related works found for this paper.

Funding