Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
Indexed incrossref
Abstract
As a key technology of enabling Artificial Intelligence (AI) applications in 5G era, Deep Neural Networks (DNNs) have quickly attracted widespread attention. However, it is challenging to run computation-intensive DNN-based tasks on mobile devices due to the limited computation resources. What’s worse, traditional cloud-assisted DNN inference is heavily hindered by the significant wide-area network latency, leading to poor real-time performance as well as low quality of user experience. To address these challenges, in this paper, we propose Edgent , a framework that leverages edge computing for DNN collaborative inference through device-edge synergy. Edgent exploits two design knobs: (1) DNN partitioning that…
Citation impact
849
total citations
- FWCI
- 60.20
- Percentile
- 100%
- References
- 65
Citations per year
Authors
4Topics & keywords
Topics
Keywords
- Computer science
- Inference
- Edge computing
- Artificial neural network
- Enhanced Data Rates for GSM Evolution
- Artificial intelligence
No related works found for this paper.