Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
University of California, Irvine · Northeastern University
Abstract
Mobile devices such as smartphones and autonomous vehicles increasingly rely on deep neural networks (DNNs) to execute complex inference tasks such as image classification and speech recognition, among others. However, continuously executing the entire DNN on mobile devices can quickly deplete their battery. Although task offloading to cloud/edge servers may decrease the mobile device’s computational burden, erratic patterns in channel quality, network, and edge server load can lead to a significant delay in task execution. Recently, approaches based on split computing (SC) have been proposed, where the DNN is split into a head and a tail model, executed respectively on the mobile device and on the edge…
Citation impact
- FWCI
- 33.05
- Percentile
- 100%
- References
- 203
Authors
3Topics & keywords
- Computer science
- Mobile device
- Cloud computing
- Inference
- Deep learning
- Edge device
- Server
- Task (project management)
- Affordable and clean energy