A Review on Edge Large Language Models: Design, Execution, and Applications
Zhejiang University of Technology · Zhejiang University
Abstract
Large language models (LLMs) have revolutionized natural language processing with their exceptional understanding, synthesizing, and reasoning capabilities. However, deploying LLMs on resource-constrained edge devices presents significant challenges due to computational limitations, memory constraints, and edge hardware heterogeneity. This survey provides a comprehensive overview of recent advancements in edge LLMs, covering the entire lifecycle—from resource-efficient model design and pre-deployment strategies to runtime inference optimizations. It also explores on-device applications across various domains. By synthesizing state-of-the-art techniques and identifying future research directions, this survey…
Citation impact
- FWCI
- 101.75
- Percentile
- 100%
- References
- 182
Authors
6- YZYue ZhengCorresponding
Zhejiang University of Technology
- YCYuhao Chen
Zhejiang University
- BQBin Qian
Zhejiang University
- XSXiufang Shi
Zhejiang University of Technology
- YSYuanchao Shu
Zhejiang University
Topics & keywords
- Computer science
- Programming language
- Enhanced Data Rates for GSM Evolution
- Software engineering
- Artificial intelligence