A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations
The University of Adelaide · Harbin Institute of Technology
Abstract
Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and to accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. More than three thousand pruning papers have been published from 2020 to 2024. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1)…
Citation impact
- FWCI
- 113.60
- Percentile
- 100%
- References
- 267
Authors
3Topics & keywords
- Artificial intelligence
- Computer science
- Taxonomy (biology)
- Artificial neural network
- Machine learning
- Pattern recognition (psychology)
- Biology