A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations

The University of Adelaide · Harbin Institute of Technology

PubMed
Indexed incrossrefpubmed

Abstract

Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and to accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. More than three thousand pruning papers have been published from 2020 to 2024. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1)…

Citation impact

362
total citations
FWCI
113.60
Percentile
100%
References
267
Citations per year

Authors

3

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Taxonomy (biology)
  • Artificial neural network
  • Machine learning
  • Pattern recognition (psychology)
  • Biology
No related works found for this paper.

Funding