,
volume 46
,
issue 12
,
pages 10558-10578
A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations
Publication type: Journal Article
Publication date: 2024-12-01
scimago Q1
wos Q1
SJR: 3.910
CiteScore: 35.0
Impact factor: 18.6
ISSN: 01628828, 21609292, 19393539
PubMed ID:
39167504
Abstract
Modern deep neural networks, particularly recent large language models, come with massive model sizes that require significant computational and storage resources. To enable the deployment of modern models on resource-constrained environments and to accelerate inference time, researchers have increasingly explored pruning techniques as a popular research direction in neural network compression. More than three thousand pruning papers have been published from 2020 to 2024. However, there is a dearth of up-to-date comprehensive review papers on pruning. To address this issue, in this survey, we provide a comprehensive review of existing research works on deep neural network pruning in a taxonomy of 1) universal/specific speedup, 2) when to prune, 3) how to prune, and 4) fusion of pruning and other compression techniques. We then provide a thorough comparative analysis of eight pairs of contrast settings for pruning (e.g., unstructured/structured, one-shot/iterative, data-free/data-driven, initialized/pre-trained weights, etc.) and explore several emerging topics, including pruning for large language models, vision transformers, diffusion models, and large multimodal models, post-training pruning, and different levels of supervision for pruning to shed light on the commonalities and differences of existing methods and lay the foundation for further method development. Finally, we provide some valuable recommendations on selecting pruning methods and prospect several promising research directions for neural network pruning. To facilitate future research on deep neural network pruning, we summarize broad pruning applications (e.g., adversarial robustness, natural language understanding, etc.) and build a curated collection of datasets, networks, and evaluations on different applications. We maintain a repository on https://github.com/hrcheng1066/awesome-pruning that serves as a comprehensive resource for neural network pruning papers and corresponding open-source codes. We will keep updating this repository to include the latest advancements in the field.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
2
4
6
8
10
12
14
|
|
|
IEEE Access
14 publications, 6.8%
|
|
|
Neurocomputing
7 publications, 3.4%
|
|
|
Sensors
5 publications, 2.43%
|
|
|
Remote Sensing
4 publications, 1.94%
|
|
|
Neural Networks
4 publications, 1.94%
|
|
|
Journal of Supercomputing
3 publications, 1.46%
|
|
|
Computers and Security
3 publications, 1.46%
|
|
|
Expert Systems with Applications
3 publications, 1.46%
|
|
|
Future Generation Computer Systems
3 publications, 1.46%
|
|
|
Lecture Notes in Computer Science
3 publications, 1.46%
|
|
|
Cluster Computing
3 publications, 1.46%
|
|
|
Engineering Applications of Artificial Intelligence
2 publications, 0.97%
|
|
|
Digital Signal Processing: A Review Journal
2 publications, 0.97%
|
|
|
Artificial Intelligence in Data and Big Data Processing
2 publications, 0.97%
|
|
|
Agriculture (Switzerland)
2 publications, 0.97%
|
|
|
IEEE Transactions on Geoscience and Remote Sensing
2 publications, 0.97%
|
|
|
Mathematics
2 publications, 0.97%
|
|
|
Electronics (Switzerland)
2 publications, 0.97%
|
|
|
IEEE Transactions on Mobile Computing
2 publications, 0.97%
|
|
|
Pattern Recognition
2 publications, 0.97%
|
|
|
Applied Sciences (Switzerland)
2 publications, 0.97%
|
|
|
Algorithms
2 publications, 0.97%
|
|
|
Scientific Reports
2 publications, 0.97%
|
|
|
Aquacultural Engineering
1 publication, 0.49%
|
|
|
Information (Switzerland)
1 publication, 0.49%
|
|
|
IEEE Internet of Things Journal
1 publication, 0.49%
|
|
|
Foundations of Data Science
1 publication, 0.49%
|
|
|
IEEE Open Journal of Nanotechnology
1 publication, 0.49%
|
|
|
IET Image Processing
1 publication, 0.49%
|
|
|
2
4
6
8
10
12
14
|
Publishers
|
10
20
30
40
50
60
70
80
90
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
86 publications, 41.75%
|
|
|
Elsevier
44 publications, 21.36%
|
|
|
Springer Nature
28 publications, 13.59%
|
|
|
MDPI
27 publications, 13.11%
|
|
|
Association for Computing Machinery (ACM)
6 publications, 2.91%
|
|
|
Wiley
4 publications, 1.94%
|
|
|
IOP Publishing
2 publications, 0.97%
|
|
|
American Institute of Mathematical Sciences (AIMS)
1 publication, 0.49%
|
|
|
Institution of Engineering and Technology (IET)
1 publication, 0.49%
|
|
|
Eco-Vector LLC
1 publication, 0.49%
|
|
|
Frontiers Media S.A.
1 publication, 0.49%
|
|
|
American Physical Society (APS)
1 publication, 0.49%
|
|
|
Public Library of Science (PLoS)
1 publication, 0.49%
|
|
|
World Scientific
1 publication, 0.49%
|
|
|
AME Publishing Company
1 publication, 0.49%
|
|
|
10
20
30
40
50
60
70
80
90
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
208
Total citations:
208
Citations from 2024:
187
(90.78%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Cheng H. et al. A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations // IEEE Transactions on Pattern Analysis and Machine Intelligence. 2024. Vol. 46. No. 12. pp. 10558-10578.
GOST all authors (up to 50)
Copy
Cheng H., Zhang M., Shi J. Q. A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations // IEEE Transactions on Pattern Analysis and Machine Intelligence. 2024. Vol. 46. No. 12. pp. 10558-10578.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1109/tpami.2024.3447085
UR - https://ieeexplore.ieee.org/document/10643325/
TI - A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations
T2 - IEEE Transactions on Pattern Analysis and Machine Intelligence
AU - Cheng, Hongrong
AU - Zhang, Miao
AU - Shi, Javen Qinfeng
PY - 2024
DA - 2024/12/01
PB - Institute of Electrical and Electronics Engineers (IEEE)
SP - 10558-10578
IS - 12
VL - 46
PMID - 39167504
SN - 0162-8828
SN - 2160-9292
SN - 1939-3539
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2024_Cheng,
author = {Hongrong Cheng and Miao Zhang and Javen Qinfeng Shi},
title = {A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
year = {2024},
volume = {46},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {dec},
url = {https://ieeexplore.ieee.org/document/10643325/},
number = {12},
pages = {10558--10578},
doi = {10.1109/tpami.2024.3447085}
}
Cite this
MLA
Copy
Cheng, Hongrong, et al. “A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations.” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, no. 12, Dec. 2024, pp. 10558-10578. https://ieeexplore.ieee.org/document/10643325/.