Open Access
Open access

Masked Generative Distillation

Zhendong Yang 1, 2
Zhe Li 2
Mingqi Shao 1
Dachuan Shi 1
Zehuan Yuan 2
Chun Yuan 1
1
 
Tsinghua Shenzhen International Graduate School, Shenzhen, China
2
 
ByteDance Inc., Beijing, China
Publication typeBook Chapter
Publication date2022-11-02
scimago Q2
SJR0.352
CiteScore2.4
Impact factor
ISSN03029743, 16113349, 18612075, 18612083
Abstract
Knowledge distillation has been applied to various tasks successfully. The current distillation algorithm usually improves students’ performance by imitating the output of the teacher. This paper shows that teachers can also improve students’ representation power by guiding students’ feature recovery. From this point of view, we propose Masked Generative Distillation (MGD), which is simple: we mask random pixels of the student’s feature and force it to generate the teacher’s full feature through a simple block. MGD is a truly general feature-based distillation method, which can be utilized on various tasks, including image classification, object detection, semantic segmentation and instance segmentation. We experiment on different models with extensive datasets and the results show that all the students achieve excellent improvements. Notably, we boost ResNet-18 from 69.90% to 71.69% ImageNet top-1 accuracy, RetinaNet with ResNet-50 backbone from 37.4 to 41.0 Boundingbox mAP, SOLO based on ResNet-50 from 33.1 to 36.2 Mask mAP and DeepLabV3 based on ResNet-18 from 73.20 to 76.02 mIoU. Our codes are available at https://github.com/yzd-v/MGD .
Found 
Found 

Top-30

Journals

2
4
6
8
10
12
Lecture Notes in Computer Science
11 publications, 6.01%
IEEE Transactions on Geoscience and Remote Sensing
9 publications, 4.92%
IEEE Access
5 publications, 2.73%
Knowledge-Based Systems
4 publications, 2.19%
Neurocomputing
4 publications, 2.19%
IEEE Transactions on Instrumentation and Measurement
4 publications, 2.19%
Expert Systems with Applications
3 publications, 1.64%
IEEE Geoscience and Remote Sensing Letters
3 publications, 1.64%
IEEE Transactions on Multimedia
3 publications, 1.64%
Image and Vision Computing
3 publications, 1.64%
IEEE Transactions on Circuits and Systems for Video Technology
3 publications, 1.64%
Sensors
3 publications, 1.64%
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
3 publications, 1.64%
Remote Sensing
2 publications, 1.09%
Pattern Recognition
2 publications, 1.09%
International Journal of Computer Vision
2 publications, 1.09%
Communications in Computer and Information Science
2 publications, 1.09%
Journal of Real-Time Image Processing
2 publications, 1.09%
ISPRS Journal of Photogrammetry and Remote Sensing
2 publications, 1.09%
Computers and Electrical Engineering
2 publications, 1.09%
Computers in Biology and Medicine
2 publications, 1.09%
Neural Networks
2 publications, 1.09%
Multimedia Systems
2 publications, 1.09%
Computers and Electronics in Agriculture
2 publications, 1.09%
Journal of Supercomputing
2 publications, 1.09%
IEEE Transactions on Intelligent Transportation Systems
2 publications, 1.09%
Animals
2 publications, 1.09%
Scientific Reports
2 publications, 1.09%
IEEE Open Journal of the Computer Society
1 publication, 0.55%
2
4
6
8
10
12

Publishers

10
20
30
40
50
60
70
80
90
100
Institute of Electrical and Electronics Engineers (IEEE)
92 publications, 50.27%
Elsevier
32 publications, 17.49%
Springer Nature
30 publications, 16.39%
MDPI
15 publications, 8.2%
Association for Computing Machinery (ACM)
6 publications, 3.28%
IOP Publishing
2 publications, 1.09%
Taylor & Francis
1 publication, 0.55%
Institution of Engineering and Technology (IET)
1 publication, 0.55%
PeerJ
1 publication, 0.55%
SAGE
1 publication, 0.55%
10
20
30
40
50
60
70
80
90
100
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
184
Share
Cite this
GOST |
Cite this
GOST Copy
Yang Z. et al. Masked Generative Distillation // Lecture Notes in Computer Science. 2022. pp. 53-69.
GOST all authors (up to 50) Copy
Yang Z., Li Z., Shao M., Shi D., Yuan Z., Yuan C. Masked Generative Distillation // Lecture Notes in Computer Science. 2022. pp. 53-69.
RIS |
Cite this
RIS Copy
TY - GENERIC
DO - 10.1007/978-3-031-20083-0_4
UR - https://doi.org/10.1007/978-3-031-20083-0_4
TI - Masked Generative Distillation
T2 - Lecture Notes in Computer Science
AU - Yang, Zhendong
AU - Li, Zhe
AU - Shao, Mingqi
AU - Shi, Dachuan
AU - Yuan, Zehuan
AU - Yuan, Chun
PY - 2022
DA - 2022/11/02
PB - Springer Nature
SP - 53-69
SN - 0302-9743
SN - 1611-3349
SN - 1861-2075
SN - 1861-2083
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@incollection{2022_Yang,
author = {Zhendong Yang and Zhe Li and Mingqi Shao and Dachuan Shi and Zehuan Yuan and Chun Yuan},
title = {Masked Generative Distillation},
publisher = {Springer Nature},
year = {2022},
pages = {53--69},
month = {nov}
}