Publication type: Proceedings Article
Publication date: 2021-10-01
Abstract
Knowledge distillation (KD) transfers the dark knowledge from cumbersome networks (teacher) to lightweight (student) networks and expects the student to achieve more promising performance than training without the teacher’s knowledge. However, a counter-intuitive argument is that better teachers do not make better students due to the capacity mismatch. To this end, we present a novel adaptive knowledge distillation method to complement traditional approaches. The proposed method, named as Student Customized Knowledge Distillation (SCKD), examines the capacity mismatch between teacher and student from the perspective of gradient similarity. We formulate the knowledge distillation as a multi-task learning problem so that the teacher transfers knowledge to the student only if the student can benefit from learning such knowledge. We validate our methods on multiple datasets with various teacher-student configurations on image classification, object detection, and semantic segmentation.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
2
4
6
8
10
12
|
|
|
Lecture Notes in Computer Science
11 publications, 17.19%
|
|
|
IEEE Transactions on Circuits and Systems for Video Technology
4 publications, 6.25%
|
|
|
Medical Image Analysis
2 publications, 3.13%
|
|
|
Neurocomputing
2 publications, 3.13%
|
|
|
Engineering Applications of Artificial Intelligence
2 publications, 3.13%
|
|
|
IEEE Journal of Biomedical and Health Informatics
2 publications, 3.13%
|
|
|
IEEE Transactions on Neural Networks and Learning Systems
2 publications, 3.13%
|
|
|
International Journal of Computer Vision
1 publication, 1.56%
|
|
|
Communications in Computer and Information Science
1 publication, 1.56%
|
|
|
IEEE Transactions on Visualization and Computer Graphics
1 publication, 1.56%
|
|
|
IEEE Access
1 publication, 1.56%
|
|
|
Studies in Computational Intelligence
1 publication, 1.56%
|
|
|
Pattern Recognition
1 publication, 1.56%
|
|
|
IEEE Transactions on Intelligent Vehicles
1 publication, 1.56%
|
|
|
IEEE Transactions on Multimedia
1 publication, 1.56%
|
|
|
ACM Transactions on Information Systems
1 publication, 1.56%
|
|
|
Lecture Notes in Electrical Engineering
1 publication, 1.56%
|
|
|
IEEE Transactions on Emerging Topics in Computational Intelligence
1 publication, 1.56%
|
|
|
IEEE Transactions on Geoscience and Remote Sensing
1 publication, 1.56%
|
|
|
IEEE Transactions on Pattern Analysis and Machine Intelligence
1 publication, 1.56%
|
|
|
Visual Computer
1 publication, 1.56%
|
|
|
Knowledge-Based Systems
1 publication, 1.56%
|
|
|
Future Generation Computer Systems
1 publication, 1.56%
|
|
|
IEEE Transactions on Circuits and Systems II: Express Briefs
1 publication, 1.56%
|
|
|
Information Sciences
1 publication, 1.56%
|
|
|
Complex & Intelligent Systems
1 publication, 1.56%
|
|
|
Information Fusion
1 publication, 1.56%
|
|
|
Journal of Software Evolution and Process
1 publication, 1.56%
|
|
|
IEEE Geoscience and Remote Sensing Letters
1 publication, 1.56%
|
|
|
2
4
6
8
10
12
|
Publishers
|
5
10
15
20
25
30
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
30 publications, 46.88%
|
|
|
Springer Nature
17 publications, 26.56%
|
|
|
Elsevier
13 publications, 20.31%
|
|
|
Association for Computing Machinery (ACM)
3 publications, 4.69%
|
|
|
Wiley
1 publication, 1.56%
|
|
|
5
10
15
20
25
30
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
64
Total citations:
64
Citations from 2024:
40
(62.5%)