Association for the Advancement of Artificial Intelligence (AAAI)

Proceedings of the AAAI Conference on Artificial Intelligence

Volume 35, issue 16, pages 14284-14291

Reinforced Multi-Teacher Selection for Knowledge Distillation

Yuan Fei ¹

Linjun Shou ²

Jian Pei ³

Wutao Lin ²

Ming Gong ²

Yan Fu ¹

Daxin Jiang ²

¹ University of Electronic Science and Technology of China

² Microsoft STCA NLP Group

³ School of Computing Science, Simon Fraser University

Publication type: Journal Article

Publication date: 2021-05-18

SJR: 0.133

CiteScore: 2.0

Impact factor: —

ISSN: 2159-5399 (print), 2374-3468 (electronic)

DOI: 10.1609/aaai.v35i16.17680

General Medicine

Abstract

In natural language processing (NLP) tasks, slow inference speed and large GPU footprints remain the bottlenecks to applying pre-trained deep models in production. As a popular method for model compression, knowledge distillation transfers knowledge from one or multiple large (teacher) models to a small (student) model. When multiple teacher models are available in distillation, the state-of-the-art methods assign a fixed weight to each teacher model throughout the whole distillation. Furthermore, most existing methods allocate an equal weight to every teacher model. In this paper, we observe that, due to the complexity of training examples and the differences in student model capability, learning differentially from teacher models can lead to better performance of the distilled student models. We systematically develop a reinforced method to dynamically assign weights to teacher models for different training instances and optimize the performance of the student model. Our extensive experimental results on several NLP tasks clearly verify the feasibility and effectiveness of our approach.
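
The contrast the abstract draws between equal teacher weights and per-instance teacher weights can be illustrated with a minimal sketch. This is not the authors' implementation: the reinforced policy that chooses the weights is not reproduced, the weights are simply passed in by hand, and the function names, the temperature value, and the toy logits are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def multi_teacher_kd_loss(student_logits, teacher_logits, weights, temperature=2.0):
    """Weighted sum of KL(teacher || student) over teachers, for ONE training instance.

    `weights` holds one weight per teacher. How these weights are chosen
    (fixed, equal, or by a learned policy) is outside this sketch.
    """
    if len(weights) != len(teacher_logits):
        raise ValueError("need exactly one weight per teacher")
    student_probs = softmax(student_logits, temperature)
    return sum(
        w * kl_divergence(softmax(t, temperature), student_probs)
        for w, t in zip(weights, teacher_logits)
    )


# Hypothetical toy instance: two teachers that disagree on a binary label.
student = [1.0, 0.0]
teachers = [[2.0, 0.0], [0.0, 2.0]]

equal_loss = multi_teacher_kd_loss(student, teachers, [0.5, 0.5])
instance_loss = multi_teacher_kd_loss(student, teachers, [1.0, 0.0])
print(equal_loss, instance_loss)
```

With equal weights the student is pulled toward both teachers on every instance; with per-instance weights the distillation signal for this example comes only from the first teacher. A dynamic scheme would supply a different weight vector for each training instance.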

Publishers

Publisher	Publications	Share
Institute of Electrical and Electronics Engineers (IEEE)	39	44.83%
Springer Nature	18	20.69%
MDPI	9	10.34%
Elsevier	8	9.2%
Association for Computing Machinery (ACM)	5	5.75%
SAGE	1	1.15%
Oxford University Press	1	1.15%
Cold Spring Harbor Laboratory	1	1.15%
Institution of Engineering and Technology (IET)	1	1.15%
Walter de Gruyter	1	1.15%
Frontiers Media S.A.	1	1.15%

We do not take into account publications without a DOI.
Statistics recalculated weekly.

GOST

Fei Y. et al. Reinforced Multi-Teacher Selection for Knowledge Distillation // Proceedings of the AAAI Conference on Artificial Intelligence. 2021. Vol. 35. No. 16. pp. 14284-14291.

GOST (all authors)

Fei Y., Shou L., Pei J., Lin W., Gong M., Fu Y., Jiang D. Reinforced Multi-Teacher Selection for Knowledge Distillation // Proceedings of the AAAI Conference on Artificial Intelligence. 2021. Vol. 35. No. 16. pp. 14284-14291.

RIS

TY - JOUR
DO - 10.1609/aaai.v35i16.17680
UR - https://doi.org/10.1609/aaai.v35i16.17680
TI - Reinforced Multi-Teacher Selection for Knowledge Distillation
T2 - Proceedings of the AAAI Conference on Artificial Intelligence
AU - Fei, Yuan
AU - Shou, Linjun
AU - Pei, Jian
AU - Lin, Wutao
AU - Gong, Ming
AU - Fu, Yan
AU - Jiang, Daxin
PY - 2021
DA - 2021/05/18
PB - Association for the Advancement of Artificial Intelligence (AAAI)
SP - 14284-14291
IS - 16
VL - 35
SN - 2159-5399
SN - 2374-3468
ER -

BibTeX

@article{2021_Fei,
  author = {Yuan Fei and Linjun Shou and Jian Pei and Wutao Lin and Ming Gong and Yan Fu and Daxin Jiang},
  title = {Reinforced Multi-Teacher Selection for Knowledge Distillation},
  journal = {Proceedings of the AAAI Conference on Artificial Intelligence},
  year = {2021},
  volume = {35},
  publisher = {Association for the Advancement of Artificial Intelligence (AAAI)},
  month = {may},
  url = {https://doi.org/10.1609/aaai.v35i16.17680},
  number = {16},
  pages = {14284--14291},
  doi = {10.1609/aaai.v35i16.17680}
}

MLA

Fei, Yuan, et al. “Reinforced Multi-Teacher Selection for Knowledge Distillation.” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 16, May 2021, pp. 14284-14291. https://doi.org/10.1609/aaai.v35i16.17680.

Journal	Publications	Share
Lecture Notes in Computer Science	8	9.2%
Applied Sciences (Switzerland)	3	3.45%
IEEE Transactions on Pattern Analysis and Machine Intelligence	3	3.45%
IEEE Access	2	2.3%
International Journal of Computer Vision	2	2.3%
Expert Systems with Applications	2	2.3%
Mathematics	2	2.3%
Applied Intelligence	2	2.3%
International Journal of Machine Learning and Cybernetics	2	2.3%
IEEE Transactions on Intelligent Transportation Systems	2	2.3%
Journal of Systems Architecture	1	1.15%
IEEE Computational Intelligence Magazine	1	1.15%
IEEE Transactions on Instrumentation and Measurement	1	1.15%
IEEE Transactions on Image Processing	1	1.15%
Proceedings of the ACM on Management of Data	1	1.15%
Sixth International Conference on Data Mining (ICDM'06)	1	1.15%
Conference Record of the Asilomar Conference on Signals, Systems and Computers	1	1.15%
Remote Sensing	1	1.15%
IEEE Transactions on Industrial Electronics	1	1.15%
Information Processing and Management	1	1.15%
Neural Networks	1	1.15%
IEEE/ACM Transactions on Audio Speech and Language Processing	1	1.15%
Journal of Intelligent and Fuzzy Systems	1	1.15%
ACM Transactions on Recommender Systems	1	1.15%
IEEE Sensors Journal	1	1.15%
IEEE Transactions on Knowledge and Data Engineering	1	1.15%
IEEE Transactions on Artificial Intelligence	1	1.15%
Electronics (Switzerland)	1	1.15%
Computational Biology and Chemistry	1	1.15%