Institute of Electrical and Electronics Engineers (IEEE)

IEEE Transactions on Multimedia

, том 25 , страницы 9015-9028

Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer

Тип публикации: Journal Article

Дата публикации: 2023-02-16

Institute of Electrical and Electronics Engineers (IEEE)

IEEE Transactions on Multimedia

SCImago Q1

Tоп 10% SCImago

WOS Q1

БС1

SJR: 1.989

CiteScore: 12.9

Impact factor: 9.7

ISSN: 15209210, 19410077

DOI: 10.1109/tmm.2023.3244340

Скопировать DOI

Computer Science Applications

Electrical and Electronic Engineering

Signal Processing

Media Technology

Краткое описание

Recently, vision transformers (ViTs) have been investigated in fine-grained visual recognition (FGVC) and are now considered state of the art. However, most ViT-based works ignore the different learning performances of the heads in the multi-head self-attention (MHSA) mechanism and its layers. To address these issues, in this paper, we propose a novel internal ensemble learning transformer (IELT) for FGVC. The proposed IELT involves three main modules: multi-head voting (MHV) module, cross-layer refinement (CLR) module, and dynamic selection (DS) module. To solve the problem of the inconsistent performances of multiple heads, we propose the MHV module, which considers all of the heads in each layer as weak learners and votes for tokens of discriminative regions as cross-layer feature based on the attention maps and spatial relationships. To effectively mine the cross-layer feature and suppress the noise, the CLR module is proposed, where the refined feature is extracted and the assist logits operation is developed for the final prediction. In addition, a newly designed DS module adjusts the token selection number at each layer by weighting their contributions of the refined feature. In this way, the idea of ensemble learning is combined with the ViT to improve fine-grained feature representation. The experiments demonstrate that our method achieves competitive results compared with the state of the art on five popular FGVC datasets. Source code has been released and can be found at https://github.com/mobulan/IELT .

Для доступа к списку цитирований публикации необходимо авторизоваться.

Войти с ORCID

Топ-30

Журналы

Издатели

	10 20 30 40 50 60
Institute of Electrical and Electronics Engineers (IEEE)	Institute of Electrical and Electronics Engineers (IEEE), 60, 44.12% Institute of Electrical and Electronics Engineers (IEEE) 60 публикаций, 44.12%
Elsevier	Elsevier, 38, 27.94% Elsevier 38 публикаций, 27.94%
Springer Nature	Springer Nature, 22, 16.18% Springer Nature 22 публикации, 16.18%
MDPI	MDPI, 10, 7.35% MDPI 10 публикаций, 7.35%
Association for Computing Machinery (ACM)	Association for Computing Machinery (ACM), 3, 2.21% Association for Computing Machinery (ACM) 3 публикации, 2.21%
SPIE-Intl Soc Optical Eng	SPIE-Intl Soc Optical Eng, 1, 0.74% SPIE-Intl Soc Optical Eng 1 публикация, 0.74%
Frontiers Media S.A.	Frontiers Media S.A., 1, 0.74% Frontiers Media S.A. 1 публикация, 0.74%
Wiley	Wiley, 1, 0.74% Wiley 1 публикация, 0.74%
	10 20 30 40 50 60

Мы не учитываем публикации, у которых нет DOI.
Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.

Войти с ORCID

Метрики

136

Цитировать

ГОСТ |

Цитировать

ГОСТ Скопировать

Xu Q. et al. Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer // IEEE Transactions on Multimedia. 2023. Vol. 25. pp. 9015-9028.

ГОСТ со всеми авторами (до 50) Скопировать

Xu Q., Wang J., Jiang B., Luo B. Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer // IEEE Transactions on Multimedia. 2023. Vol. 25. pp. 9015-9028.

RIS |

Цитировать

RIS Скопировать

TY - JOUR

DO - 10.1109/tmm.2023.3244340

UR - https://ieeexplore.ieee.org/document/10042971/

TI - Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer

T2 - IEEE Transactions on Multimedia

AU - Xu, Qin

AU - Wang, Jiahui

AU - Jiang, Bo

AU - Luo, Bin

PY - 2023

DA - 2023/02/16

PB - Institute of Electrical and Electronics Engineers (IEEE)

SP - 9015-9028

VL - 25

SN - 1520-9210

SN - 1941-0077

ER -

BibTex

Цитировать

BibTex (до 50 авторов) Скопировать

@article{2023_Xu,

author = {Qin Xu and Jiahui Wang and Bo Jiang and Bin Luo},

title = {Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer},

journal = {IEEE Transactions on Multimedia},

year = {2023},

volume = {25},

publisher = {Institute of Electrical and Electronics Engineers (IEEE)},

month = {feb},

url = {https://ieeexplore.ieee.org/document/10042971/},

pages = {9015--9028},

doi = {10.1109/tmm.2023.3244340}

}

Издатель

Institute of Electrical and Electronics Engineers (IEEE)

Журнал

IEEE Transactions on Multimedia

SCImago Q1

Tоп 10% SCImago

WOS Q1

БС1

SJR

1.989

CiteScore

12.9

Impact factor

9.7

ISSN

15209210 (Print)

19410077 (Electronic)

Ошибка в публикации?

	2 4 6 8 10
Pattern Recognition	Pattern Recognition, 10, 7.35% Pattern Recognition 10 публикаций, 7.35%
Lecture Notes in Computer Science	Lecture Notes in Computer Science, 7, 5.15% Lecture Notes in Computer Science 7 публикаций, 5.15%
IEEE Transactions on Multimedia	IEEE Transactions on Multimedia, 6, 4.41% IEEE Transactions on Multimedia 6 публикаций, 4.41%
IEEE Access	IEEE Access, 5, 3.68% IEEE Access 5 публикаций, 3.68%
IEEE Transactions on Image Processing	IEEE Transactions on Image Processing, 5, 3.68% IEEE Transactions on Image Processing 5 публикаций, 3.68%
Engineering Applications of Artificial Intelligence	Engineering Applications of Artificial Intelligence, 4, 2.94% Engineering Applications of Artificial Intelligence 4 публикации, 2.94%
Expert Systems with Applications	Expert Systems with Applications, 4, 2.94% Expert Systems with Applications 4 публикации, 2.94%
Visual Computer	Visual Computer, 4, 2.94% Visual Computer 4 публикации, 2.94%
IEEE Transactions on Circuits and Systems for Video Technology	IEEE Transactions on Circuits and Systems for Video Technology, 4, 2.94% IEEE Transactions on Circuits and Systems for Video Technology 4 публикации, 2.94%
Signal, Image and Video Processing	Signal, Image and Video Processing, 3, 2.21% Signal, Image and Video Processing 3 публикации, 2.21%
Neurocomputing	Neurocomputing, 3, 2.21% Neurocomputing 3 публикации, 2.21%
IEEE Internet of Things Journal	IEEE Internet of Things Journal, 3, 2.21% IEEE Internet of Things Journal 3 публикации, 2.21%
Applied Sciences (Switzerland)	Applied Sciences (Switzerland), 3, 2.21% Applied Sciences (Switzerland) 3 публикации, 2.21%
IEEE Transactions on Fuzzy Systems	IEEE Transactions on Fuzzy Systems, 3, 2.21% IEEE Transactions on Fuzzy Systems 3 публикации, 2.21%
Image and Vision Computing	Image and Vision Computing, 2, 1.47% Image and Vision Computing 2 публикации, 1.47%
IEEE Transactions on Geoscience and Remote Sensing	IEEE Transactions on Geoscience and Remote Sensing, 2, 1.47% IEEE Transactions on Geoscience and Remote Sensing 2 публикации, 1.47%
Electronics (Switzerland)	Electronics (Switzerland), 2, 1.47% Electronics (Switzerland) 2 публикации, 1.47%
IEEE International Joint Conference on Neural Networks (IJCNN)	IEEE International Joint Conference on Neural Networks (IJCNN), 2, 1.47% IEEE International Joint Conference on Neural Networks (IJCNN) 2 публикации, 1.47%
Computer Vision and Image Understanding	Computer Vision and Image Understanding, 2, 1.47% Computer Vision and Image Understanding 2 публикации, 1.47%
Knowledge-Based Systems	Knowledge-Based Systems, 2, 1.47% Knowledge-Based Systems 2 публикации, 1.47%
Communications in Computer and Information Science	Communications in Computer and Information Science, 2, 1.47% Communications in Computer and Information Science 2 публикации, 1.47%
Information Fusion	Information Fusion, 2, 1.47% Information Fusion 2 публикации, 1.47%
IEEE Transactions on Neural Networks and Learning Systems	IEEE Transactions on Neural Networks and Learning Systems, 2, 1.47% IEEE Transactions on Neural Networks and Learning Systems 2 публикации, 1.47%
Journal of Visual Communication and Image Representation	Journal of Visual Communication and Image Representation, 1, 0.74% Journal of Visual Communication and Image Representation 1 публикация, 0.74%
Sensors	Sensors, 1, 0.74% Sensors 1 публикация, 0.74%
Neural Computing and Applications	Neural Computing and Applications, 1, 0.74% Neural Computing and Applications 1 публикация, 0.74%
Applied Soft Computing Journal	Applied Soft Computing Journal, 1, 0.74% Applied Soft Computing Journal 1 публикация, 0.74%
Computers in Biology and Medicine	Computers in Biology and Medicine, 1, 0.74% Computers in Biology and Medicine 1 публикация, 0.74%
Computers and Electrical Engineering	Computers and Electrical Engineering, 1, 0.74% Computers and Electrical Engineering 1 публикация, 0.74%
	2 4 6 8 10