том 25 страницы 9015-9028

Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer

Тип публикацииJournal Article
Дата публикации2023-02-16
SCImago Q1
Tоп 10% SCImago
WOS Q1
БС1
SJR1.989
CiteScore12.9
Impact factor9.7
ISSN15209210, 19410077
Computer Science Applications
Electrical and Electronic Engineering
Signal Processing
Media Technology
Краткое описание
Recently, vision transformers (ViTs) have been investigated in fine-grained visual recognition (FGVC) and are now considered state of the art. However, most ViT-based works ignore the different learning performances of the heads in the multi-head self-attention (MHSA) mechanism and its layers. To address these issues, in this paper, we propose a novel internal ensemble learning transformer (IELT) for FGVC. The proposed IELT involves three main modules: multi-head voting (MHV) module, cross-layer refinement (CLR) module, and dynamic selection (DS) module. To solve the problem of the inconsistent performances of multiple heads, we propose the MHV module, which considers all of the heads in each layer as weak learners and votes for tokens of discriminative regions as cross-layer feature based on the attention maps and spatial relationships. To effectively mine the cross-layer feature and suppress the noise, the CLR module is proposed, where the refined feature is extracted and the assist logits operation is developed for the final prediction. In addition, a newly designed DS module adjusts the token selection number at each layer by weighting their contributions of the refined feature. In this way, the idea of ensemble learning is combined with the ViT to improve fine-grained feature representation. The experiments demonstrate that our method achieves competitive results compared with the state of the art on five popular FGVC datasets. Source code has been released and can be found at https://github.com/mobulan/IELT .
Для доступа к списку цитирований публикации необходимо авторизоваться.

Топ-30

Журналы

2
4
6
8
10
Pattern Recognition
10 публикаций, 7.35%
Lecture Notes in Computer Science
7 публикаций, 5.15%
IEEE Transactions on Multimedia
6 публикаций, 4.41%
IEEE Access
5 публикаций, 3.68%
IEEE Transactions on Image Processing
5 публикаций, 3.68%
Engineering Applications of Artificial Intelligence
4 публикации, 2.94%
Expert Systems with Applications
4 публикации, 2.94%
Visual Computer
4 публикации, 2.94%
IEEE Transactions on Circuits and Systems for Video Technology
4 публикации, 2.94%
Signal, Image and Video Processing
3 публикации, 2.21%
Neurocomputing
3 публикации, 2.21%
IEEE Internet of Things Journal
3 публикации, 2.21%
Applied Sciences (Switzerland)
3 публикации, 2.21%
IEEE Transactions on Fuzzy Systems
3 публикации, 2.21%
Image and Vision Computing
2 публикации, 1.47%
IEEE Transactions on Geoscience and Remote Sensing
2 публикации, 1.47%
Electronics (Switzerland)
2 публикации, 1.47%
IEEE International Joint Conference on Neural Networks (IJCNN)
2 публикации, 1.47%
Computer Vision and Image Understanding
2 публикации, 1.47%
Knowledge-Based Systems
2 публикации, 1.47%
Communications in Computer and Information Science
2 публикации, 1.47%
Information Fusion
2 публикации, 1.47%
IEEE Transactions on Neural Networks and Learning Systems
2 публикации, 1.47%
Journal of Visual Communication and Image Representation
1 публикация, 0.74%
Sensors
1 публикация, 0.74%
Neural Computing and Applications
1 публикация, 0.74%
Applied Soft Computing Journal
1 публикация, 0.74%
Computers in Biology and Medicine
1 публикация, 0.74%
Computers and Electrical Engineering
1 публикация, 0.74%
2
4
6
8
10

Издатели

10
20
30
40
50
60
Institute of Electrical and Electronics Engineers (IEEE)
60 публикаций, 44.12%
Elsevier
38 публикаций, 27.94%
Springer Nature
22 публикации, 16.18%
MDPI
10 публикаций, 7.35%
Association for Computing Machinery (ACM)
3 публикации, 2.21%
SPIE-Intl Soc Optical Eng
1 публикация, 0.74%
Frontiers Media S.A.
1 публикация, 0.74%
Wiley
1 публикация, 0.74%
10
20
30
40
50
60
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
 Войти с ORCID
Метрики
136
Поделиться
Цитировать
ГОСТ |
Цитировать
Xu Q. et al. Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer // IEEE Transactions on Multimedia. 2023. Vol. 25. pp. 9015-9028.
ГОСТ со всеми авторами (до 50) Скопировать
Xu Q., Wang J., Jiang B., Luo B. Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer // IEEE Transactions on Multimedia. 2023. Vol. 25. pp. 9015-9028.
RIS |
Цитировать
TY - JOUR
DO - 10.1109/tmm.2023.3244340
UR - https://ieeexplore.ieee.org/document/10042971/
TI - Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer
T2 - IEEE Transactions on Multimedia
AU - Xu, Qin
AU - Wang, Jiahui
AU - Jiang, Bo
AU - Luo, Bin
PY - 2023
DA - 2023/02/16
PB - Institute of Electrical and Electronics Engineers (IEEE)
SP - 9015-9028
VL - 25
SN - 1520-9210
SN - 1941-0077
ER -
BibTex
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2023_Xu,
author = {Qin Xu and Jiahui Wang and Bo Jiang and Bin Luo},
title = {Fine-Grained Visual Classification Via Internal Ensemble Learning Transformer},
journal = {IEEE Transactions on Multimedia},
year = {2023},
volume = {25},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {feb},
url = {https://ieeexplore.ieee.org/document/10042971/},
pages = {9015--9028},
doi = {10.1109/tmm.2023.3244340}
}
Ошибка в публикации?