volume 74 issue 4 pages 1224-1238

Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping

Yingxue Gao 1
Teng Wang 1
Lei Gong 1
Chao Wang 1
Dong Dai 2
Yang Yang 1
Xianglan Chen 1
Xi Li 1
Xuehai Zhou 1
Publication typeJournal Article
Publication date2025-04-01
scimago Q1
wos Q2
SJR1.156
CiteScore8.1
Impact factor3.8
ISSN00189340, 15579956, 23263814
Abstract
Vision transformer (ViT) models have demonstrated remarkable advantages in visual tasks. However, the ViT model contains various types of operators, and its sophisticated model structure imposes substantial computational complexity and storage burden. Existing hardware solutions still fail to fully unleash the ViT acceleration potential due to the mismatch between operators and hardware architectures, suffering from inefficient dataflow mapping. This work proposes HDViT, a full-fledged heterogeneous hardware accelerator on FPGA, to enhance the ViT acceleration by comprehensively analyzing and addressing the challenges of heterogeneous architecture design. Specifically, HDViT first develops a heterogeneous architecture design that is composed of multiple processing engines (PEs) to accelerate various operators in the ViT model. Then, HDViT devises a hybrid-oriented dataflow mapping strategy to reduce data transmission granularity and alleviate storage resource pressure. Lastly, to achieve the latency balancing among multiple PEs, we formulate the HDViT architecture and implement an automated exploration process to identify optimized parallelism parameters that satisfy computation and storage demands while enhancing the heterogeneous architectural performance. Experimental results indicate that HDViT achieves significant performance speedups of 2.16$\times$ and 3.51$\times$ compared to previous heterogeneous and unified accelerators, respectively. HDViT also achieves a maximum of 98.46% hardware utilization.
Found 
Found 

Top-30

Publishers

1
Institute of Electrical and Electronics Engineers (IEEE)
1 publication, 100%
1
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
1
Share
Cite this
GOST |
Cite this
GOST Copy
Gao Y. et al. Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping // IEEE Transactions on Computers. 2025. Vol. 74. No. 4. pp. 1224-1238.
GOST all authors (up to 50) Copy
Gao Y., Wang T., Gong L., Wang C., Dai D., Yang Y., Chen X., Li X., Zhou X. Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping // IEEE Transactions on Computers. 2025. Vol. 74. No. 4. pp. 1224-1238.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1109/tc.2024.3517751
UR - https://ieeexplore.ieee.org/document/10803023/
TI - Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping
T2 - IEEE Transactions on Computers
AU - Gao, Yingxue
AU - Wang, Teng
AU - Gong, Lei
AU - Wang, Chao
AU - Dai, Dong
AU - Yang, Yang
AU - Chen, Xianglan
AU - Li, Xi
AU - Zhou, Xuehai
PY - 2025
DA - 2025/04/01
PB - Institute of Electrical and Electronics Engineers (IEEE)
SP - 1224-1238
IS - 4
VL - 74
SN - 0018-9340
SN - 1557-9956
SN - 2326-3814
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2025_Gao,
author = {Yingxue Gao and Teng Wang and Lei Gong and Chao Wang and Dong Dai and Yang Yang and Xianglan Chen and Xi Li and Xuehai Zhou},
title = {Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping},
journal = {IEEE Transactions on Computers},
year = {2025},
volume = {74},
publisher = {Institute of Electrical and Electronics Engineers (IEEE)},
month = {apr},
url = {https://ieeexplore.ieee.org/document/10803023/},
number = {4},
pages = {1224--1238},
doi = {10.1109/tc.2024.3517751}
}
MLA
Cite this
MLA Copy
Gao, Yingxue, et al. “Hardware Accelerated Vision Transformer via Heterogeneous Architecture Design and Adaptive Dataflow Mapping.” IEEE Transactions on Computers, vol. 74, no. 4, Apr. 2025, pp. 1224-1238. https://ieeexplore.ieee.org/document/10803023/.