OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver
1
HPC Department, Cineca, Rome, Italy
|
2
Intel Corporation Italia S.p.A., Milan, Italy
|
Publication type: Journal Article
Publication date: 2024-06-06
scimago Q2
wos Q2
SJR: 0.716
CiteScore: 7.1
Impact factor: 2.7
ISSN: 09208542, 15730484
Abstract
Nearly 20 years after the birth of general-purpose GPU computing, the HPC landscape is now dominated by GPUs. After years of undisputed dominance by NVIDIA, new players have entered the arena in a convincing manner, namely AMD and more recently Intel, whose devices currently power the first two clusters in the Top500 ranking. Unfortunately, code porting is still a major problem, even more due to the presence of different vendors, but at the same time the emergence of simplified standard paradigms suggests an encouraging prospect for developers. In this work, we provide a detailed OpenMP porting strategy of STREAmS, a community code for the compressible fluid dynamics. The proposed porting technique is based on the offload functionality of the OpenMP 5.x paradigm and in particular on a hybrid directives/APIs approach that fits seamlessly into the multi-backend software ecosystem of STREAmS. We further carry out a comprehensive performance analysis on the Intel® Data Center GPU Max 1550 (formerly called Ponte Vecchio or PVC). In addition, we analyze the performance of the code on two benchmark clusters powered by PVC, including the exascale Aurora cluster. The performance is evaluated at different levels of parallelism involved, i.e., the intrinsic parallelism of the PVC tile, the inter-tile parallelism within the GPU configuration, between the GPUs within the node and between the nodes within the cluster. The analysis shows that although the implementation complexity of the OpenMP porting is limited, it is necessary to follow some important guidelines to achieve satisfactory performance. The PVC GPU shows about 40% higher performance than the NVIDIA A100 or AMD MI250X GPUs, which, however, were released about 3 years earlier. Both intra-node and internode scalability show good results. Overall, the introduction of PVC into the GPU computing HPC landscape represents a positive step forward for the diversification and competitiveness of the sector.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
2
|
|
|
Computer Physics Communications
2 publications, 28.57%
|
|
|
Journal of Parallel and Distributed Computing
1 publication, 14.29%
|
|
|
Lecture Notes in Computer Science
1 publication, 14.29%
|
|
|
1
2
|
Publishers
|
1
2
3
|
|
|
Elsevier
3 publications, 42.86%
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
3 publications, 42.86%
|
|
|
Springer Nature
1 publication, 14.29%
|
|
|
1
2
3
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
7
Total citations:
7
Citations from 2024:
7
(100%)
Cite this
GOST |
RIS |
BibTex |
MLA
Cite this
GOST
Copy
Salvadore F. et al. OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver // Journal of Supercomputing. 2024. Vol. 80. No. 14. pp. 21094-21127.
GOST all authors (up to 50)
Copy
Salvadore F., Rossi G., Sathyanarayana S., Bernardini M. OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver // Journal of Supercomputing. 2024. Vol. 80. No. 14. pp. 21094-21127.
Cite this
RIS
Copy
TY - JOUR
DO - 10.1007/s11227-024-06254-y
UR - https://link.springer.com/10.1007/s11227-024-06254-y
TI - OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver
T2 - Journal of Supercomputing
AU - Salvadore, Francesco
AU - Rossi, Giacomo
AU - Sathyanarayana, Srikanth
AU - Bernardini, Matteo
PY - 2024
DA - 2024/06/06
PB - Springer Nature
SP - 21094-21127
IS - 14
VL - 80
SN - 0920-8542
SN - 1573-0484
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2024_Salvadore,
author = {Francesco Salvadore and Giacomo Rossi and Srikanth Sathyanarayana and Matteo Bernardini},
title = {OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver},
journal = {Journal of Supercomputing},
year = {2024},
volume = {80},
publisher = {Springer Nature},
month = {jun},
url = {https://link.springer.com/10.1007/s11227-024-06254-y},
number = {14},
pages = {21094--21127},
doi = {10.1007/s11227-024-06254-y}
}
Cite this
MLA
Copy
Salvadore, Francesco, et al. “OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver.” Journal of Supercomputing, vol. 80, no. 14, Jun. 2024, pp. 21094-21127. https://link.springer.com/10.1007/s11227-024-06254-y.