Job runtime prediction of HPC cluster based on PC-Transformer
Publication type: Journal Article
Publication date: 2023-06-12
scimago Q2
wos Q2
SJR: 0.716
CiteScore: 7.1
Impact factor: 2.7
ISSN: 09208542, 15730484
Hardware and Architecture
Information Systems
Software
Theoretical Computer Science
Abstract
Job scheduling of high performance cluster is a crucial task that affects the efficiency and performance of the system. The accuracy of job runtime prediction is one of the key factors that influences the quality of job scheduling. In this paper, we propose a novel method for job runtime prediction based on Transformer with plain connection and attention mechanism. The proposed method utilizes the job category information obtained by clustering the historical log datasets, and selects six-dimensional features that are highly correlated with job runtime. We divide the datasets into multiple job sets according to the length of job runtime, train and predict each job set separately. We evaluate the proposed method on the HPC2N dataset, and compare it with several existing methods. The results show that the proposed method achieves an average accuracy of 0.892, with 15.2% MAPE, and outperforms other methods in terms of prediction performance and training time. The proposed method can be applied to improve the efficiency and quality of job scheduling in high performance cluster.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
|
|
|
IEEE Transactions on Services Computing
1 publication, 16.67%
|
|
|
Communications in Computer and Information Science
1 publication, 16.67%
|
|
|
1
|
Publishers
|
1
2
3
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
3 publications, 50%
|
|
|
Association for Computing Machinery (ACM)
2 publications, 33.33%
|
|
|
Springer Nature
1 publication, 16.67%
|
|
|
1
2
3
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
6
Total citations:
6
Citations from 2024:
6
(100%)
Cite this
GOST |
RIS |
BibTex
Cite this
RIS
Copy
TY - JOUR
DO - 10.1007/s11227-023-05470-2
UR - https://doi.org/10.1007/s11227-023-05470-2
TI - Job runtime prediction of HPC cluster based on PC-Transformer
T2 - Journal of Supercomputing
AU - Fengxian, Chen
PY - 2023
DA - 2023/06/12
PB - Springer Nature
SN - 0920-8542
SN - 1573-0484
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2023_Fengxian,
author = {Chen Fengxian},
title = {Job runtime prediction of HPC cluster based on PC-Transformer},
journal = {Journal of Supercomputing},
year = {2023},
publisher = {Springer Nature},
month = {jun},
url = {https://doi.org/10.1007/s11227-023-05470-2},
doi = {10.1007/s11227-023-05470-2}
}