Job runtime prediction of HPC cluster based on PC-Transformer

Publication typeJournal Article
Publication date2023-06-12
scimago Q2
wos Q2
SJR0.716
CiteScore7.1
Impact factor2.7
ISSN09208542, 15730484
Hardware and Architecture
Information Systems
Software
Theoretical Computer Science
Abstract
Job scheduling of high performance cluster is a crucial task that affects the efficiency and performance of the system. The accuracy of job runtime prediction is one of the key factors that influences the quality of job scheduling. In this paper, we propose a novel method for job runtime prediction based on Transformer with plain connection and attention mechanism. The proposed method utilizes the job category information obtained by clustering the historical log datasets, and selects six-dimensional features that are highly correlated with job runtime. We divide the datasets into multiple job sets according to the length of job runtime, train and predict each job set separately. We evaluate the proposed method on the HPC2N dataset, and compare it with several existing methods. The results show that the proposed method achieves an average accuracy of 0.892, with 15.2% MAPE, and outperforms other methods in terms of prediction performance and training time. The proposed method can be applied to improve the efficiency and quality of job scheduling in high performance cluster.
Found 
Found 

Top-30

Journals

1
IEEE Transactions on Services Computing
1 publication, 16.67%
Communications in Computer and Information Science
1 publication, 16.67%
1

Publishers

1
2
3
Institute of Electrical and Electronics Engineers (IEEE)
3 publications, 50%
Association for Computing Machinery (ACM)
2 publications, 33.33%
Springer Nature
1 publication, 16.67%
1
2
3
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
6
Share
Cite this
GOST |
Cite this
GOST Copy
Fengxian C. Job runtime prediction of HPC cluster based on PC-Transformer // Journal of Supercomputing. 2023.
GOST all authors (up to 50) Copy
Fengxian C. Job runtime prediction of HPC cluster based on PC-Transformer // Journal of Supercomputing. 2023.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1007/s11227-023-05470-2
UR - https://doi.org/10.1007/s11227-023-05470-2
TI - Job runtime prediction of HPC cluster based on PC-Transformer
T2 - Journal of Supercomputing
AU - Fengxian, Chen
PY - 2023
DA - 2023/06/12
PB - Springer Nature
SN - 0920-8542
SN - 1573-0484
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2023_Fengxian,
author = {Chen Fengxian},
title = {Job runtime prediction of HPC cluster based on PC-Transformer},
journal = {Journal of Supercomputing},
year = {2023},
publisher = {Springer Nature},
month = {jun},
url = {https://doi.org/10.1007/s11227-023-05470-2},
doi = {10.1007/s11227-023-05470-2}
}