International Journal of Software Engineering and Knowledge Engineering

, pages 1-27

Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization

Yang Yang ¹

Weiwei Wang ²

Zheng Li ³

Lieshan Zhang ¹

Chaoyue Pan ³

Hide authors affiliations Show authors affiliations: 3 affiliations

School of Information Science and Engineering, Zhejiang Sci-Tech University, Zhejiang 310018, P. R. China |

School of Information Engineering, Beijing Institute of Petrochemical Technology, Beijing 100095, P. R. China |

College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, P. R. China |

Publication type: Journal Article

Publication date: 2024-06-07

World Scientific

International Journal of Software Engineering and Knowledge Engineering

scimago Q3

wos Q4

SJR: 0.206

CiteScore: 1.8

Impact factor: 0.6

ISSN: 02181940, 17936403

DOI: 10.1142/s0218194024500244

Copy DOI

Abstract

Continuous automated testing throughout each cycle can ensure the security of the continuous integration (CI) development lifecycle. Test case prioritization (TCP) is a critical factor in optimizing automated testing, which prioritizes potentially failed test cases and improves the efficiency of automated testing. In CI automated testing, the TCP is a continuous decision-making process that can be solved with reinforcement learning (RL). RL-based CITCP can continuously generate a TCP strategy for each CI development lifecycle, with the reward mechanism as the core. The reward mechanism consists of the reward function and the reward strategy. However, there are new challenges to RL-based CITCP in real-industry CI testing. With high-frequency iteration, the reward function is often calculated with a fixed length of historical information, ignoring the spatial characteristics of the current cycle. Therefore, the dynamic time window (DTW)-based reward function is proposed to perform the reward calculation, which adaptively adjusts the recent historical information range based on the integration cycle. Moreover, with low-failure testing, the reward strategy usually only rewards failure test cases, which creates a sparse reward problem in RL. To address this issue, the similarity-based reward strategy is proposed, which increases the reward objects of some passed test cases, similar to the failure test cases. The DTW-based reward function and the similarity-based reward strategy together constitute the proposed adaptive reward mechanism in RL-based CITCP. To validate the effectiveness of the adaptive reward mechanism, experimental verification is carried out on 13 industrial data sets. The experimental results show that the adaptive reward mechanism can improve the TCP effect, where the average NAPFD is maximally improved by 7.29%, the average Recall is maximally improved by 6.04% and the average TTF is improved by 6.81 positions with a maximum of 63.77.

Found

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.

Metrics

Cite this

GOST |

Cite this

GOST Copy

Yang Y. et al. Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization // International Journal of Software Engineering and Knowledge Engineering. 2024. pp. 1-27.

GOST all authors (up to 50) Copy

Yang Y., Wang W., Li Z., Zhang L., Pan C. Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization // International Journal of Software Engineering and Knowledge Engineering. 2024. pp. 1-27.

RIS |

Cite this

RIS Copy

TY - JOUR

DO - 10.1142/s0218194024500244

UR - https://www.worldscientific.com/doi/10.1142/S0218194024500244

TI - Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization

T2 - International Journal of Software Engineering and Knowledge Engineering

AU - Yang, Yang

AU - Wang, Weiwei

AU - Li, Zheng

AU - Zhang, Lieshan

AU - Pan, Chaoyue

PY - 2024

DA - 2024/06/07

PB - World Scientific

SP - 1-27

SN - 0218-1940

SN - 1793-6403

ER -

BibTex

Cite this

BibTex (up to 50 authors) Copy

@article{2024_Yang,

author = {Yang Yang and Weiwei Wang and Zheng Li and Lieshan Zhang and Chaoyue Pan},

title = {Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization},

journal = {International Journal of Software Engineering and Knowledge Engineering},

year = {2024},

publisher = {World Scientific},

month = {jun},

url = {https://www.worldscientific.com/doi/10.1142/S0218194024500244},

pages = {1--27},

doi = {10.1142/s0218194024500244}

}

Publisher

World Scientific

Journal

International Journal of Software Engineering and Knowledge Engineering

scimago Q3

wos Q4

SJR

0.206

CiteScore

1.8

Impact factor

0.6

ISSN

02181940 (Print, Electronic)

17936403 (Electronic)