Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization

Publication typeJournal Article
Publication date2024-06-07
scimago Q3
wos Q4
SJR0.206
CiteScore1.8
Impact factor0.6
ISSN02181940, 17936403
Abstract

Continuous automated testing throughout each cycle can ensure the security of the continuous integration (CI) development lifecycle. Test case prioritization (TCP) is a critical factor in optimizing automated testing, which prioritizes potentially failed test cases and improves the efficiency of automated testing. In CI automated testing, the TCP is a continuous decision-making process that can be solved with reinforcement learning (RL). RL-based CITCP can continuously generate a TCP strategy for each CI development lifecycle, with the reward mechanism as the core. The reward mechanism consists of the reward function and the reward strategy. However, there are new challenges to RL-based CITCP in real-industry CI testing. With high-frequency iteration, the reward function is often calculated with a fixed length of historical information, ignoring the spatial characteristics of the current cycle. Therefore, the dynamic time window (DTW)-based reward function is proposed to perform the reward calculation, which adaptively adjusts the recent historical information range based on the integration cycle. Moreover, with low-failure testing, the reward strategy usually only rewards failure test cases, which creates a sparse reward problem in RL. To address this issue, the similarity-based reward strategy is proposed, which increases the reward objects of some passed test cases, similar to the failure test cases. The DTW-based reward function and the similarity-based reward strategy together constitute the proposed adaptive reward mechanism in RL-based CITCP. To validate the effectiveness of the adaptive reward mechanism, experimental verification is carried out on 13 industrial data sets. The experimental results show that the adaptive reward mechanism can improve the TCP effect, where the average NAPFD is maximally improved by 7.29%, the average Recall is maximally improved by 6.04% and the average TTF is improved by 6.81 positions with a maximum of 63.77.

Found 

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
0
Share
Cite this
GOST |
Cite this
GOST Copy
Yang Y. et al. Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization // International Journal of Software Engineering and Knowledge Engineering. 2024. pp. 1-27.
GOST all authors (up to 50) Copy
Yang Y., Wang W., Li Z., Zhang L., Pan C. Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization // International Journal of Software Engineering and Knowledge Engineering. 2024. pp. 1-27.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1142/s0218194024500244
UR - https://www.worldscientific.com/doi/10.1142/S0218194024500244
TI - Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization
T2 - International Journal of Software Engineering and Knowledge Engineering
AU - Yang, Yang
AU - Wang, Weiwei
AU - Li, Zheng
AU - Zhang, Lieshan
AU - Pan, Chaoyue
PY - 2024
DA - 2024/06/07
PB - World Scientific
SP - 1-27
SN - 0218-1940
SN - 1793-6403
ER -
BibTex
Cite this
BibTex (up to 50 authors) Copy
@article{2024_Yang,
author = {Yang Yang and Weiwei Wang and Zheng Li and Lieshan Zhang and Chaoyue Pan},
title = {Security Development Lifecycle-based Adaptive Reward Mechanism for Reinforcement Learning in Continuous Integration Testing Optimization},
journal = {International Journal of Software Engineering and Knowledge Engineering},
year = {2024},
publisher = {World Scientific},
month = {jun},
url = {https://www.worldscientific.com/doi/10.1142/S0218194024500244},
pages = {1--27},
doi = {10.1142/s0218194024500244}
}