том 193 страницы 104944

PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm

Тип публикацииJournal Article
Дата публикации2024-11-01
scimago Q1
wos Q1
БС1
SJR0.978
CiteScore9.8
Impact factor4
ISSN07437315, 10960848
Краткое описание
Monte-Carlo Tree Search (MCTS) is an adaptive and heuristic tree-search algorithm designed to uncover sub-optimal actions at each decision-making point. This method progressively constructs a search tree by gathering samples throughout its execution. Predominantly applied within the realm of gaming, MCTS has exhibited exceptional achievements. Additionally, it has displayed promising outcomes when employed to solve NP-hard combinatorial optimization problems. MCTS has been adapted for distributed-memory parallel platforms. The primary challenges associated with distributed-memory parallel MCTS are the substantial communication overhead and the necessity to balance the computational load among various processes. In this work, we introduce a novel distributed-memory parallel MCTS algorithm with partial backpropagations, referred to as Parallel Partial-Backpropagation MCTS (PPB-MCTS). Our design approach aims to significantly reduce the communication overhead while maintaining, or even slightly improving, the performance in the context of combinatorial optimization problems. To address the communication overhead challenge, we propose a strategy involving transmitting an additional backpropagation message. This strategy avoids attaching an information table to the communication messages exchanged by the processes, thus reducing the communication overhead. Furthermore, this approach contributes to enhancing the decision-making accuracy during the selection phase. The load balancing issue is also effectively addressed by implementing a shared transposition table among the parallel processes. Furthermore, we introduce two primary methods for managing duplicate states within distributed-memory parallel MCTS, drawing upon techniques utilized in addressing duplicate states within sequential MCTS. Duplicate states can transform the conventional search tree into a Directed Acyclic Graph (DAG). To evaluate the performance of our proposed parallel algorithm, we conduct an extensive series of experiments on solving instances of the Job-Shop Scheduling Problem (JSSP) and the Weighted Set-Cover Problem (WSCP). These problems are recognized for their complexity and classified as NP-hard combinatorial optimization problems with considerable relevance within industrial applications. The experiments are performed on a cluster of computers with many cores. The empirical results highlight the enhanced scalability of our algorithm compared to that of the existing distributed-memory parallel MCTS algorithms. As the number of processes increases, our algorithm demonstrates increased rollout efficiency while maintaining an improved load balance across processes.
Найдено 
Для доступа к списку цитирований публикации необходимо авторизоваться.

Топ-30

Журналы

1
Entropy
1 публикация, 25%
IEEE Robotics and Automation Letters
1 публикация, 25%
Energy Conversion and Management
1 публикация, 25%
1

Издатели

1
2
Institute of Electrical and Electronics Engineers (IEEE)
2 публикации, 50%
MDPI
1 публикация, 25%
Elsevier
1 публикация, 25%
1
2
  • Мы не учитываем публикации, у которых нет DOI.
  • Статистика публикаций обновляется еженедельно.

Вы ученый?

Создайте профиль, чтобы получать персональные рекомендации коллег, конференций и новых статей.
Метрики
4
Поделиться
Цитировать
ГОСТ |
Цитировать
Naderzadeh Y. et al. PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm // Journal of Parallel and Distributed Computing. 2024. Vol. 193. p. 104944.
ГОСТ со всеми авторами (до 50) Скопировать
Naderzadeh Y., Grosu D. S., Chinnam R. B. PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm // Journal of Parallel and Distributed Computing. 2024. Vol. 193. p. 104944.
RIS |
Цитировать
TY - JOUR
DO - 10.1016/j.jpdc.2024.104944
UR - https://linkinghub.elsevier.com/retrieve/pii/S0743731524001084
TI - PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm
T2 - Journal of Parallel and Distributed Computing
AU - Naderzadeh, Yashar
AU - Grosu, Daniel S.
AU - Chinnam, Ratna Babu
PY - 2024
DA - 2024/11/01
PB - Elsevier
SP - 104944
VL - 193
SN - 0743-7315
SN - 1096-0848
ER -
BibTex
Цитировать
BibTex (до 50 авторов) Скопировать
@article{2024_Naderzadeh,
author = {Yashar Naderzadeh and Daniel S. Grosu and Ratna Babu Chinnam},
title = {PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm},
journal = {Journal of Parallel and Distributed Computing},
year = {2024},
volume = {193},
publisher = {Elsevier},
month = {nov},
url = {https://linkinghub.elsevier.com/retrieve/pii/S0743731524001084},
pages = {104944},
doi = {10.1016/j.jpdc.2024.104944}
}