A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games
Publication type: Journal Article
Publication date: 2021-07-16
scimago Q1
wos Q1
SJR: 0.861
CiteScore: 8.6
Impact factor: 4.6
ISSN: 20952228, 20952236
Theoretical Computer Science
General Computer Science
Abstract
Solving the optimization problem to approach a Nash Equilibrium point plays an important role in imperfect information games, e.g., StarCraft and poker. Neural Fictitious Self-Play (NFSP) is an effective algorithm that learns approximate Nash Equilibrium of imperfect-information games from purely self-play without prior domain knowledge. However, it needs to train a neural network in an off-policy manner to approximate the action values. For games with large search spaces, the training may suffer from unnecessary exploration and sometimes fails to converge. In this paper, we propose a new Neural Fictitious Self-Play algorithm that combines Monte Carlo tree search with NFSP, called MC-NFSP, to improve the performance in real-time zero-sum imperfect-information games. With experiments and empirical analysis, we demonstrate that the proposed MC-NFSP algorithm can approximate Nash Equilibrium in games with large-scale search depth while the NFSP can not. Furthermore, we develop an Asynchronous Neural Fictitious Self-Play framework (ANFSP). It uses asynchronous and parallel architecture to collect game experience and improve both the training efficiency and policy quality. The experiments with th e games with hidden state information (Texas Hold’em), and the FPS (firstperson shooter) games demonstrate effectiveness of our algorithms.
Found
Nothing found, try to update filter.
Found
Nothing found, try to update filter.
Top-30
Journals
|
1
|
|
|
Neural Processing Letters
1 publication, 8.33%
|
|
|
Electronics (Switzerland)
1 publication, 8.33%
|
|
|
Communications in Computer and Information Science
1 publication, 8.33%
|
|
|
Lecture Notes in Electrical Engineering
1 publication, 8.33%
|
|
|
IEEE Transactions on Systems, Man, and Cybernetics: Systems
1 publication, 8.33%
|
|
|
Chinese Journal of Aeronautics
1 publication, 8.33%
|
|
|
Nexus
1 publication, 8.33%
|
|
|
1
|
Publishers
|
1
2
3
4
5
|
|
|
Institute of Electrical and Electronics Engineers (IEEE)
5 publications, 41.67%
|
|
|
Springer Nature
3 publications, 25%
|
|
|
Elsevier
2 publications, 16.67%
|
|
|
MDPI
1 publication, 8.33%
|
|
|
Association for Computing Machinery (ACM)
1 publication, 8.33%
|
|
|
1
2
3
4
5
|
- We do not take into account publications without a DOI.
- Statistics recalculated weekly.
Are you a researcher?
Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
12
Total citations:
12
Citations from 2025:
2
(16.66%)
Cite this
GOST |
RIS |
BibTex
Cite this
GOST
Copy
Zhang L. et al. A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games // Frontiers of Computer Science. 2021. Vol. 15. No. 5. 155334
GOST all authors (up to 50)
Copy
Zhang L., Chen Y., Wang W., Han Z., Li S., PAN Z., Pan G. A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games // Frontiers of Computer Science. 2021. Vol. 15. No. 5. 155334
Cite this
RIS
Copy
TY - JOUR
DO - 10.1007/s11704-020-9307-6
UR - https://doi.org/10.1007/s11704-020-9307-6
TI - A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games
T2 - Frontiers of Computer Science
AU - Zhang, Li
AU - Chen, Yuxuan
AU - Wang, Wei
AU - Han, Ziliang
AU - Li, Shijian
AU - PAN, ZHIJIE
AU - Pan, Gang
PY - 2021
DA - 2021/07/16
PB - Springer Nature
IS - 5
VL - 15
SN - 2095-2228
SN - 2095-2236
ER -
Cite this
BibTex (up to 50 authors)
Copy
@article{2021_Zhang,
author = {Li Zhang and Yuxuan Chen and Wei Wang and Ziliang Han and Shijian Li and ZHIJIE PAN and Gang Pan},
title = {A Monte Carlo Neural Fictitious Self-Play approach to approximate Nash Equilibrium in imperfect-information dynamic games},
journal = {Frontiers of Computer Science},
year = {2021},
volume = {15},
publisher = {Springer Nature},
month = {jul},
url = {https://doi.org/10.1007/s11704-020-9307-6},
number = {5},
pages = {155334},
doi = {10.1007/s11704-020-9307-6}
}