volume 550 issue 7676 pages 354-359

Mastering the game of Go without human knowledge

David Silver 1
Julian Schrittwieser 1
Karen Simonyan 1
Ioannis Antonoglou 1
Aja Huang 1
ARTHUR GUEZ 1
Thomas Hübert 1
Lucas Baker 1
Matthew Lai 1
Adrian Bolton 1
Yutian Chen 1
Timothy Lillicrap 1
Hui Fan 1
Laurent Sifre 1
George Van Den Driessche 1
Thore Graepel 1
Demis Hassabis 1
Publication typeJournal Article
Publication date2017-10-17
scimago Q1
wos Q1
SJR18.288
CiteScore78.1
Impact factor48.5
ISSN00280836, 14764687
PubMed ID:  29052630
Multidisciplinary
Abstract
A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo’s own move selections and also the winner of AlphaGo’s games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo. Starting from zero knowledge and without human data, AlphaGo Zero was able to teach itself to play Go and to develop novel strategies that provide new insights into the oldest of games. To beat world champions at the game of Go, the computer program AlphaGo has relied largely on supervised learning from millions of human expert moves. David Silver and colleagues have now produced a system called AlphaGo Zero, which is based purely on reinforcement learning and learns solely from self-play. Starting from random moves, it can reach superhuman level in just a couple of days of training and five million games of self-play, and can now beat all previous versions of AlphaGo. Because the machine independently discovers the same fundamental principles of the game that took humans millennia to conceptualize, the work suggests that such principles have some universal character, beyond human bias.
Found 
Found 

Top-30

Journals

50
100
150
200
250
Lecture Notes in Computer Science
244 publications, 3.98%
IEEE Access
151 publications, 2.46%
IEEE Transactions on Neural Networks and Learning Systems
62 publications, 1.01%
Neurocomputing
48 publications, 0.78%
Communications in Computer and Information Science
48 publications, 0.78%
Applied Sciences (Switzerland)
45 publications, 0.73%
IEEE Transactions on Games
45 publications, 0.73%
Scientific Reports
44 publications, 0.72%
IEEE Transactions on Intelligent Transportation Systems
39 publications, 0.64%
Expert Systems with Applications
36 publications, 0.59%
Neural Networks
35 publications, 0.57%
Lecture Notes in Networks and Systems
34 publications, 0.55%
Applied Intelligence
33 publications, 0.54%
SSRN Electronic Journal
33 publications, 0.54%
Sensors
32 publications, 0.52%
Knowledge-Based Systems
31 publications, 0.51%
Electronics (Switzerland)
29 publications, 0.47%
Nature
28 publications, 0.46%
Lecture Notes in Electrical Engineering
27 publications, 0.44%
IEEE Transactions on Pattern Analysis and Machine Intelligence
25 publications, 0.41%
Nature Communications
22 publications, 0.36%
Nature Machine Intelligence
22 publications, 0.36%
Information Sciences
22 publications, 0.36%
Engineering Applications of Artificial Intelligence
21 publications, 0.34%
IFAC-PapersOnLine
20 publications, 0.33%
Advanced Intelligent Systems
20 publications, 0.33%
IEEE Internet of Things Journal
20 publications, 0.33%
Neural Computing and Applications
19 publications, 0.31%
Journal of Physics: Conference Series
19 publications, 0.31%
50
100
150
200
250

Publishers

200
400
600
800
1000
1200
1400
1600
1800
2000
Institute of Electrical and Electronics Engineers (IEEE)
1831 publications, 29.85%
Springer Nature
1237 publications, 20.16%
Elsevier
915 publications, 14.91%
MDPI
283 publications, 4.61%
Wiley
233 publications, 3.8%
Association for Computing Machinery (ACM)
172 publications, 2.8%
Frontiers Media S.A.
94 publications, 1.53%
IOP Publishing
94 publications, 1.53%
Taylor & Francis
79 publications, 1.29%
American Chemical Society (ACS)
69 publications, 1.12%
Oxford University Press
67 publications, 1.09%
Cold Spring Harbor Laboratory
62 publications, 1.01%
SAGE
54 publications, 0.88%
American Physical Society (APS)
50 publications, 0.81%
AIP Publishing
48 publications, 0.78%
Hindawi Limited
39 publications, 0.64%
Cambridge University Press
38 publications, 0.62%
American Institute of Aeronautics and Astronautics (AIAA)
34 publications, 0.55%
American Association for the Advancement of Science (AAAS)
33 publications, 0.54%
Social Science Electronic Publishing
28 publications, 0.46%
Public Library of Science (PLoS)
26 publications, 0.42%
IGI Global
24 publications, 0.39%
Royal Society of Chemistry (RSC)
23 publications, 0.37%
Institution of Engineering and Technology (IET)
22 publications, 0.36%
World Scientific
20 publications, 0.33%
Institute for Operations Research and the Management Sciences (INFORMS)
19 publications, 0.31%
Optica Publishing Group
18 publications, 0.29%
IOS Press
16 publications, 0.26%
Science in China Press
15 publications, 0.24%
200
400
600
800
1000
1200
1400
1600
1800
2000
  • We do not take into account publications without a DOI.
  • Statistics recalculated weekly.

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Metrics
6.1k
Share
Cite this
GOST |
Cite this
GOST Copy
Silver D. et al. Mastering the game of Go without human knowledge // Nature. 2017. Vol. 550. No. 7676. pp. 354-359.
GOST all authors (up to 50) Copy
Silver D., Schrittwieser J., Simonyan K., Antonoglou I., Huang A., GUEZ A., Hübert T., Baker L., Lai M., Bolton A., Chen Y., Lillicrap T., Fan H., Sifre L., Van Den Driessche G., Graepel T., Hassabis D. Mastering the game of Go without human knowledge // Nature. 2017. Vol. 550. No. 7676. pp. 354-359.
RIS |
Cite this
RIS Copy
TY - JOUR
DO - 10.1038/nature24270
UR - https://doi.org/10.1038/nature24270
TI - Mastering the game of Go without human knowledge
T2 - Nature
AU - Silver, David
AU - Schrittwieser, Julian
AU - Simonyan, Karen
AU - Antonoglou, Ioannis
AU - Huang, Aja
AU - GUEZ, ARTHUR
AU - Hübert, Thomas
AU - Baker, Lucas
AU - Lai, Matthew
AU - Bolton, Adrian
AU - Chen, Yutian
AU - Lillicrap, Timothy
AU - Fan, Hui
AU - Sifre, Laurent
AU - Van Den Driessche, George
AU - Graepel, Thore
AU - Hassabis, Demis
PY - 2017
DA - 2017/10/17
PB - Springer Nature
SP - 354-359
IS - 7676
VL - 550
PMID - 29052630
SN - 0028-0836
SN - 1476-4687
ER -
BibTex |
Cite this
BibTex (up to 50 authors) Copy
@article{2017_Silver,
author = {David Silver and Julian Schrittwieser and Karen Simonyan and Ioannis Antonoglou and Aja Huang and ARTHUR GUEZ and Thomas Hübert and Lucas Baker and Matthew Lai and Adrian Bolton and Yutian Chen and Timothy Lillicrap and Hui Fan and Laurent Sifre and George Van Den Driessche and Thore Graepel and Demis Hassabis},
title = {Mastering the game of Go without human knowledge},
journal = {Nature},
year = {2017},
volume = {550},
publisher = {Springer Nature},
month = {oct},
url = {https://doi.org/10.1038/nature24270},
number = {7676},
pages = {354--359},
doi = {10.1038/nature24270}
}
MLA
Cite this
MLA Copy
Silver, David, et al. “Mastering the game of Go without human knowledge.” Nature, vol. 550, no. 7676, Oct. 2017, pp. 354-359. https://doi.org/10.1038/nature24270.