Mastering the game of Go without human knowledge

doi:10.1038/nature24270

Nature. 2017 Oct 18;550(7676):354-359.

David Silver¹, Julian Schrittwieser¹, Karen Simonyan¹, Ioannis Antonoglou¹, Aja Huang¹, Arthur Guez¹, Thomas Hubert¹, Lucas Baker¹, Matthew Lai¹, Adrian Bolton¹, Yutian Chen¹, Timothy Lillicrap¹, Fan Hui¹, Laurent Sifre¹, George van den Driessche¹, Thore Graepel¹, Demis Hassabis¹

Affiliation

¹ DeepMind, 5 New Street Square, London EC4A 3TW, UK.

PMID: 29052630

Abstract

A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo's own move selections and also the winner of AlphaGo's games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100-0 against the previously published, champion-defeating AlphaGo.
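
The training signal described in the abstract (predict the search's own move selections and the winner of the game) can be illustrated with a small sketch. The abstract does not give the loss itself, so the form used here (cross-entropy on the move distribution plus squared error on the outcome, with no regularization term) and every name in the code (`self_play_loss`, `policy_logits`, `value_pred`, `search_probs`, `outcome`) are assumptions made for illustration, not the authors' implementation.

```python
import math


def self_play_loss(policy_logits, value_pred, search_probs, outcome):
    """Illustrative combined loss for one self-play position.

    policy_logits: raw network scores, one per candidate move (assumed input)
    value_pred:    network's predicted outcome in [-1, 1] (assumed input)
    search_probs:  move distribution chosen by the tree search (training target)
    outcome:       +1 if the player to move went on to win, -1 otherwise
    """
    # Log-softmax over the move logits, shifted by the max for stability.
    shift = max(policy_logits)
    log_norm = math.log(sum(math.exp(x - shift) for x in policy_logits))
    log_probs = [x - shift - log_norm for x in policy_logits]

    # Cross-entropy between the search's move selections and the network policy.
    policy_loss = -sum(p * lp for p, lp in zip(search_probs, log_probs))

    # Squared error between the predicted and the actual game winner.
    value_loss = (outcome - value_pred) ** 2

    return policy_loss + value_loss


if __name__ == "__main__":
    # Uniform policy over three moves, search prefers move 0, player wins.
    print(self_play_loss([0.0, 0.0, 0.0], 0.0, [1.0, 0.0, 0.0], 1.0))
```

Minimizing a loss of this kind pulls the network's move probabilities toward the search's choices and its value estimate toward the actual result, which is the sense in which the program acts as its own teacher.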

Comment in

Artificial intelligence: Learning to play Go from scratch.
Singh S, Okun A, Jackson A. Nature. 2017 Oct 18;550(7676):336-337. doi: 10.1038/550336a. PMID: 29052631. No abstract available.

Cited by

Untangling the complexity of multimorbidity with machine learning.
Hassaine A, Salimi-Khorshidi G, Canoy D, Rahimi K. Mech Ageing Dev. 2020 Sep;190:111325. doi: 10.1016/j.mad.2020.111325. Epub 2020 Aug 6. PMID: 32768443. Free PMC article. Review.
Consistency of Medical Data Using Intelligent Neuron Faster R-CNN Algorithm for Smart Health Care Application.
Kim SK, Huh JH. Healthcare (Basel). 2020 Jun 25;8(2):185. doi: 10.3390/healthcare8020185. PMID: 32630436. Free PMC article.
Digital Normativity: A Challenge for Human Subjectivation.
Fourneret E, Yvert B. Front Artif Intell. 2020 Apr 28;3:27. doi: 10.3389/frai.2020.00027. eCollection 2020. PMID: 33733146. Free PMC article. No abstract available.
Unanswerable Questions About Images and Texts.
Davis E. Front Artif Intell. 2020 Jul 29;3:51. doi: 10.3389/frai.2020.00051. eCollection 2020. PMID: 33733168. Free PMC article.
Accurate Imputation of Greenhouse Environment Data for Data Integrity Utilizing Two-Dimensional Convolutional Neural Networks.
Moon T, Lee JW, Son JE. Sensors (Basel). 2021 Mar 20;21(6):2187. doi: 10.3390/s21062187. PMID: 33804781. Free PMC article.
