Mastering the game of Go without human knowledge
- PMID: 29052630
- DOI: 10.1038/nature24270
Abstract
A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo's own move selections and also the winner of AlphaGo's games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100-0 against the previously published, champion-defeating AlphaGo.
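The training target described in the abstract (predict the search's own move selections and the eventual winner) can be illustrated with a small sketch. This is not the authors' code: the function name `combined_loss`, the argument names, and the choice of squared error plus cross-entropy are assumptions made for illustration, and the published method may include further terms (for example, weight regularization) that the abstract does not mention.

```python
import math

def combined_loss(policy, search_probs, value, outcome, eps=1e-12):
    """Illustrative per-position loss (assumed form, not taken from the abstract).

    policy       -- network's predicted move probabilities
    search_probs -- move probabilities produced by the tree search (the target)
    value        -- network's predicted outcome, in [-1, 1]
    outcome      -- actual game result from the current player's view (+1 or -1)
    """
    # Value term: squared error between predicted and actual winner.
    value_loss = (outcome - value) ** 2
    # Policy term: cross-entropy between search target and network prediction.
    policy_loss = -sum(t * math.log(p + eps) for t, p in zip(search_probs, policy))
    return value_loss + policy_loss

# Hypothetical position with three legal moves.
loss = combined_loss(
    policy=[0.2, 0.5, 0.3],
    search_probs=[0.1, 0.7, 0.2],
    value=0.4,
    outcome=1.0,
)
print(f"loss = {loss:.4f}")
```

Minimizing a loss of this shape pulls the network's move probabilities toward those chosen by the search and its value estimate toward the observed game result, which is the "own teacher" loop the abstract describes.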
Comment in
- Artificial intelligence: Learning to play Go from scratch. Nature. 2017 Oct 18;550(7676):336-337. doi: 10.1038/550336a. PMID: 29052631. No abstract available.
Similar articles
- Mastering the game of Go with deep neural networks and tree search. Nature. 2016 Jan 28;529(7587):484-9. doi: 10.1038/nature16961. PMID: 26819042.
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science. 2018 Dec 7;362(6419):1140-1144. doi: 10.1126/science.aar6404. PMID: 30523106.
- Google AI algorithm masters ancient game of Go. Nature. 2016 Jan 28;529(7587):445-6. doi: 10.1038/529445a. PMID: 26819021. No abstract available.
- [Deep Learning and AlphaGo]. Brain Nerve. 2019 Jul;71(7):681-694. doi: 10.11477/mf.1416201340. PMID: 31289242. Review. Japanese.
- Looking to the future: Learning from experience, averting catastrophe. Neural Netw. 2019 Dec;120:5-8. doi: 10.1016/j.neunet.2019.09.018. Epub 2019 Oct 10. PMID: 31607596. Review.
Cited by
- Untangling the complexity of multimorbidity with machine learning. Mech Ageing Dev. 2020 Sep;190:111325. doi: 10.1016/j.mad.2020.111325. Epub 2020 Aug 6. PMID: 32768443. Free PMC article. Review.
- Consistency of Medical Data Using Intelligent Neuron Faster R-CNN Algorithm for Smart Health Care Application. Healthcare (Basel). 2020 Jun 25;8(2):185. doi: 10.3390/healthcare8020185. PMID: 32630436. Free PMC article.
- Digital Normativity: A Challenge for Human Subjectivation. Front Artif Intell. 2020 Apr 28;3:27. doi: 10.3389/frai.2020.00027. eCollection 2020. PMID: 33733146. Free PMC article. No abstract available.
- Unanswerable Questions About Images and Texts. Front Artif Intell. 2020 Jul 29;3:51. doi: 10.3389/frai.2020.00051. eCollection 2020. PMID: 33733168. Free PMC article.
- Accurate Imputation of Greenhouse Environment Data for Data Integrity Utilizing Two-Dimensional Convolutional Neural Networks. Sensors (Basel). 2021 Mar 20;21(6):2187. doi: 10.3390/s21062187. PMID: 33804781. Free PMC article.