Mastering the game of Go with deep neural networks and tree search (Chinese translation)

Source: Internet
Author: User

Http://pan.baidu.com/s/1hr3kxog

http://download.csdn.net/detail/nehemiah666/9472669


This paper was published in Nature. I translated it into Chinese and recorded a narrated video explaining how AlphaGo works, as a summary of its working principles.

Here is the abstract:

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence, owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach that uses value networks to evaluate board positions and policy networks to select moves. These deep neural networks are trained by a novel combination of supervised learning (from human expert games) and reinforcement learning (from games of self-play). Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search (MCTS) programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, AlphaGo achieved a 99.8% winning rate against other Go programs and defeated the European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.
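The abstract only states that the search combines Monte Carlo simulation with the value and policy networks. As a rough illustration of how such a combination can look, here is a minimal Python sketch, not AlphaGo's actual implementation. It assumes a per-move statistics table, a prior probability from a policy network, and a leaf value that blends a value-network estimate with a rollout outcome. The function names, the dictionary layout, and the constants `lam` and `c_puct` are all hypothetical choices made for this example.

```python
import math


def mixed_leaf_value(value_net_estimate, rollout_outcome, lam=0.5):
    """Blend a value-network estimate with a Monte Carlo rollout outcome.

    lam is an assumed mixing weight: 0 uses only the value network,
    1 uses only the rollout result.
    """
    return (1.0 - lam) * value_net_estimate + lam * rollout_outcome


def select_move(stats, c_puct=5.0):
    """Pick the move maximizing Q + u.

    stats maps each move to a dict with (hypothetical) keys:
      prior       - probability assigned by the policy network
      visits      - how many simulations passed through this move
      total_value - sum of leaf values backed up through this move
    Q is the mean backed-up value; u is an exploration bonus that
    favors moves with a high prior and few visits.
    """
    total_visits = sum(s["visits"] for s in stats.values())

    def score(move):
        s = stats[move]
        q = s["total_value"] / s["visits"] if s["visits"] else 0.0
        u = c_puct * s["prior"] * math.sqrt(total_visits) / (1 + s["visits"])
        return q + u

    return max(stats, key=score)


# Example: move "b" has never been visited but has a high prior,
# so the exploration bonus makes the search try it next.
stats = {
    "a": {"prior": 0.1, "visits": 10, "total_value": 6.0},
    "b": {"prior": 0.9, "visits": 0, "total_value": 0.0},
}
next_move = select_move(stats)
```

In this sketch the policy network narrows the search toward promising moves, while the value network (mixed with rollouts) replaces the need to play every simulation to the end of the game.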

