AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!

1997, AI named “Deep Blue” beat chess world champion.

Mastering the game of Go with deep neural networks and tree search In Nature, Januray, 2016 Google DeepMind

Key Points Given current game state, – how to decide next action “a”? – How to evaluate each legal action?

Monte Carlo Tree Search (MCTS) A popular heuristic search algorithm for game play – By lots of simulations and select the most visited action. Evaluation Node value: Wining rate Key point: how to calculate this value One playout to simulate the game

Monte Carlo Tree Search (MCTS) AlphaGo – Key point: how to calculate the value for each action (path)

The visited number for this edge The mean value of all simulations passing through it. Winning rateWinning rate predicted by fast rollout playoutWinning rate predicted by deep reinforcement learning Action probability predicted by supervised and reinforcement learning

30 million games 30 million self-play games

Now AlphaGo will face its ultimate challenge: a 5-game challenge match in Seoul against the legendary Lee Sedol, the top Go player in the world over the past decade. Lee Sedol March 9 th, 10 th, 12 th, 13 th, 15 th

AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!

Similar presentations

Presentation on theme: "AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!

Similar presentations

Presentation on theme: "AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!"— Presentation transcript:

Similar presentations

About project

Feedback