Videos NYT Video: DeepMind's alphaGo: Match 4 Summary: see 11 min.

Slides:



Advertisements
Similar presentations
Adversarial Search Chapter 6 Sections 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Advertisements

Computational Intelligence: Some Insights from Board Games Fernand Gobet Brunel University.
Adversarial Search Chapter 6 Section 1 – 4. Types of Games.
Adversarial Search Reference: “Artificial Intelligence: A Modern Approach, 3 rd ed” (Russell and Norvig)
CS 484 – Artificial Intelligence
Adversarial Search Chapter 5.
G5AIAI Introduction to AI Graham Kendall Game Playing Garry Kasparov and Deep Blue. © 1997, GM Gabriel Schwartzman's Chess Camera, courtesy IBM.
The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems.
Artificial Intelligence for Games Game playing Patrick Olivier
Games CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Game Playing in the Real World Oct 8 th : Uncertainty and Probabilistic reasoning Oct 10th: How should we define artificial intelligence? Reading for Oct.
Monte Carlo Go Has a Way to Go Haruhiro Yoshimoto (*1) Kazuki Yoshizoe (*1) Tomoyuki Kaneko (*1) Akihiro Kishimoto (*2) Kenjiro Taura (*1) (*1)University.
double AlphaBeta(state, depth, alpha, beta) begin if depth
Game Playing in the Real World Next time: Wrap-up on Game playing and Knowledge Representation Reading: IBM’s Deep Blue Chess Grandmaster Chips, Feng-hsiung.
ICS-270a:Notes 5: 1 Notes 5: Game-Playing ICS 270a Winter 2003.
1 Reinforcement Learning: Learning algorithms Function Approximation Yishay Mansour Tel-Aviv University.
Go An ancient Oriental board game Andrew Simons. Introduction 2 player game of skill. Popular in the Far East, growing in the West. Simple rules, extremely.
Development of a Machine-Learning-Based AI For Go By Justin Park.
1 Adversarial Search CS 171/271 (Chapter 6) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Introduction to Machine Learning Kamal Aboul-Hosn Cornell University Chess, Chinese Rooms, and Learning.
Artificial Intelligence
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module 5 Adversarial Search (Thanks Meinolf Sellman!)
Deep Learning and Deep Reinforcement Learning. Topics 1.Deep learning with convolutional neural networks 2.Learning to play Atari video games with Deep.
AI: AlphaGo European champion : Fan Hui A feat previously thought to be at least a decade away!!!
ConvNets for Image Classification
Adversarial Search Chapter 5 Sections 1 – 4. AI & Expert Systems© Dr. Khalid Kaabneh, AAU Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Understanding AlphaGo. Go Overview Originated in ancient China 2,500 years ago Two players game Goal - surround more territory than the opponent 19X19.
Reinforcement Learning
CS4341 Introduction to Artificial Intelligence
Stochastic tree search and stochastic games
4. Games and adversarial search
Machine Learning for Big Data
Do Computers think?.
Deep Reinforcement Learning
Artificial Intelligence
Bridge (and Card Games) in General
Status Report on Machine Learning
Games and adversarial search (Chapter 5)
I Know it’s an Idiot But it’s MY artificial idiot!
Adversarial Learning for Neural Dialogue Generation
Mastering the game of Go with deep neural network and tree search
AlphaGo with Deep RL Alpha GO.
Status Report on Machine Learning
Reinforcement Learning
CSC 110 – Fluency in Information Technology Chess
AlphaGo and learning methods
Deep reinforcement learning
AlphaGO from Google DeepMind in 2016, beat human grandmasters
CS 4700: Foundations of Artificial Intelligence
Adversarial Search Chapter 5.
A note given in BCC class on March 15, 2016
Solving Legal And Human Resource Problems With Artificial Intelligence
Artificial Intelligence – Deep Learning and its Applications
Artificial Intelligence
Adversarial Search.
The intelligent piece of paper: so what is an algorithm?
Emergence of Intelligent Machines: Challenges and Opportunities
How Computers, Software, and Robotics are Transforming the Economy
CS6700 Advanced AI Prof. Carla Gomes Prof. Bart Selman
Pruned Search Strategies
Artificial Intelligence and Future of Education
Artificial Intelligence
Notes 6: Two-Player Games and Search
Artificial Intelligence in Medicine
These neural networks take a description of the Go board as an input and process it through 12 different network layers containing millions of neuron-like.
Adversarial Search CS 171/271 (Chapter 6)
Introduction to Artificial Intelligence
A compressed Overview over Artificial Intelligence (AI)
Adversarial Search Chapter 6 Section 1 – 4.
Prabhas Chongstitvatana Chulalongkorn University
Presentation transcript:

Videos NYT Video: http://www.nytimes.com/video/us/100000004255656/the-terminator-and-the-washing-machine.html DeepMind's alphaGo: https://www.youtube.com/watch?v=g-dKXOlsf98 Match 4 Summary: see 11 min. https://www.youtube.com/watch?v=G5gJ-pVo1gs DQN: http://www.nature.com/nature/journal/v518/n7540/fig_tab/nature14236_SV2.html

Human vs. artificial intelligence In 1997, IBM's DeepBlue computer beat World Chess Champion, G. Kasparov. In March 2016, Google's DeepMind (alphaGo) played the game of Go against the World Go Champion, Sedol Lee. In this informal presentation, we will watch a few short videos that highlights some of the recent advances in artificial neural network algorithms (in particular, Deep Learning) and have a discussion about biological vs. artificial intelligence systems.

DeepBlue (1997)

IBM Watson on Jeopardy! (2011) https://www.youtube.com/watch?v=DywO4zksfXw (6 min)

Google's alphaGo (March, 2016) deepmind.com

DeepMind's alphaGo "Mastering the game of Go with deep neural networks and tree search" Nature Video: Computer that mastered Go https://www.youtube.com/watch?v=g-dKXOlsf98 (7 min)

Demis Hassabis Child prodigy Game design (Theme Park) CS (U Cambridge) Neuroscience (PhD, UCL) on hippocampus and imagination DeepMind (start-up) Google DeepMind ($650M?) Technology Review: "Google's intelligence designer"

alphaGo Deep Neural Network Supervised Learning Reinforcement Learning Monte Carlo Tree Search

Analogy y Find a line that minimizes the error (total distance between the data and line). x

y x

y x

y x

action of the expert human player Maximize likelihood. action of the expert human player 30 million positions from 160,000 games state of the game

SL: Try to predict the human expert, based on the game data. RL: Reward the change, if the action led to a win. [Silver et al., 2016]

Learning = Choosing the best = Optimization Objective Function Move in the "correct" direction (= gradient). w2 w1

alphaGo [Silver et al., 2016]

[Silver et al., 2016]

MC Tree Search efficient sampling (not a brute search) [Silver et al., 2016]

[Silver et al., 2016]

DeepMind's earlier project "Human-level control through deep reinforcement learning" Nature Video: Inside DeepMind https://www.youtube.com/watch?v=xN1d3qHMIEQ DQN playing a game. http://www.nature.com/nature/journal/v518/n7540/fig_tab/nature14236_SV2.html

The Terminator and the Washing Machine What the legendary matches between supercomputer Deep Blue and chess grandmaster Garry Kasparov reveal about today’s artificial intelligence and machine learning fears. NY Times Video http://www.nytimes.com/video/us/100000004255656/the-terminator-and-the-washing-machine.html

Predictions are hard...... 1970: "In from three to eight years we will have a machine with the general intelligence of an average human being."

"The end of an era, the beginning of another "The end of an era, the beginning of another? HAL, Deep Blue and Kasparov" by Stork It has been said that when computers become world champions, we will either: Think more of computers Think less of humans or Think less of the game of chess (2016, NYT Opinions) ======================================= Does AlphaGo Mean Artificial Intelligence Is the Real Deal? AlphaGo Will Enable Us To Enhance Human Capabilities AlphaGo’s Success Shows the Human Advantage Is Eroding Fast Where Computers Defeat Humans, and Where They Can’t AlphaGo’s Artificial Intelligence Can Only Be Extrapolated So Far