By James Mannion Computer Systems Lab Period 3

Slides:



Advertisements
Similar presentations
1 CS 385 Fall 2006 Chapter 4 Heuristic Search. 2 Heuristics eurisko ("I discover" in Greek) "the study of the methods and rules of discovery and invention."
Advertisements

Adversarial Search Reference: “Artificial Intelligence: A Modern Approach, 3 rd ed” (Russell and Norvig)
The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems.
Artificial Intelligence for Games Game playing Patrick Olivier
Computers Playing Games Arif Zaman CS 101. Acknowledgements Portions of this are taken from MIT’s open-courseware
The Implementation of Machine Learning in the Game of Checkers Billy Melicher Computer Systems lab
How Computers Play Chess Peter Barnum November 15, 2007 Artificial Intelligence 101.
Game Playing CSC361 AI CSC361: Game Playing.
Adversarial Search: Game Playing Reading: Chess paper.
Othello Sean Farrell June 29, Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup.
Group 1 : Ashutosh Pushkar Ameya Sudhir From. Motivation  Game playing was one of the first tasks undertaken in AI  Study of games brings us closer.
Hex Combinatorial Search in Game Strategy by Brandon Risberg May 2006Menlo School.
PSU CS 370 – Introduction to Artificial Intelligence Game MinMax Alpha-Beta.
Lecture 5 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.
Artificial Intelligence
Evaluation Function in Game Playing Programs M1 Yasubumi Nozawa Chikayama & Taura Lab.
Othello Playing AI Matt Smith. Othello 8x8 Board game 8x8 Board game Try to outflank opponents pieces Try to outflank opponents pieces Winner ends up.
1 Adversarial Search CS 171/271 (Chapter 6) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Machine Learning for an Artificial Intelligence Playing Tic-Tac-Toe Computer Systems Lab 2005 By Rachel Miller.
A Naïve Treatment of Self- Contained Digital Sentience in Pursuit of Erudition in a Stochastic Game By Robert Brady Computer Systems Research
Jack Chen TJHSST Computer Systems Lab Abstract The purpose of this project is to explore Artificial Intelligence techniques in the board game.
CSCI 4310 Lecture 6: Adversarial Tree Search. Book Winston Chapter 6.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 9 of 42 Wednesday, 14.
DEEP RED An Intelligent Approach to Chinese Checkers.
Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.
The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Program By James Mannion Computer Systems.
Parallel Programming in Chess Simulations Tyler Patton.
George F Luger ARTIFICIAL INTELLIGENCE 5th edition Structures and Strategies for Complex Problem Solving HEURISTIC SEARCH Luger: Artificial Intelligence,
Lecture 4 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.
Adversarial Search and Game Playing Russell and Norvig: Chapter 6 Slides adapted from: robotics.stanford.edu/~latombe/cs121/2004/home.htm Prof: Dekang.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
Artificial Intelligence AIMA §5: Adversarial Search
Game Playing Why do AI researchers study game playing?
Artificial Intelligence
Instructor: Vincent Conitzer
Stochastic tree search and stochastic games
CONTENTS 1. Introduction 2. The Basic Checker-playing Program
AI Classnotes #5, John Shieh, 2012
Iterative Deepening A*
Next Level Tic-Tac-Toe
CS 460 Spring 2011 Lecture 4.
Done Done Course Overview What is AI? What are the Major Challenges?
Adversarial Search and Game Playing (Where making good decisions requires respecting your opponent) R&N: Chap. 6.
Pengantar Kecerdasan Buatan
The Implementation of Machine Learning in the Game of Checkers
Othello Artificial Intelligence With Machine Learning
The Implementation of Machine Learning in the Game of Checkers
Adversarial Search Chapter 5.
Dakota Ewigman Jacob Zimmermann
Artificial Intelligence
Artificial Intelligence for Engineers
Adversarial Search.
Artificial Intelligence
NIM - a two person game n objects are in one pile
Artificial Intelligence
Instructor: Vincent Conitzer
Introduction to Artificial Intelligence Lecture 9: Two-Player Games I
Pruned Search Strategies
Adversarial Search and Game Playing
Mini-Max search Alpha-Beta pruning General concerns on games
Introduction to Artificial Intelligence
Artificial Intelligence
Adversarial Search CMPT 420 / CMPG 720.
Introduction to Artificial Intelligence
Adversarial Search CS 171/271 (Chapter 6)
Games & Adversarial Search
CS51A David Kauchak Spring 2019
Othello Artificial Intelligence With Machine Learning
Adversarial Search Chapter 6 Section 1 – 4.
Presentation transcript:

By James Mannion Computer Systems Lab 08-09 Period 3 The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems Lab 08-09 Period 3

Abstract Searching through large sets of data Complex, vast domains Heuristic searches Chess Evaluation Function Machine Learning

Introduction Simple domains, simple heuristics The domain of chess Deep Blue – brute force Looking at 30^6 moves before making the first Supercomputer Too many calculations Not efficient

Introduction (cont’d) Minimax search Alpha-beta pruning Only look 2-3 moves into the future Estimate strength of position Evaluation function Can improve heuristic by learning

Introduction (cont’d) Seems simple, but can become quite complex. Chess masters spend careers learning how to “evaluate” moves Purpose: can a computer learn a good evaluation function?

Background Claude Shannon, 1950 Brute force would take too long Discusses evaluation function 2-ply algorithm, but looks further into the future for moves that could lead to checkmate Possibility of learning in distant future

Development Python Stage 1: Text based chess game Two humans input their moves Illegal moves not allowed

Development (cont’d)

Development (cont’d)

Development (cont’d)

Development (cont’d) Stage 2: Introduce a computer player 2-3 ply Evaluation function will start out such that choices are based on a simple piece- differential where each piece is waited equally

Development (cont’d) Stage 3: Learning Temporal Difference Learning Weight adjustment: w_i < − − w_i + a((n_ic − n_ip)/(n_ic)) Heuristic function: h = c_1(p_1) + c_2(p_2) + c_3(p_3) + c_4(p_4) + c_5(p_5) Piece values: p-i = Sum(w_i) – Sum(b_i) over i

Testing Learning vs No Learning Two equal, piece-differential players pitted against each other. One will have the ability to learn Thousands of games Win-loss differential tracked over the length of the test By the end, the learner should be winning significantly more games.

Data

Data (cont'd)

References Shannon, Claude. “Programming a Computer for Playing Chess.” 1950 Beal, D.F., Smith, M.C. “Temporal Difference Learning for Heuristic Search and Game Playing.” 1999 Moriarty, David E., Miikkulainen, Risto. “Discovering Complex Othello Strategies Through Evolutionary Neural Networks.” Huang, Shiu-li, Lin, Fu-ren. “Using Temporal- Difference Learning for Multi-Agent Bargaining.” 2007 Russell, Stuart, Norvig, Peter. Artificial Intelligence: A Modern Approach. Second Edition. 2003. Asgharbeygi, Nima, Stracuzzi, David and Langley, Pat.“Relational Temporal Difference Learning”.