By James Mannion Computer Systems Lab Period 3

Slides:

Advertisements

Similar presentations

1 CS 385 Fall 2006 Chapter 4 Heuristic Search. 2 Heuristics eurisko ("I discover" in Greek) "the study of the methods and rules of discovery and invention."

Advertisements

Adversarial Search Reference: “Artificial Intelligence: A Modern Approach, 3 rd ed” (Russell and Norvig)

The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems.

Artificial Intelligence for Games Game playing Patrick Olivier

Computers Playing Games Arif Zaman CS 101. Acknowledgements Portions of this are taken from MIT’s open-courseware

The Implementation of Machine Learning in the Game of Checkers Billy Melicher Computer Systems lab

How Computers Play Chess Peter Barnum November 15, 2007 Artificial Intelligence 101.

Game Playing CSC361 AI CSC361: Game Playing.

Adversarial Search: Game Playing Reading: Chess paper.

Othello Sean Farrell June 29, Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup.

Group 1 : Ashutosh Pushkar Ameya Sudhir From. Motivation  Game playing was one of the first tasks undertaken in AI  Study of games brings us closer.

Hex Combinatorial Search in Game Strategy by Brandon Risberg May 2006Menlo School.

PSU CS 370 – Introduction to Artificial Intelligence Game MinMax Alpha-Beta.

Lecture 5 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.

Artificial Intelligence

Evaluation Function in Game Playing Programs M1 Yasubumi Nozawa Chikayama & Taura Lab.

Othello Playing AI Matt Smith. Othello 8x8 Board game 8x8 Board game Try to outflank opponents pieces Try to outflank opponents pieces Winner ends up.

1 Adversarial Search CS 171/271 (Chapter 6) Some text and images in these slides were drawn from Russel & Norvig’s published material.

Machine Learning for an Artificial Intelligence Playing Tic-Tac-Toe Computer Systems Lab 2005 By Rachel Miller.

A Naïve Treatment of Self- Contained Digital Sentience in Pursuit of Erudition in a Stochastic Game By Robert Brady Computer Systems Research

Jack Chen TJHSST Computer Systems Lab Abstract The purpose of this project is to explore Artificial Intelligence techniques in the board game.

CSCI 4310 Lecture 6: Adversarial Tree Search. Book Winston Chapter 6.

Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 9 of 42 Wednesday, 14.

DEEP RED An Intelligent Approach to Chinese Checkers.

Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.

The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Program By James Mannion Computer Systems.

Parallel Programming in Chess Simulations Tyler Patton.

George F Luger ARTIFICIAL INTELLIGENCE 5th edition Structures and Strategies for Complex Problem Solving HEURISTIC SEARCH Luger: Artificial Intelligence,

Lecture 4 Note: Some slides and/or pictures are adapted from Lecture slides / Books of Dr Zafar Alvi. Text Book - Aritificial Intelligence Illuminated.

Adversarial Search and Game Playing Russell and Norvig: Chapter 6 Slides adapted from: robotics.stanford.edu/~latombe/cs121/2004/home.htm Prof: Dekang.

Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.

Artificial Intelligence AIMA §5: Adversarial Search

Game Playing Why do AI researchers study game playing?

Artificial Intelligence

Instructor: Vincent Conitzer

Stochastic tree search and stochastic games

CONTENTS 1. Introduction 2. The Basic Checker-playing Program

AI Classnotes #5, John Shieh, 2012

Iterative Deepening A*

Next Level Tic-Tac-Toe

CS 460 Spring 2011 Lecture 4.

Done Done Course Overview What is AI? What are the Major Challenges?

Adversarial Search and Game Playing (Where making good decisions requires respecting your opponent) R&N: Chap. 6.

Pengantar Kecerdasan Buatan

The Implementation of Machine Learning in the Game of Checkers

Othello Artificial Intelligence With Machine Learning

The Implementation of Machine Learning in the Game of Checkers

Adversarial Search Chapter 5.

Dakota Ewigman Jacob Zimmermann

Artificial Intelligence

Artificial Intelligence for Engineers

Adversarial Search.

Artificial Intelligence

NIM - a two person game n objects are in one pile

Artificial Intelligence

Instructor: Vincent Conitzer

Introduction to Artificial Intelligence Lecture 9: Two-Player Games I

Pruned Search Strategies

Adversarial Search and Game Playing

Mini-Max search Alpha-Beta pruning General concerns on games

Introduction to Artificial Intelligence

Artificial Intelligence

Adversarial Search CMPT 420 / CMPG 720.

Introduction to Artificial Intelligence

Adversarial Search CS 171/271 (Chapter 6)

Games & Adversarial Search

CS51A David Kauchak Spring 2019

Othello Artificial Intelligence With Machine Learning

Adversarial Search Chapter 6 Section 1 – 4.

Presentation transcript:

By James Mannion Computer Systems Lab 08-09 Period 3 The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems Lab 08-09 Period 3

Abstract Searching through large sets of data Complex, vast domains Heuristic searches Chess Evaluation Function Machine Learning

Introduction Simple domains, simple heuristics The domain of chess Deep Blue – brute force Looking at 30^6 moves before making the first Supercomputer Too many calculations Not efficient

Introduction (cont’d) Minimax search Alpha-beta pruning Only look 2-3 moves into the future Estimate strength of position Evaluation function Can improve heuristic by learning

Introduction (cont’d) Seems simple, but can become quite complex. Chess masters spend careers learning how to “evaluate” moves Purpose: can a computer learn a good evaluation function?

Background Claude Shannon, 1950 Brute force would take too long Discusses evaluation function 2-ply algorithm, but looks further into the future for moves that could lead to checkmate Possibility of learning in distant future

Development Python Stage 1: Text based chess game Two humans input their moves Illegal moves not allowed

Development (cont’d)

Development (cont’d)

Development (cont’d)

Development (cont’d) Stage 2: Introduce a computer player 2-3 ply Evaluation function will start out such that choices are based on a simple piece- differential where each piece is waited equally

Development (cont’d) Stage 3: Learning Temporal Difference Learning Weight adjustment: w_i < − − w_i + a((n_ic − n_ip)/(n_ic)) Heuristic function: h = c_1(p_1) + c_2(p_2) + c_3(p_3) + c_4(p_4) + c_5(p_5) Piece values: p-i = Sum(w_i) – Sum(b_i) over i

Testing Learning vs No Learning Two equal, piece-differential players pitted against each other. One will have the ability to learn Thousands of games Win-loss differential tracked over the length of the test By the end, the learner should be winning significantly more games.

Data

Data (cont'd)

References Shannon, Claude. “Programming a Computer for Playing Chess.” 1950 Beal, D.F., Smith, M.C. “Temporal Difference Learning for Heuristic Search and Game Playing.” 1999 Moriarty, David E., Miikkulainen, Risto. “Discovering Complex Othello Strategies Through Evolutionary Neural Networks.” Huang, Shiu-li, Lin, Fu-ren. “Using Temporal- Difference Learning for Multi-Agent Bargaining.” 2007 Russell, Stuart, Norvig, Peter. Artificial Intelligence: A Modern Approach. Second Edition. 2003. Asgharbeygi, Nima, Stracuzzi, David and Langley, Pat.“Relational Temporal Difference Learning”.