Othello Sean Farrell June 29, 2015. Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup.

Slides:



Advertisements
Similar presentations
Adversarial Search Chapter 6 Sections 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Advertisements

METAGAMER: An Agent for Learning and Planning in General Games Barney Pell NASA Ames Research Center.
Adversarial Search Chapter 6 Section 1 – 4. Types of Games.
Constructing Complex NPC Behavior via Multi- Objective Neuroevolution Jacob Schrum – Risto Miikkulainen –
Games & Adversarial Search Chapter 5. Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent’s reply. Time.
Games & Adversarial Search
For Monday Read chapter 7, sections 1-4 Homework: –Chapter 4, exercise 1 –Chapter 5, exercise 9.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2008.
CS 484 – Artificial Intelligence
Adversarial Search Chapter 5.
COMP-4640: Intelligent & Interactive Systems Game Playing A game can be formally defined as a search problem with: -An initial state -a set of operators.
Adversarial Search: Game Playing Reading: Chapter next time.
The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Programme By James Mannion Computer Systems.
Artificial Intelligence for Games Game playing Patrick Olivier
Artificial Intelligence in Game Design Heuristics and Other Ideas in Board Games.
Games CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Computers Playing Games Arif Zaman CS 101. Acknowledgements Portions of this are taken from MIT’s open-courseware
1 Evolving Board Game Players Without Using Expert Knowledge A presentation of research by Amit Benbassat Advisor: Moshe Sipper. A. Benbassat and M. Sipper.
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
Game Playing CSC361 AI CSC361: Game Playing.
Reinforcement Learning of Local Shape in the Game of Atari-Go David Silver.
Computers Playing Games Arif Zaman CS 101. Acknowledgements Portions of this are taken from MIT’s open-courseware
How computers play games with you CS161, Spring ‘03 Nathan Sturtevant.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2006.
1 Game Playing Chapter 6 (supplement) Various deterministic board games Additional references for the slides: Luger’s AI book (2005). Robert Wilensky’s.
Adversarial Search: Game Playing Reading: Chess paper.
Optimizing a Chess Heuristic Using Evolutionary Algorithms Benjamin Rhew
Game Playing State-of-the-Art  Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in Used an endgame database defining.
Game Trees: MiniMax strategy, Tree Evaluation, Pruning, Utility evaluation Adapted from slides of Yoonsuck Choe.
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 2 Adapted from slides of Yoonsuck.
Game Playing.
1 Learning Agents Laboratory Computer Science Department George Mason University Prof. Gheorghe Tecuci 3. Rote Learning.
Introduction Many decision making problems in real life
Game Playing ECE457 Applied Artificial Intelligence Spring 2007 Lecture #5.
Computer Go : A Go player Rohit Gurjar CS365 Project Proposal, IIT Kanpur Guided By – Prof. Amitabha Mukerjee.
Evaluation Function in Game Playing Programs M1 Yasubumi Nozawa Chikayama & Taura Lab.
Game-playing AIs Part 1 CIS 391 Fall CSE Intro to AI 2 Games: Outline of Unit Part I (this set of slides)  Motivation  Game Trees  Evaluation.
Adversarial Search Chapter 6 Section 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Instructor: Vincent Conitzer
Othello Playing AI Matt Smith. Othello 8x8 Board game 8x8 Board game Try to outflank opponents pieces Try to outflank opponents pieces Winner ends up.
Game Playing. Towards Intelligence? Many researchers attacked “intelligent behavior” by looking to strategy games involving deep thought. Many researchers.
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
Senior Project Poster Day 2007, CIS Dept. University of Pennsylvania Reversi Meng Tran Faculty Advisor: Dr. Barry Silverman Strategies: l Corners t Corners.
CHECKERS: TD(Λ) LEARNING APPLIED FOR DETERMINISTIC GAME Presented By: Presented To: Amna Khan Mis Saleha Raza.
Jack Chen TJHSST Computer Systems Lab Abstract The purpose of this project is to explore Artificial Intelligence techniques in the board game.
Blondie24 Presented by Adam Duffy and Josh Hill. Overview Introduction to new concepts Design of Blondie24 Testing and results Other approaches to checkers.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 9 of 42 Wednesday, 14.
Games as adversarial search problems Dynamic state space search.
Quoridor and Artificial Intelligence
The Implementation of Artificial Intelligence and Temporal Difference Learning Algorithms in a Computerized Chess Program By James Mannion Computer Systems.
Learning to Play the Game of GO Lei Li Computer Science Department May 3, 2007.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
Well Posed Learning Problems Must identify the following 3 features –Learning Task: the thing you want to learn. –Performance measure: must know when you.
The Standard Genetic Algorithm Start with a “population” of “individuals” Rank these individuals according to their “fitness” Select pairs of individuals.
A Scalable Machine Learning Approach to Go Pierre Baldi and Lin Wu UC Irvine.
A Heuristic Hillclimbing Algorithm for Mastermind Alexandre Temporel and Tim Kovacs.
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module 5 Adversarial Search (Thanks Meinolf Sellman!)
Teaching Computers to Think:
Adversarial Search Chapter 5 Sections 1 – 4. AI & Expert Systems© Dr. Khalid Kaabneh, AAU Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
ADVERSARIAL SEARCH Chapter 6 Section 1 – 4. OUTLINE Optimal decisions α-β pruning Imperfect, real-time decisions.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.
Game Playing Why do AI researchers study game playing?
D1 Miwa Makoto Chikayama & Taura Lab
By James Mannion Computer Systems Lab Period 3
Adversarial Search Chapter 5.
Counter propagation network (CPN) (§ 5.3)
Games & Adversarial Search
CS51A David Kauchak Spring 2019
Presentation transcript:

Othello Sean Farrell June 29, 2015

Othello Two-player game played on 8x8 board All pieces have one white side and one black side Initial board setup is shown right, with valid moves marked as dots

The Evolution of Strong Othello Programs Michael Buro, 2003 First computer Othello tournaments in 1979 In 1980, first time a World-champion lost a game of skill against a computer 6-0 defeat of then human World-champion in 1997

Evaluation Function Evolution Mini-max search used to estimate the chance of winning for the player with current move Construct function using features of board state that correlate with winning Important features  Disc stability – stable discs cannot be flipped  Disc mobility – move options  Disc parity – last move opportunities for every empty board region

IAGO (1982) Classic, hand-crafted evaluation function Features were chosen based on analysis of Othello  Edge Stability & Internal Stability  Current Mobility & Potential Mobility Evaluation parameters were done manually

BILL (1990) Partly pattern based, feature weights are learned  Edge Stability  Current Mobility & Potential Mobility  Sequence Penalty Evaluation speed increased by using pre- computed tables In comparison, BILL wins all games against IAGO, with 20% the thinking time

Logistello-1(1994) Pattern values learned independently from sample data Uses logistic regression to combine features  Current Mobility & Potential Mobility  Patterns

Logistello-1(1994) Evaluation function is entirely table based All evaluation parameters are learned from sample positions Logistello uses selective search heuristic called ProbCut Dominated computer Othello until 1996

Logistello-2(1997) Joint learning of pattern values Assigning pattern values independently neglected pattern correlations Can assign arbitrary values to patterns whose meaning is not bound by limited human understanding of the problem Strength increase from 1994 version is ten- fold

Other Important Improvements Opening books  Saves time  Avoids falling into known strategic traps  Less chance of losing two games in the same way End game search  Allows for optimal play near end

Takeshi Murakami vs. Logistello Michael Buro, 1997 Both humans and computers have improved playing Othello Against imperfect human players it pays off to complicate endgame positions, not so with strong Othello programs

Human vs. Computer Takeshi MurakamiMichael Buro and is program Logistello

Discovering Complex Othello Strategies… D. E. Moriarty & Risto Miikkulainen, 1997 Found that experts search selective paths through pattern recognition Most programs used deeper searches to be effective Wanted a “human-like” approach

Neural Networks Networks learned Othello without previous knowledge No hand-coded rules or heuristics Strategies evolved through play No search mechanism Goal was to discover strategies

Implementation Neural networks relied on pattern recognition Used marker-based scheme with genetic algorithms Network architecture and weights evolve Population of 50 networks Initially evolved against random move maker

Results Positional strategy against random mover after 100 generations

Results Mobility strategy against searcher after 2000 generations

Articles M. Buro, The Evolution of Strong Othello Programs, 2003 M. Buro, Takeshi Murakami vs. Logistello, 1997 D. E. Moriarty & R. Miikkulainen, Discovering Complex Othello Strategies Through Evolutionary Neural Networks, 1995

Websites Michael Buro Neural Networks Research Group Free online Othello game

Questions ??