Temperature Discovery Martin M ü ller, Markus Enzenberger and Jonathan Schaeffer zIntroduction: local and global search yLocal search algorithms yTemperature.

Slides:



Advertisements
Similar presentations
Adversarial Search Chapter 6 Sections 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Advertisements

Adversarial Search Chapter 6 Section 1 – 4. Types of Games.
Adversarial Search We have experience in search where we assume that we are the only intelligent being and we have explicit control over the “world”. Lets.
Games & Adversarial Search Chapter 5. Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent’s reply. Time.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2008.
CS 484 – Artificial Intelligence
Adversarial Search Chapter 6 Section 1 – 4.
Adversarial Search Chapter 5.
Lecture 12 Last time: CSPs, backtracking, forward checking Today: Game Playing.
Artificial Intelligence for Games Game playing Patrick Olivier
An Introduction to Artificial Intelligence Lecture VI: Adversarial Search (Games) Ramin Halavati In which we examine problems.
1 Adversarial Search Chapter 6 Section 1 – 4 The Master vs Machine: A Video.
10/19/2004TCSS435A Isabelle Bichindaritz1 Game and Tree Searching.
Games CPSC 386 Artificial Intelligence Ellen Walker Hiram College.
Chess AI’s, How do they work? Math Club 10/03/2011.
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
State Space 4 Chapter 4 Adversarial Games. Two Flavors Games of Perfect Information ◦Each player knows everything that can be known ◦Chess, Othello Games.
Game Playing CSC361 AI CSC361: Game Playing.
Solving Probabilistic Combinatorial Games Ling Zhao & Martin Mueller University of Alberta September 7, 2005 Paper link:
Strategies Based On Threats Ling Zhao University of Alberta March 10, 2003 Comparative evaluation of strategies based on the values of direct threats by.
UNIVERSITY OF SOUTH CAROLINA Department of Computer Science and Engineering CSCE 580 Artificial Intelligence Ch.6: Adversarial Search Fall 2008 Marco Valtorta.
Game-Playing Read Chapter 6 Adversarial Search. Game Types Two-person games vs multi-person –chess vs monopoly Perfect Information vs Imperfect –checkers.
Computer Go Research at the University of Alberta Martin Müller.
Choose A Subgame to Play by Zhichao “Leaf” Li October 20th, 2003.
Sum of Combinatorial Games by Leaf Wednesday, October 12th, 2004.
Combinatorial Games Martin Müller. Contents Combinatorial game theory Thermographs Go and Amazons as combinatorial games.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2006.
Choose A Subgame to Play by Zhichao “Leaf” Li August 1 st, 2003.
Adversarial Search: Game Playing Reading: Chess paper.
Games & Adversarial Search Chapter 6 Section 1 – 4.
Choose A Subgame to Play by Zhichao “Leaf” Li January 27 th, 2004.
Game Playing State-of-the-Art  Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in Used an endgame database defining.
Introduction Many decision making problems in real life
Prepared by : Walaa Maqdasawi Razan Jararah Supervised by: Dr. Aladdin Masri.
Computer Go : A Go player Rohit Gurjar CS365 Project Proposal, IIT Kanpur Guided By – Prof. Amitabha Mukerjee.
Game Playing Chapter 5. Game playing §Search applied to a problem against an adversary l some actions are not under the control of the problem-solver.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
Traditional game playing 2 player adversarial (win => lose) based on search but... huge game trees can't be fully explored.
Alpha-beta Search. Two-player games zThe object of a search is to find a path from the starting position to a goal position zIn a puzzle-type problem,
Temperature Discovery Search Temperature Discovery Search (TDS) is a new minimaxbased game tree search method designed to compute or approximate the temperature.
Mark Dunlop, Computer and Information Sciences, Strathclyde University 1 Algorithms & Complexity 5 Games Mark D Dunlop.
Adversarial Search Chapter 6 Section 1 – 4. Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
Game Playing. Towards Intelligence? Many researchers attacked “intelligent behavior” by looking to strategy games involving deep thought. Many researchers.
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
1 Adversarial Search CS 171/271 (Chapter 6) Some text and images in these slides were drawn from Russel & Norvig’s published material.
CHAPTER 4 PROBABILITY THEORY SEARCH FOR GAMES. Representing Knowledge.
Amazons Experiments in Computer Amazons, Martin Mueller and Theodore Tegos, 2002 Exhaustive Search in the Game Amazons Raymond Georg Snatzke, 2002 Presented.
Search exploring the consequences of possible actions.
Game Playing. Introduction One of the earliest areas in artificial intelligence is game playing. Two-person zero-sum game. Games for which the state space.
Cilk Pousse James Process CS534. Overview Introduction to Pousse Searching Evaluation Function Move Ordering Conclusion.
Adversarial Search Chapter Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent reply Time limits.
Adversarial Games. Two Flavors  Perfect Information –everything that can be known is known –Chess, Othello  Imperfect Information –Player’s have each.
Game tree search Chapter 6 (6.1 to 6.3 and 6.6) cover games. 6.6 covers state of the art game players in particular. 6.5 covers games that involve uncertainty.
Learning to Play the Game of GO Lei Li Computer Science Department May 3, 2007.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
Adversarial Search Chapter 6 Section 1 – 4. Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent reply Time.
Explorations in Artificial Intelligence Prof. Carla P. Gomes Module 5 Adversarial Search (Thanks Meinolf Sellman!)
Adversarial Search Chapter 5 Sections 1 – 4. AI & Expert Systems© Dr. Khalid Kaabneh, AAU Outline Optimal decisions α-β pruning Imperfect, real-time decisions.
ADVERSARIAL SEARCH Chapter 6 Section 1 – 4. OUTLINE Optimal decisions α-β pruning Imperfect, real-time decisions.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
1 Chapter 6 Game Playing. 2 Chapter 6 Contents l Game Trees l Assumptions l Static evaluation functions l Searching game trees l Minimax l Bounded lookahead.
Game Playing Why do AI researchers study game playing?
By Kevin Madison and Emma Drobina
Machine Learning for Go
State Space 4 Chapter 4 Adversarial Games.
David Kauchak CS52 – Spring 2016
Adversarial Search Chapter 5.
Adversarial Search.
Adversarial Search Chapter 6 Section 1 – 4.
Presentation transcript:

Temperature Discovery Martin M ü ller, Markus Enzenberger and Jonathan Schaeffer zIntroduction: local and global search yLocal search algorithms yTemperature zEnvironments and coupon stacks zTemperature discovery search zFirst results

Local and Global Search Local search Partition game into sum of subgames Local analysis Problem: how to evaluate local results? Central question: which sums of games are wins? Global search Single, monolithic game state Full board evaluation Single game tree, minimax backup Central question: what is the minimax score?

Why Local Search? zGlobal Alpha-beta: Search time exponential in size of full problem zLocal search: time exponential in size of subproblems

Results of Local Searches z1. Exact: combinatorial game value (Winning Ways, my Ph.D. thesis on Go endgames)  2. Inexact, but “ very good ” : temperatures, thermographs (Go: Berlekamp, Spight, Fraser, M ü ller, Amazons: Theo Tegos) z3. Even less exact: heuristic search to estimate the temperature (This work, with Markus and Jonathan)

1. Decomposition Search zUsual: global game tree search zDS: Divide-and-conquer approach zIdea: yDivide game into sub-games yDo a local search zCombine local results: Combinatorial game theory

2. Temperatures, Thermographs

3. Temperature Discovery  Problem: Thermographs computed “ bottom-up ” zNeeds complete local game tree zSometimes too expensive zHeuristic evaluation works well in global search zIdea: use it in local search to estimate temperature

Temperature Discovery zA different way to compute temperatures (Berlekamp):  Play local game + “ Coupon stack ”  Choose between play on the board and “ coupon ” (move of known value) zTemperature of coupon of value t is t. So can estimate temp of board!

Example zCoupon stack 3,2,1,0,-1 zAmazons board zSearch depth 4 z1. B: Coupon(3) 2. W: C8-C7xC8 3. B: Coupon(2) z4. W: Coupon(1) 9.. X. 8.. W. 7 X.. B 6. X.. A B C D

Example (cont ’ d) zUses heuristic evaluation of board zDepth-limited search zResult: ywhen does it change from taking coupons to board? yEstimate for the temperature

Experiments (1) zRun temp. discovery search on small areas  Compare estimated t against exact t from Theo Tegos ’ Databases zPlot real t vs estimated t zWorks OK, but still some problems/bugs?

Experiments (2) zSample starting positions with 2, 4 and 6 subgames zSubgame size 4x4, 5x5 zTemperature discovery in each local game  Simple ‘ hotstrat ’ player zPlay 2x200 games against Arrow (full board search)

 ‘ Coupon player ’ vs Arrow yAbout 10 sec./move Two, four, six 4x4 subgames

Two and Four 5x5 subgames

 average over 200 pairs of games (stdDev 11.5) 5x5 subgames

zArrow(10sec) vs Arrow on four 4x4 zDifferent time limits for opponent Control experiment 5s1s30s10s

Sample 4x5x5 Game

zMore experiments, e.g. 6x5x5, 6x6,... zTry on real games zBetter sum game algorithm zTune, fix temperature discovery search zOptimal solver? (Needs global search too) zThe real goal - apply to Go! To Do...

Summary zLocal search algorithm zTry to discover temperature by minimax search zApplications: Amazons, future: Go zFirst results: it works... zStill lots of open questions