Strategies Based On Threats Ling Zhao University of Alberta March 10, 2003 Comparative evaluation of strategies based on the values of direct threats by.

Slides:



Advertisements
Similar presentations
On the Complexity of Parallel Hardness Amplification for One-Way Functions Chi-Jen Lu Academia Sinica, Taiwan.
Advertisements

Exploring Machine Learning in Computer Games Presented by: Matthew Hayden Thurs, 25 th March 2010.
An Introduction to Game Theory Part V: Extensive Games with Perfect Information Bernhard Nebel.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Adversarial Search: Game Playing Reading: Chapter next time.
Application of Artificial intelligence to Chess Playing Capstone Design Project 2004 Jason Cook Bitboards  Bitboards are 64 bit unsigned integers, with.
Artificial Intelligence in Game Design Heuristics and Other Ideas in Board Games.
Chess AI’s, How do they work? Math Club 10/03/2011.
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
An Alternative Approach for Playing Complex Games like Chess. 1Alternative Game Playing Approach Jan Lemeire May 19 th 2008.
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
Satisfaction Equilibrium Stéphane Ross. Canadian AI / 21 Problem In real life multiagent systems :  Agents generally do not know the preferences.
Abstract Proof Search Studied by Tristan Cazenave Surveyed by Akihiro Kishimoto.
Search for Transitive Connections Ling Zhao University of Alberta October 27, 2003 Author: T. Cazenave and B. Helmstetter published in JCIS'03.
Mathematical Morphology Applied to Computer Go Author: Bruno Bouzy Presenter: Ling Zhao June 30, 2004.
Progressive Strategies For Monte-Carlo Tree Search Presenter: Ling Zhao University of Alberta November 5, 2007 Authors: G.M.J.B. Chaslot, M.H.M. Winands,
Combining Tactical Search and Monte-Carlo in the Game of Go Presenter: Ling Zhao University of Alberta November 1, 2005 by Tristan Cazenave & Bernard Helmstetter.
Towards Multi-Objective Game Theory - With Application to Go A.B. Meijer and H. Koppelaar Presented by Ling Zhao University of Alberta October 5, 2006.
Solving Probabilistic Combinatorial Games Ling Zhao & Martin Mueller University of Alberta September 7, 2005 Paper link:
Generalized Threats Search Paper Review Paper Author: T. Cazenave Review by: A. Botea.
Temperature Discovery Martin M ü ller, Markus Enzenberger and Jonathan Schaeffer zIntroduction: local and global search yLocal search algorithms yTemperature.
Metarules To Improve Tactical Go Knowledge By Tristan Cazenave Presented by Leaf Wednesday, April 28 th, 2004.
Hex A game of connections. The Beginning Invented independently by Piet Hein in 1942 and John Nash in Invented independently by Piet Hein in 1942.
Choose A Subgame to Play by Zhichao “Leaf” Li October 20th, 2003.
Pattern Matching in Computer Go Ling Zhao University of Alberta August 7, 2002.
Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6 th, 2006 CS286r Presented by Ilan Lobel.
Playing Konane Mathematically with Combinatorial Game Theory
Sum of Combinatorial Games by Leaf Wednesday, October 12th, 2004.
Static Games of Complete Information: Subgame Perfection
Combinatorial Games Martin Müller. Contents Combinatorial game theory Thermographs Go and Amazons as combinatorial games.
Go Meeting Talk1 Generation of Patterns with External Conditions for the Game of Go Paper presentation
Choose A Subgame to Play by Zhichao “Leaf” Li August 1 st, 2003.
1 Solving Ponnuki-Go on Small Board Paper: Solving Ponnuki-Go on small board Authors: Erik van der Werf, Jos Uiterwijk, Jaap van den Herik Presented by:
Depth Increment in IDA* Ling Zhao Dept. of Computing Science University of Alberta July 4, 2003.
Adversarial Search: Game Playing Reading: Chess paper.
Multipurpose Adversary Planning in the Game of Go Ph.D thesis by Shui Hu Presenter: Ling Zhao Date: November 18, 2002.
Tree-Building. Methods in Tree Building Phylogenetic trees can be constructed by: clustering method optimality method.
Choose A Subgame to Play by Zhichao “Leaf” Li January 27 th, 2004.
Accidentals, steps, scales Sharps(#) Flats(b). Look at the keyboard and notice how the sharps and flats are laid out.. A sharp is defined as the note.
Brute Force Average of 88 checks Worst possible algorithm if there is a ship in the bottom right cell Best search: 25.
Artificial Intelligence in Game Design Lecture 22: Heuristics and Other Ideas in Board Games.
Computer Go : A Go player Rohit Gurjar CS365 Project Proposal, IIT Kanpur Guided By – Prof. Amitabha Mukerjee.
Temperature Discovery Search Temperature Discovery Search (TDS) is a new minimaxbased game tree search method designed to compute or approximate the temperature.
Instructor: Vincent Conitzer
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
1 Sketch tools and Related Research Rachel Patel.
GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING
Application of Artificial Intelligence to Chess Playing Jason Cook Capstone Project.
05/2007ORNL Presentation Distributed Denial of Service Games by Chinar Dingankar, Student Dr. R. R. Brooks, Associate Professor Holcombe Department of.
Amazons Experiments in Computer Amazons, Martin Mueller and Theodore Tegos, 2002 Exhaustive Search in the Game Amazons Raymond Georg Snatzke, 2002 Presented.
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
Backtracking and Games Eric Roberts CS 106B January 28, 2013.
Final Thoughts. Health Communication Process Pre-work Formative Research Strategy Development Pre-testing Implementation Evaluation.
GAME PLAYING 1. There were two reasons that games appeared to be a good domain in which to explore machine intelligence: 1.They provide a structured task.
DEEP RED An Intelligent Approach to Chinese Checkers.
Adversarial Search Chapter Games vs. search problems "Unpredictable" opponent  specifying a move for every possible opponent reply Time limits.
Introduction to Game Theory Kamal Aboul-Hosn Cornell University Computers Playing Games.
PROBLEM-SOLVING TECHNIQUES Rocky K. C. Chang November 10, 2015.
1Computer Sciences. 2 HEAP SORT TUTORIAL 4 Objective O(n lg n) worst case like merge sort. Sorts in place like insertion sort. A heap can be stored as.
Othello Artificial Intelligence With Machine Learning Computer Systems TJHSST Nick Sidawy.
GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING
Error-Correcting Code
Conjecture: PQ = ½ (AD – BC). 2.In quadrilateral ABCD, the measures of angles A, B, C and D, in that order, form an increasing arithmetic sequence.
Instructor: Vincent Conitzer
AlphaGo with Deep RL Alpha GO.
Parallel Nested Monte-Carlo Search
تصنيف التفاعلات الكيميائية
CSE 589 Applied Algorithms Spring 1999
Dynamic Programming Dynamic Programming 1/15/ :41 PM
Instructor: Vincent Conitzer
Presentation transcript:

Strategies Based On Threats Ling Zhao University of Alberta March 10, 2003 Comparative evaluation of strategies based on the values of direct threats by Tristan Cazenave in Board Games in Academia V, Barcelona, 2002.

Outline Motivations Direct threats Thermograph Threat strategies Experimental results

Motivations Apply combinatorial game theory to Go - independent subgames - approach the best stategy

Direct threats Each subgame has at most two moves

Root of the tree Black: 22 points White: 15 points Value: 7 points for black

Sequence after two black moves Black: 25 points White: 14 points Value: 11 points for black

Direct threat: 4 values A, B, C, D with A >= B >= C >= D Denoted as (A, B, C, D) After one black move => (A,B) After one white move => (C,D)

Problem model Given a set of independent subgames with direct threats, how to compute good moves fast and accurately. (A1,B1,C1,D1) (A2,B2,C2,D2)... (An, Bn, Cn, Dn) Choose which subgame to play a move?

Thermograph Temperature: 6.5 Mean value: -4.5 Thermograph for direct threat (4,0,-6,-16)

Strategies Optimal stategy: brute force (NP-hard?) BMove: compare the best Left move (A,B,C,D) => value B (A,B) => value A MaxMove: (A,B,C,D) => value B-C (A,B) => value A-B

Stategies Sente Strategy A – B > MaxV => Sente C – D > MaxV => Reverse Sente Sente or Reverse Sente => MaxV *= 2 both => MaxV *= 4 SenteQ Strategy Always choose sente or reverse sente moves first. Use MaxV to break ties.

MaxThreat stategy

Stategies HotStrat: compare temperature ThermoStrat: adding all the thermographs of all the subgame to choose the best subgame to play in.

Experimental results Randomly choose 5 subgames (A>=B>=C>=D) 8 Strategies playing against each other (56 games one round) 100 rounds

Conclusions Simple strategy is good enough! Hotstrat strategy is easy to implement and can get 12% improvement. Problems (my thoughts) Impact of imprecise evaluation? How to generalize to more than 2 plies?