GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING

Slides:



Advertisements
Similar presentations
1 UM Stratego Collin Schepers Daan Veltman Enno Ruijters Leon Gerritsen Niek den Teuling Yannick Thimister.
Advertisements

An Introduction to Game Theory Part V: Extensive Games with Perfect Information Bernhard Nebel.
P ROBLEM S OLVING WITH 6C Probability and strategy.
A Whist AI Jason Fong CS261A, Spring What is Whist? Old card game, driven into obscurity by Bridge Similar to other trick taking games Bridge, Spades,
Artificial Intelligence Presentation
Todd W. Neller Gettysburg College
ChooseMove=16  19 V =30 points. LegalMoves= 16  19, or SimulateMove = 11  15, or …. 16  19,
Adversarial Search We have experience in search where we assume that we are the only intelligent being and we have explicit control over the “world”. Lets.
For Friday Finish chapter 5 Program 1, Milestone 1 due.
AI for Connect-4 (or other 2-player games) Minds and Machines.
Games and adversarial search
10/29/01Reinforcement Learning in Games 1 Colin Cherry Oct 29/01.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2008.
University College Cork (Ireland) Department of Civil and Environmental Engineering Course: Engineering Artificial Intelligence Dr. Radu Marinescu Lecture.
Chess AI’s, How do they work? Math Club 10/03/2011.
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
Lecture 13 Last time: Games, minimax, alpha-beta Today: Finish off games, summary.
GENETIC PROGRAMMING FOR CHECKERS PLAY Dustin Shutes Abstract: This project uses genetic programming to develop a program that plays the game of checkers.
CPSC 322 Introduction to Artificial Intelligence October 25, 2004.
Games with Chance Other Search Algorithms CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 3 Adapted from slides of Yoonsuck Choe.
Games and adversarial search
Generalized Threats Search Paper Review Paper Author: T. Cazenave Review by: A. Botea.
A TIE IS NOT A LOSS Paul Adamiak T02 Aruna Meiyeppen T01.
ICS-271:Notes 6: 1 Notes 6: Game-Playing ICS 271 Fall 2006.
Game Tree Search based on Russ Greiner and Jean-Claude Latombe’s notes.
THE RENJU GAME BY ABHISHEK JAIN, PRANSHU GUPTA & RHYTHM DAS PCLUB SUMMER PROJECT PRESENTATION JUNE, L7 IIT KANPUR MENTOR – SANIL JAIN.
The Poker Squares Challenge Todd W. Neller. What is the Poker Squares Challenge? A semester-long contest where Gettysburg College students (individuals.
Reinforcement Learning in the Presence of Hidden States Andrew Howard Andrew Arnold {ah679
The Parameterized Poker Squares EAAI NSG Challenge
1 Adversary Search Ref: Chapter 5. 2 Games & A.I. Easy to measure success Easy to represent states Small number of operators Comparison against humans.
CSC 412: AI Adversarial Search
Game Trees: MiniMax strategy, Tree Evaluation, Pruning, Utility evaluation Adapted from slides of Yoonsuck Choe.
PSU CS 370 – Introduction to Artificial Intelligence Game MinMax Alpha-Beta.
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning CPSC 315 – Programming Studio Spring 2008 Project 2, Lecture 2 Adapted from slides of Yoonsuck.
Minimax.
Lecture 6: Game Playing Heshaam Faili University of Tehran Two-player games Minmax search algorithm Alpha-Beta pruning Games with chance.
Introduction Many decision making problems in real life
Upper Confidence Trees for Game AI Chahine Koleejan.
Introduction to Artificial Intelligence CS 438 Spring 2008 Today –AIMA, Ch. 6 –Adversarial Search Thursday –AIMA, Ch. 6 –More Adversarial Search The “Luke.
14.3 Simulation Techniques and the Monte Carlo Method simulation technique A simulation technique uses a probability experiment to mimic a real-life situation.
Connect Four AI Robert Burns and Brett Crawford. Connect Four  A board with at least six rows and seven columns  Two players: one with red discs and.
1 N -Queens via Relaxation Labeling Ilana Koreh ( ) Luba Rashkovsky ( )
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
Senior Project Poster Day 2007, CIS Dept. University of Pennsylvania Reversi Meng Tran Faculty Advisor: Dr. Barry Silverman Strategies: l Corners t Corners.
CHECKERS: TD(Λ) LEARNING APPLIED FOR DETERMINISTIC GAME Presented By: Presented To: Amna Khan Mis Saleha Raza.
Game Playing. Introduction One of the earliest areas in artificial intelligence is game playing. Two-person zero-sum game. Games for which the state space.
Cilk Pousse James Process CS534. Overview Introduction to Pousse Searching Evaluation Function Move Ordering Conclusion.
Backtracking and Games Eric Roberts CS 106B January 28, 2013.
CSCI 4310 Lecture 6: Adversarial Tree Search. Book Winston Chapter 6.
For Friday Finish chapter 6 Program 1, Milestone 1 due.
DEEP RED An Intelligent Approach to Chinese Checkers.
RADHA-KRISHNA BALLA 19 FEBRUARY, 2009 UCT for Tactical Assault Battles in Real-Time Strategy Games.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
Will Britt and Bryan Silinski
CompSci Backtracking, Search, Heuristics l Many problems require an approach similar to solving a maze ä Certain mazes can be solved using the.
Graph Search II GAM 376 Robin Burke. Outline Homework #3 Graph search review DFS, BFS A* search Iterative beam search IA* search Search in turn-based.
CPS Backtracking, Search, Heuristics l Many problems require an approach similar to solving a maze ä Certain mazes can be solved using the “right-hand”
GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING
Adaptive Reinforcement Learning Agents in RTS Games Eric Kok.
Teaching Computers to Think:
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
Artificial Intelligence AIMA §5: Adversarial Search
CSE 4705 Artificial Intelligence
Improving Monte Carlo Tree Search Policies in StarCraft
Stochastic tree search and stochastic games
Mastering the game of Go with deep neural network and tree search
AlphaGO from Google DeepMind in 2016, beat human grandmasters
The Parameterized Poker Squares EAAI NSG Challenge
Kevin Mason Michael Suggs
Minimax Trees: Utility Evaluation, Tree Evaluation, Pruning
Presentation transcript:

GOMOKU ALGORITHM STUDY MIN-MAX AND MONTE CARLO APPROACHING Xie Guochen, Ge weixun, Jingtong Liu, Sun Wei

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion

Introduction Gomoku is an abstract strategy board game, also called Gobang or Five in a Row, played on a board of 15X15 intersections. Complicated, not very complicated Algorithm Minimax Monte Carlo

Why Monte? (Motivation) For some games, Minimax works really well. But for some other games, the search tree could be very large. It motivates us to implement an alternative algorithm called MonteCarlo Tree Search. For some games, We believe, if we use simulate annealing with applying local Minimax search, the agent will get better than simply adopted Minimax search.

Assumptions(Modification later) (1) Moves are performed randomly with the probabilities assigned by the method of simulated annealing, (2) The value of a position is defined by the win rate of the given position (3)To find the best move in a given position, play the game to the very end as suggested by (1) and then evaluate as in (2); play thousands of such random games, and the best move will be the one doing the best. Unique game, so modifications later

Win Time (Evaluation Function)

Updating the win time Update the Win Time by each roll out. The best move should always be played.

Issues we need to solve or improve How to choose the roots to build the search tree Two idiots play or something else? Uniqueness of the game: Order of the moves is important

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion

Minimax

Minimax Evaluation Function:

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion

Monte Carlo Gomoku simulate

How to build the roots(Genetic Algorithm) Instead of sing root or 5 roots, we enlarge it to 20.

Monte Carlo Gomoku Smart Simulate (trained by minimax as opponent)

Monte Carlo Gomoku When minimax trained twice, perform worse

Order importance (short-cut) Best moves should be played immediately. Urgent moves are important than big moves.

Win Time History(improvement) Update the win time history with weight

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion

GOMOKU ALGORITHM STUDY Introduction Approaching MiniMax Monte Tests Conclusion