Tetris Agent Optimization Using Harmony Search Algorithm

Slides:



Advertisements
Similar presentations
Local optimization technique G.Anuradha. Introduction The evaluation function defines a quality measure score landscape/response surface/fitness landscape.
Advertisements

G5BAIM Artificial Intelligence Methods
Artificial Intelligence Presentation
ANALYSIS OF VARIANCE (ONE WAY)
Tetris – Genetic Algorithm Presented by, Jeethan & Jun.
Tetris and Genetic Algorithms Math Club 5/30/2011.
On the Genetic Evolution of a Perfect Tic-Tac-Toe Strategy
Minimax and Alpha-Beta Reduction Borrows from Spring 2006 CS 440 Lecture Slides.
Efficiency concerns in Privacy Preserving methods Optimization of MASK Shipra Agrawal.
Planning under Uncertainty
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Chapter 4 DECISION SUPPORT AND ARTIFICIAL INTELLIGENCE
Multiobjective VLSI Cell Placement Using Distributed Simulated Evolution Algorithm Sadiq M. Sait, Mustafa I. Ali, Ali Zaidi.
Lecture 5: Learning models using EM
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Intelligent Agents What is the basic framework we use to construct intelligent programs?
Uninformed Search Reading: Chapter 3 by today, Chapter by Wednesday, 9/12 Homework #2 will be given out on Wednesday DID YOU TURN IN YOUR SURVEY?
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
1 An Open Boundary Safety-of- Territory Solver for the Game of Go Author: Xiaozhen Niu, Martin Mueller Dept of Computing Science University of Alberta.
Genetic Programming.
Photo-realistic Rendering and Global Illumination in Computer Graphics Spring 2012 Stochastic Radiosity K. H. Ko School of Mechatronics Gwangju Institute.
MAKING COMPLEX DEClSlONS
Vilalta&Eick: Informed Search Informed Search and Exploration Search Strategies Heuristic Functions Local Search Algorithms Vilalta&Eick: Informed Search.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Optimum Design of Steel Space Frames by Hybrid Teaching-Learning Based Optimization and Harmony Search Algorithms & Dr.Alper AKIN Dr. IbrahIm AYDOGDU Dear.
Transportation Problem
Gapped BLAST and PSI- BLAST: a new generation of protein database search programs By Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schäffer, Jinghui.
October 3, 2012Introduction to Artificial Intelligence Lecture 9: Two-Player Games 1 Iterative Deepening A* Algorithm A* has memory demands that increase.
Applying reinforcement learning to Tetris A reduction in state space Underling : Donald Carr Supervisor : Philip Sterne.
Comparison of Differential Evolution and Genetic Algorithm in the Design of a 2MW Permanent Magnet Wind Generator A.D.Lilla, M.A.Khan, P.Barendse Department.
Reinforcement Learning for the game of Tetris using Cross Entropy
Monte Carlo Methods Versatile methods for analyzing the behavior of some activity, plan or process that involves uncertainty.
Connect Four AI Robert Burns and Brett Crawford. Connect Four  A board with at least six rows and seven columns  Two players: one with red discs and.
1 N -Queens via Relaxation Labeling Ilana Koreh ( ) Luba Rashkovsky ( )
Applying Genetic Algorithm to the Knapsack Problem Qi Su ECE 539 Spring 2001 Course Project.
Advanced Computer Architecture and Parallel Processing Rabie A. Ramadan http:
CHECKERS: TD(Λ) LEARNING APPLIED FOR DETERMINISTIC GAME Presented By: Presented To: Amna Khan Mis Saleha Raza.
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
FORS 8450 Advanced Forest Planning Lecture 11 Tabu Search.
Cilk Pousse James Process CS534. Overview Introduction to Pousse Searching Evaluation Function Move Ordering Conclusion.
AN INTELLIGENT AGENT is a software entity that senses its environment and then carries out some operations on behalf of a user, with a certain degree of.
Mechanical Engineering Department 1 سورة النحل (78)
Taguchi. Abstraction Optimisation of manufacturing processes is typically performed utilising mathematical process models or designed experiments. However,
Neural Network Implementation of Poker AI
Monte-Carlo method for Two-Stage SLP Lecture 5 Leonidas Sakalauskas Institute of Mathematics and Informatics Vilnius, Lithuania EURO Working Group on Continuous.
Outline Intro to Representation and Heuristic Search Machine Learning (Clustering) and My Research.
Is One Better than None? Does foresight allow an artificial intelligence to survive longer in Tetris? William Granger and Liqun Tracy Yang.
Journal Club Journal of Chemometrics May 2010 August 23, 2010.
 2008 Pearson Education, Inc. All rights reserved. 1 Arrays and Vectors.
Hub Location–Allocation in Intermodal Logistic Networks Hüseyin Utku KIYMAZ.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
An Exact Algorithm for Difficult Detailed Routing Problems Kolja Sulimma Wolfgang Kunz J. W.-Goethe Universität Frankfurt.
Metaheuristics for the New Millennium Bruce L. Golden RH Smith School of Business University of Maryland by Presented at the University of Iowa, March.
R. Brafman and M. Tennenholtz Presented by Daniel Rasmussen.
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
Teaching Computers to Think:
Journal of Computational and Applied Mathematics Volume 253, 1 December 2013, Pages 14–25 Reporter : Zong-Dian Lee A hybrid quantum inspired harmony search.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
March 1, 2016Introduction to Artificial Intelligence Lecture 11: Machine Evolution 1 Let’s look at… Machine Evolution.
 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.
Chapter 11: Artificial Intelligence
Iterative Deepening A*
Computer Science cpsc322, Lecture 14
Subject Name: Operation Research Subject Code: 10CS661 Prepared By:Mrs
CS 416 Artificial Intelligence
Kevin Mason Michael Suggs
Introduction to Artificial Intelligence Lecture 9: Two-Player Games I
Chapter 6 Network Flow Models.
Unit II Game Playing.
Presentation transcript:

Tetris Agent Optimization Using Harmony Search Algorithm Computational Intelligence Ali Ahmed Thawerani

Tetris Gameboard Tetrominos Standard Tetris game board has a dimension of 10 x 20 Game pieces fall from the top of the game board one at a time Player can manipulate falling game piece and also view the next piece that is going to fall Player positions the pieces such that there are no gaps in between lines and when the gaps are filled then lines disappear and score increases There is no win condition, the game continues till the game board disallows further entry of game pieces There are seven tetrominos or game pieces Game pieces differ significantly by the maximum number of rows they can clear simultaneously All are pieces able to clear one or two rows “J” and “L” have capability to clear three but “I” can clear four rows

Harmony Search Algorithm Meta heuristic algorithm Stochastic optimization – employ some degree of randomness in searching for a solution. Solve problems through a series of intelligent guesses Musical improvisation process – a band of musicians continuously tries to create better harmony Discover the high performance regions of solution space in reasonable amount of time

Harmony Search Algorithm Musician’s approach Musician’s approach is one or combination of three possible methods Playing the original piece Playing in a way similar to the original piece Creating a piece through random notes HS Algorithm Three parts of HS algorithm Harmony memory consideration – potential solutions consideration Pitch adjustment – exploitation Randomization – exploration

Harmony Search Algorithm

Harmony Search Algorithm Initialization: by filling it up with random solutions; each harmony is evaluated using an evaluation or objective function. Harmony improvisation: A new solution is created. The three methods are used to decide on the value that will be assigned to each decision variable in the solution. Creation of a new solution: A new solution is created either randomly with a probability of 1 - raccept by copying an existing solution in the harmony memory, with a probability equal to raccept. Adjustment: With a probability of rpa, the elements of the new harmony are then modified. Using the objective function, the new harmony is evaluated. Selection: When a terminating condition is met, the best harmony (solution) in the harmony memory is selected. Maximum number of cycles or iterations – is the basis for terminating the optimization process. Harmony memory size – refers to the number of harmonies that will be stored in the harmony memory. Number of decision variables – each harmony is composed of several decision variables. Harmony Memory Consideration rate (raccept) – determines the rate at which decision variables in the harmony are considered as elements of the new harmony that will be created. Pitch Adjustment rate (rpa ) - defines the probability for adjusting the values of decision variables copied from an existing harmony in the harmony memory by adding a certain value.

Harmony Search Algorithm

HS Based Tetris Intelligent Agent Pile Height Holes Connected Holes Removed rows Altitude difference Maximum well depth Sum of all wells Landing height Blocks Weighted blocks Row transitions Column transitions Highest hole Block above highest hole Potential rows Smoothness Eroded pieces Row holes Hole depth

Harmonetris Highest Holes –height of the topmost hole on the game board. 8 Blocks above highest hole –number of blocks on top of the highest hole. 2 Potential Rows – number of rows located above the highest hole and in use by more than 8 cells. 0

Harmonetris

Harmonetris

Harmonetris

Harmonetris Removed rows – rows cleared in last step Blocks – number of cells that has been occupied in the board Eroded pieces – number of rows cleared in last move multiplied with the number of cells of the last piece that were eliminated in the last move

Tetris Game Simulator Every generated harmony of the solution optimizer in the form of a vector of weights is fed to the Tetris Game Simulator. The simulator performs a simulation of the game using the feature functions as basis for determining the best move, and then it returns the maximum number of cleared rows, which is the objective function value of the harmony. Two main component Tetris Game Tetris Agent Tetris Game – defines the logical characteristics of the game and the rules on how it must be played Tetris Agent – plays the Tetris game until a termination condition is met and returns results Positions – 10 possibilities translations Orientation – 4 possibilities

Tetris Game Simulator

Results The goal of the experiments is to determine the efficiency of the Harmony Search algorithm as the underlying optimization algorithm for the Tetris intelligent agent and to test the ability of the intelligent agent, Harmonetris, in finding the best possible solution with respect to the spawned game pieces. Each setup was subjected to 30 runs to make sure that the results are statistically acceptable. In this experimental setup, the following parameters are defined: Memory Improvisation (Number of Cycles) = 100 Harmony Memory Size (Musical Pieces) = 5 Harmony Consideration / Acceptance Rate = 0.95 Pitch Adjustment Rate = 0.99

Results The results of our first experimental setup determined the performance of Harmony Search algorithm as the underlying optimization algorithm. After executing 120 runs in all, it has been observed that the maximum number of rows that the Tetris agent can clear is determined by

Results

Results

Results Another experiment was conducted to determine the best configuration for the feature functions, thus the terminating condition was set to “Game Over” only, with no restriction on the number of cycles and spawned pieces. After allowing the program to run for two weeks straight, the Tetris agent was able to obtain the following harmonies, xi, on its 304th cycle of harmony improvisation. Table 4 shows the results obtained by the 5 harmonies on the 304th cycle.

Conclusion The best harmony (weight configuration) found in a span of two weeks was able to clear 416,928 rows in 1,042,354 spawned pieces yielding the Spawned Pieces to Cleared Rows ratio (SP/CR ratio) of 2.50008154885256. Thus, it can be observed that as the number of cleared rows increases, the SP/CR ratio approaches the optimum value of 2.5.