Evolving Multi-modal Behavior in NPCs Jacob Schrum – Risto Miikkulainen –

Slides:

Advertisements

Similar presentations

Yuri R. Tsoy, Vladimir G. Spitsyn, Department of Computer Engineering

Advertisements

Using Parallel Genetic Algorithm in a Predictive Job Scheduling

Tetris – Genetic Algorithm Presented by, Jeethan & Jun.

Constructing Complex NPC Behavior via Multi- Objective Neuroevolution Jacob Schrum – Risto Miikkulainen –

Multiobjective Optimization Chapter 7 Luke, Essentials of Metaheuristics, 2011 Byung-Hyun Ha R1.

Generating Diverse Opponents with Multi-Objective Evolution Generating Diverse Opponents with Multi-Objective Evolution Alexandros Agapitos, Julian Togelius,

Estimation of Distribution Algorithms Ata Kaban School of Computer Science The University of Birmingham.

1 Autonomous Controller Design for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Choong K. Oh and Gregory J. Barlow U.S. Naval Research.

A New Evolutionary Algorithm for Multi-objective Optimization Problems Multi-objective Optimization Problems (MOP) –Definition –NP hard By Zhi Wei.

Multi-Objective Evolutionary Algorithms Matt D. Johnson April 19, 2007.

Evolving Neural Network Agents in the NERO Video Game Author ： Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen Presented by Yi Cheng Lin.

Using a GA to Create Prey Tactics Presented by Tony Morelli on 11/29/04.

Genetic Algorithms in Materials Processing N. Chakraborti Department of Metallurgical & Materials Engineering Indian Institute of Technology Kharagpur.

Design of Autonomous Navigation Controllers for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Gregory J. Barlow March 19, 2004.

1 Autonomous Controller Design for Unmanned Aerial Vehicles using Multi-objective Genetic Programming Gregory J. Barlow North Carolina State University.

The Pareto fitness genetic algorithm: Test function study Wei-Ming Chen

A New Algorithm for Solving Many-objective Optimization Problem Md. Shihabul Islam ( ) and Bashiul Alam Sabab ( ) Department of Computer Science.

Genetic Algorithms Overview Genetic Algorithms: a gentle introduction –What are GAs –How do they work/ Why? –Critical issues Use in Data Mining –GAs.

Marcus Gallagher and Mark Ledwich School of Information Technology and Electrical Engineering University of Queensland, Australia Sumaira Saeed Evolving.

By Jacob Schrum and Risto Miikkulainen

1 Reasons for parallelization Can we make GA faster? One of the most promising choices is to use parallel implementations. The reasons for parallelization.

Evolutionary Multi-objective Optimization – A Big Picture Karthik Sindhya, PhD Postdoctoral Researcher Industrial Optimization Group Department of Mathematical.

A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.

Evolution, Brains and Multiple Objectives

Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.

Cristian Urs and Ben Riveira. Introduction The article we chose focuses on improving the performance of Genetic Algorithms by: Use of predictive models.

Soft Computing Lecture 18 Foundations of genetic algorithms (GA). Using of GA.

Constructing Intelligent Agents via Neuroevolution By Jacob Schrum

Study on Genetic Network Programming (GNP) with Learning and Evolution Hirasawa laboratory, Artificial Intelligence section Information architecture field.

More on coevolution and learning Jing Xiao April, 2008.

Design of a real time strategy game with a genetic AI By Bharat Ponnaluri.

StarCraft Learning Algorithms By Logan Yarnell, Steven Raines, and Dean Antel.

Pareto Coevolution Presented by KC Tsui Based on [1]

Fuzzy Genetic Algorithm

Evolving Virtual Creatures & Evolving 3D Morphology and Behavior by Competition Papers by Karl Sims Presented by Sarah Waziruddin.

CAP6938 Neuroevolution and Developmental Encoding Real-time NEAT Dr. Kenneth Stanley October 18, 2006.

2005MEE Software Engineering Lecture 11 – Optimisation Techniques.

Learning by Simulating Evolution Artificial Intelligence CSMC February 21, 2002.

Mike Taks Bram van de Klundert. About Published 2005 Cited 286 times Kenneth O. Stanley Associate Professor at University of Central Florida Risto Miikkulainen.

A Parallel Genetic Algorithm with Distributed Environment Scheme

ELeaRNT: Evolutionary Learning of Rich Neural Network Topologies Authors: Slobodan Miletic 3078/2010 Nikola Jovanovic 3077/2010

Kanpur Genetic Algorithms Laboratory IIT Kanpur 25, July 2006 (11:00 AM) Multi-Objective Dynamic Optimization using Evolutionary Algorithms by Udaya Bhaskara.

Evolutionary Programming

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Evolving Reactive NPCs for the Real-Time Simulation Game.

Pac-Man AI using GA. Why Machine Learning in Video Games? Better player experience Agents can adapt to player Increased variety of agent behaviors Ever-changing.

2/29/20121 Optimizing LCLS2 taper profile with genetic algorithms: preliminary results X. Huang, J. Wu, T. Raubenhaimer, Y. Jiao, S. Spampinati, A. Mandlekar,

Artificial Intelligence Research in Video Games By Jacob Schrum

CAP6938 Neuroevolution and Artificial Embryogeny Competitive Coevolution Dr. Kenneth Stanley February 20, 2006.

Evolving Multimodal Networks for Multitask Games

Evolving Agent Behavior in Multiobjective Domains Using Fitness-Based Shaping Jacob Schrum and Risto Miikkulainen University of Texas at Austin Department.

Riza Erdem Jappie Klooster Dirk Meulenbelt EVOLVING MULTI-MODAL BEHAVIOR IN NPC S.

Neural and Evolutionary Computing - Lecture 9 1 Evolutionary Multiobjective Optimization  Particularities of multiobjective optimization  Multiobjective.

Organic Evolution and Problem Solving Je-Gun Joung.

Zhengli Huang and Wenliang (Kevin) Du

Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.

Robot Intelligence Technology Lab. 10. Complex Hardware Morphologies: Walking Machines Presented by In-Won Park

Botprize 2010 Jacob Schrum, Igor Karpov, and Risto Miikkulainen

CAP6938 Neuroevolution and Artificial Embryogeny Real-time NEAT Dr. Kenneth Stanley February 22, 2006.

An Evolutionary Algorithm for Neural Network Learning using Direct Encoding Paul Batchis Department of Computer Science Rutgers University.

March 1, 2016Introduction to Artificial Intelligence Lecture 11: Machine Evolution 1 Let’s look at… Machine Evolution.

 Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems n Introduction.

UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces Jacob Schrum Igor V. Karpov

Solving Interleaved and Blended Sequential Decision-Making Problems through Modular Neuroevolution Jacob Schrum Risto Miikkulainen.

Evolving the goal priorities of autonomous agents

Dr. Kenneth Stanley September 25, 2006

Evolving Multimodal Networks for Multitask Games

UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces Jacob Schrum, Igor Karpov, and Risto Miikkulainen

Introduction to Artificial Intelligence Lecture 11: Machine Evolution

Dr. Kenneth Stanley February 6, 2006

CHAPTER I. of EVOLUTIONARY ROBOTICS Stefano Nolfi and Dario Floreano

Presentation transcript:

Evolving Multi-modal Behavior in NPCs Jacob Schrum – Risto Miikkulainen – University of Texas at Austin Department of Computer Sciences

Introduction Goal: discover NPC behavior automatically Goal: discover NPC behavior automatically Benefits Benefits Save production time/effort Save production time/effort Learn counterintuitive behaviors Learn counterintuitive behaviors Find weaknesses in static scripts Find weaknesses in static scripts Tailor behavior to human players Tailor behavior to human players

Introduction Challenges Challenges Games are complex Games are complex Multiple objectives Multiple objectives Multi-modal behavior required Multi-modal behavior required RL & Evolution popular approaches RL & Evolution popular approaches How to encourage multi-modal behavior? How to encourage multi-modal behavior?

Typical Agent Architecture One policy One policy Why not several policies? Why not several policies? Agent Environment policy Sensor input Actions

Agent With Multiple Policies Agent Environment policy 2 Sensor input Actions policy 1 policy n … arbitrate Policy for each mode Policy for each mode Individual policies simpler than monolithic policy Individual policies simpler than monolithic policy Must choose which policy to use Must choose which policy to use

Multi-modal Game Game to test multi-modal architecture Game to test multi-modal architecture Make task delineation clear Make task delineation clear Same NPCs perform two distinct tasks Same NPCs perform two distinct tasks Must determine their task from sensors Must determine their task from sensors New Game: “Fight or Flight” New Game: “Fight or Flight”

Fight or Flight Fight Task Player fights with bat Player fights with bat NPCs avoid bat NPCs avoid bat NPCs fight back NPCs fight back Flight Task Player has no weapon Player runs away NPCs confine/attack

NPC Objectives Fight Task Deal damage Deal damage Avoid damage Avoid damage Stay alive Stay alive Flight Task Deal damage Not the same objective as in the Fight task! How do we deal with multiple, competing objectives?

Multi-Objective Optimization Imagine game with two objectives: Imagine game with two objectives: Damage Dealt Damage Dealt Health Remaining Health Remaining A dominates B iff A is strictly better in one objective and at least as good in others A dominates B iff A is strictly better in one objective and at least as good in others Population of points not dominated are best: Pareto Front Population of points not dominated are best: Pareto Front High health but did not deal much damage Dealt lot of damage, but lost lots of health Tradeoff between objectives

NSGA-II Evolution: natural approach for finding optimal population Evolution: natural approach for finding optimal population Non-Dominated Sorting Genetic Algorithm II* Non-Dominated Sorting Genetic Algorithm II* Population P with size N; Evaluate P Population P with size N; Evaluate P Use mutation to get P´ size N; Evaluate P´ Use mutation to get P´ size N; Evaluate P´ Calculate non-dominated fronts of {P  P´} size 2N Calculate non-dominated fronts of {P  P´} size 2N New population size N from highest fronts of {P  P´} New population size N from highest fronts of {P  P´} *K. Deb et al. 2000

Neuroevolution Genetic Algorithms + Artificial Neural Networks Genetic Algorithms + Artificial Neural Networks NNs good at generating behavior NNs good at generating behavior GA creates new nets, evaluates them GA creates new nets, evaluates them Four basic mutations (no crossover used) Four basic mutations (no crossover used) Perturb WeightAdd ConnectionAdd Neuron Merge Neurons

New Mode Mutation New mode with inputs from preexisting mode New mode with inputs from preexisting mode Maximum preference neuron determines mode Maximum preference neuron determines mode

Experiment Compare 1Mode vs. ModeMutation Compare 1Mode vs. ModeMutation 10 trials each 10 trials each What to evolve against? What to evolve against? Bot with static policy (instead of player) Bot with static policy (instead of player) Bot has a first person perspective Bot has a first person perspective Fight Task Swing bat constantly Swing bat constantly Approach nearest bot in front Approach nearest bot in front Flight Task Back away from nearest bot in front Back away from nearest bot in front

Incremental Evolution Hard to evolve against proposed bot strategies Hard to evolve against proposed bot strategies Could easily fail to evolve interesting behavior Could easily fail to evolve interesting behavior Incremental evolution against increasing speeds Incremental evolution against increasing speeds 0%, 40%, 80%, 100% 0%, 40%, 80%, 100% Increase speed when all goals are met Increase speed when all goals are met End when goals met at 100% End when goals met at 100%

Goals Average population performance high enough? Average population performance high enough?  Then increase speed Each objective has a goal: Each objective has a goal: Fight Fight At least 50 damage to bot (1 kill) At least 50 damage to bot (1 kill) Less than 20 damage per NPC on average (2 hits) Less than 20 damage per NPC on average (2 hits) Survive at least 800 time steps (80% of trial) Survive at least 800 time steps (80% of trial) Flight Flight At least 100 damage to bot (2 kills) At least 100 damage to bot (2 kills) Average population objective score met goal value? Average population objective score met goal value?  Goal met

Mode Mutation Results Performs well in both tasks Performs well in both tasks Fight Task Fight Task Baiting behavior Baiting behavior One NPC takes damage so others can sneak up behind One NPC takes damage so others can sneak up behind Bot knocked back and forth Bot knocked back and forth Flight Task Flight Task Corralling behavior Corralling behavior Keep bot confined in ring of NPCs Keep bot confined in ring of NPCs Move to scare the bot into enclosure Move to scare the bot into enclosure

Use of Multiple Modes Different modes for baiting and attacking Different modes for baiting and attacking Similar elements of modes co-opted for different tasks Similar elements of modes co-opted for different tasks Many unselected modes Many unselected modes As many as 7 unused modes As many as 7 unused modes Still have outward connections Still have outward connections Are they vestigial? Are they vestigial?

1 Mode Results Only performs well in one task Only performs well in one task Example 1 Example 1 Runs away in Fight task Runs away in Fight task Corralling behavior in Flight task Corralling behavior in Flight task Example 2 Example 2 Overly aggressive in Fight task Overly aggressive in Fight task Lets bot escape in Flight task Lets bot escape in Flight task Population averages of individual objectives are high enough, but few individuals do well in all objectives Population averages of individual objectives are high enough, but few individuals do well in all objectives

Why Different Behaviors? Progression method Progression method Numerically similar performance Numerically similar performance Drastically different distribution of behaviors Drastically different distribution of behaviors 1Mode evolves groups for subsets of objectives 1Mode evolves groups for subsets of objectives ModeMutation biases towards solving all objectives ModeMutation biases towards solving all objectives Changes shape of fitness landscape Changes shape of fitness landscape

Future Work Improve progression Improve progression More granularity in tougher end of task sequence More granularity in tougher end of task sequence Can incremental evolution be avoided? Can incremental evolution be avoided? Improve multiobjective selection Improve multiobjective selection Bias towards middle of trade-off surface Bias towards middle of trade-off surface Other algorithms: Other algorithms: SPEA2 SPEA2 PESA-II PESA-II

Future Work Improve ModeMutation Improve ModeMutation Should new modes be strongly differentiated? Should new modes be strongly differentiated? Different arbitration mechanism? Different arbitration mechanism? Better option than randomly applying mutation? Better option than randomly applying mutation? Different initial connectivity? Different initial connectivity? P(y)P(x)

Conclusion ModeMutation encourages multi-modal behavior ModeMutation encourages multi-modal behavior Biases search toward multi-modal solutions Biases search toward multi-modal solutions ModeMutation better than 1Mode ModeMutation better than 1Mode More successes in shorter amount of time More successes in shorter amount of time Lead to multi-modal behavior in future games Lead to multi-modal behavior in future games

Questions? Movies: Movies:

Auxiliary Slides

Ignore Achieved Goals for Objectives Goal is met → Drop objective Goal is met → Drop objective Focus selection on most difficult objectives Focus selection on most difficult objectives Prevents stagnation Prevents stagnation Reshaping fitness landscape helps escape peaks Reshaping fitness landscape helps escape peaks Project scores into lower dimension Project scores into lower dimension