Presenter: Robert Holte. 2 Helping the world understand … and make informed decisions. * * Potential beneficiaries: commercial games companies, and their.

Slides:



Advertisements
Similar presentations
METAGAMER: An Agent for Learning and Planning in General Games Barney Pell NASA Ames Research Center.
Advertisements

Adversarial Search Chapter 6 Section 1 – 4. Types of Games.
Data Mining in Computer Games By Adib Adam Hussain & Mohammed Sarfraz.
Introduction to Game Theory
Evaluation Through Conflict Martin Zinkevich Yahoo! Inc.
Artificial Intelligence in Real Time Strategy Games Dan Li.
Randomized Strategies and Temporal Difference Learning in Poker Michael Oder April 4, 2002 Advisor: Dr. David Mutchler.
Machine Learning in Computer Games Learning in Computer Games By: Marc Ponsen.
Half life 2/ Counter Strike: Source bot Charlie Cross CIS
Short introduction to game theory 1. 2  Decision Theory = Probability theory + Utility Theory (deals with chance) (deals with outcomes)  Fundamental.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
INFORMS 2006, Pittsburgh, November 8, 2006 © 2006 M. A. Zinkevich, AICML 1 Games, Optimization, and Online Algorithms Martin Zinkevich University of Alberta.
This time: Outline Game playing The minimax algorithm
Presenter: Michael Bowling. 2 Helping the world understand data and make informed decisions Potential beneficiaries: Growing robotics and UVS sector,
Poker for Fun and Profit (and intellectual challenge) Robert Holte Computing Science Dept. University of Alberta.
Intelligence for Games and Puzzles1 Poker: Opponent Modelling Early AI work on poker used simplified.
AWESOME: A General Multiagent Learning Algorithm that Converges in Self- Play and Learns a Best Response Against Stationary Opponents Vincent Conitzer.
1 Game Theory Here we study a method for thinking about oligopoly situations. As we consider some terminology, we will see the simultaneous move, one shot.
What is AI  An attempt to imitate human reactions by scripting reactions to happen when a certain cause is brought about.
How computers play games with you CS161, Spring ‘03 Nathan Sturtevant.
Presenter: Dale Schuurmans. 2 Helping the world understand data and make informed decisions Core technology research: Underlies prediction and control,
Better automated abstraction techniques for imperfect information games, with application to Texas Hold’em poker * Andrew Gilpin and Tuomas Sandholm, CMU,
Monte Carlo Go Has a Way to Go Haruhiro Yoshimoto (*1) Kazuki Yoshizoe (*1) Tomoyuki Kaneko (*1) Akihiro Kishimoto (*2) Kenjiro Taura (*1) (*1)University.
PE Easter School Higher – Int2Thursday 14 th April Intermediate 1Tuesday 6 th April.
Models of Strategic Deficiency and Poker Workflow Inference: What to do with One Example and no Semantics.
Reinforcement Learning in the Presence of Hidden States Andrew Howard Andrew Arnold {ah679
The Parameterized Poker Squares EAAI NSG Challenge
Chapter 9 Games with Imperfect Information Bayesian Games.
9/14/20151 Game Theory and Game Balance CIS 487/587 Bruce R. Maxim UM-Dearborn.
1 A unified approach to comparative statics puzzles in experiments Armin Schmutzler University of Zurich, CEPR, ENCORE.
Chapter 12 Choices Involving Strategy Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written.
Game Playing.
Introduction Many decision making problems in real life
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Fall 2006CS4455 Prototyping Maribeth Gandy Jeff Wilson
Game-playing AIs Part 1 CIS 391 Fall CSE Intro to AI 2 Games: Outline of Unit Part I (this set of slides)  Motivation  Game Trees  Evaluation.
Strategic Decisions in Noncooperative Games Introduction to Game Theory.
SARTRE: System Overview A Case-Based Agent for Two-Player Texas Hold'em Jonathan Rubin & Ian Watson University of Auckland Game AI Group
AI and Computer Games (informational session) Lecture by: Dustin Dannenhauer Professor Héctor Muñoz-Avila Computer Science and Eng.
Memory and Analogy in Game-Playing Agents Jonathan Rubin & Ian Watson University of Auckland Game AI Group
1 Adversarial Search CS 171/271 (Chapter 6) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Extensive Games with Imperfect Information
Higher Grade / Intermediate 2 Skills & Techniques.
Poker as a Testbed for Machine Intelligence Research By Darse Billings, Dennis Papp, Jonathan Schaeffer, Duane Szafron Presented By:- Debraj Manna Gada.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Evolving Reactive NPCs for the Real-Time Simulation Game.
Today’s Topics Playing Deterministic (no Dice, etc) Games –Mini-max –  -  pruning –ML and games? 1997: Computer Chess Player (IBM’s Deep Blue) Beat Human.
Lecture 12. Game theory So far we discussed: roulette and blackjack Roulette: – Outcomes completely independent and random – Very little strategy (even.
Models of Competition Part III: Imperfect Competition
Pay for Performance A strategic approach to design Dermot Hand August 2012.
Intermediate Microeconomics Game Theory. So far we have only studied situations that were not “strategic”. The optimal behavior of any given individual.
ARTIFICIAL INTELLIGENCE (CS 461D) Princess Nora University Faculty of Computer & Information Systems.
Game Development Essentials An Introduction
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
Naïve Bayes Classifier April 25 th, Classification Methods (1) Manual classification Used by Yahoo!, Looksmart, about.com, ODP Very accurate when.
Oligopoly and Game Theory Topic Students should be able to: Use simple game theory to illustrate the interdependence that exists in oligopolistic.
OPPONENT EXPLOITATION Tuomas Sandholm. Traditionally two approaches to tackling games Game theory approach (abstraction+equilibrium finding) –Safe in.
Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.
Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.
Artificial Intelligence AIMA §5: Adversarial Search
Lecture 13.
Stochastic tree search and stochastic games
Joint work with Sam Ganzfried
Extensive-form games and how to solve them
Noam Brown and Tuomas Sandholm Computer Science Department
Strategies for Poker AI player
Lecture 12.
Econ 805 Advanced Micro Theory 1
Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.
Adversarial Search CS 171/271 (Chapter 6)
Presentation transcript:

Presenter: Robert Holte

2 Helping the world understand … and make informed decisions. * * Potential beneficiaries: commercial games companies, and their customers. games and the people who play them *

3 Multi-billion dollar industry, with considerable Canadian activity U. of A. has one of the best AI & Games research groups in the world Games are good testbeds for A.I. research Machine learning has a key role to play: Opponent/user modelling Massive datasets (e.g. play logs) Challenging problems for machine learning Opponent modelling: very short time frame, weak data Massive datasets: large number of low-level features Active learning opportunities Human element in the overall system

4 1. Gameplay Analysis (ongoing) 2. Poker (ongoing, poster) 3. Counter-strike Log Analysis (new, poster) 4. Go (ongoing, poster) 5. General Game Playing (new) 6. Threat Modelling (complete, poster)

5 AICML PI ’ s: M. Bowling, R. Holte, J. Schaeffer 8 Software developers 3 Postdoctoral Fellows 14 Grad students

6 Electronic Arts BioWare BioTools 3 UofA CS profs

7 Grants $490K over 3 years, NSERC strategic grant $10k/year BioWare gift Portion of Jonathan Schaeffer ’ s iCORE chair In-kind Neverwinter Nights source code (BioWare) FIFA ’ 2004 source code (EA) with our gameplay analysis hooks installed at their expense BioTools support of competitions we organize

8 IJCAI ’ 03 best paper award Winner of AAAI ’ 06 poker-bot competitions, competitive with top human players World ’ s first man-versus-machine poker match Currently world ’ s best 9x9 Go program, competitive with very good humans (Scientific American article) Electronic Arts interest in gameplay analysis GDC paper HQP to EA, BioWare, BioTools, Invidi, Google, Yahoo!

Technical Details

10 Large game tree (10 18 ) Stochastic element Variable number of players (2 – 10) Imperfect information (during play, and after) Aim is to maximize winnings not just win The last two make it essential to discover and exploit the opponent ’ s weaknesses

11 Rule-based ( “ expert system ” ) – Loki Search-based – Poki Game-theoretic – PsOpti and others Opponent modelling Vexbot PDF cutting Parameter Estimation (Bayesian) Strategy Value estimation ( “ experts ” )

12 Nash Equilibrium of an abstract poker game Bluffing, slow play, etc. fall out from the mathematics. Best paper award at IJCAI ’ 03 Won the AAAI ’ 06 poker-bot competitions Has held its own against 2 world-class humans

13 DIVAT: an unbiased, low variance estimator of winnings

14 The equilibrium strategy for the highly abstract game is far from perfect. No opponent modelling. Nash equilibrium not the best strategy: Non-adaptive Defensive Even the best humans have weaknesses that should be exploited

15 Short time to learn and exploit model (< 200 hands). Want to simultaneously: Collect information about the opponent Use the information to get higher payoff Not “ pay ” too much for the information Not be exploitable ourselves Imperfect information, even after hand finishes High variance chance in the game (the shuffled deck) stochastic opponent strategies Properties of the opponent … (next slide)

16 We assume a “ smart ” opponent – it has exploitable weaknesses but does not make outright errors plays a non-equilibrium strategy does not play a dominated strategy Opponent ’ s strategy is non-stationary changes during the game may be modelling me to exploit my weaknesses

17 In Kuhn poker against exploitable, stationary opponents … Convergence to best-response is slow. Opponent modelling is superior to a static Nash equilibrium strategy. often produces positive expected value robust to game length (50-400) and opponent type Bad initial estimates of P2 ’ s parameters overcome in hands. “ Aggressive ” exploration strategies slightly superior to “ safe ” exploration strategies.

18 Improved Algorithms for Information- Gathering and Modelling Scaling up Non-stationary Opponents Other poker variants: no-limit, multi- player

Introduction

20 How to test if game software behaves as intended by the designer ?

21

22 Corner kicks to the coloured areas score. This was discovered by our SAGA-ML system.

23 Machine Learning rules behaviour control Sampling