Effect of Information on Collusion Strategies in Single winner, multi-agent games December 2, 2010 Nick Gramsky Ken Knudsen.

Slides:

Advertisements

Similar presentations

Rule 7: Penalty Enforcement. Once you have determined that a foul has been committed, you must determine which of the following situations you are in:

Advertisements

Building Agents for the Lemonade Game Using a Cognitive Hierarchy Population Model Michael Wunder Michael Kaisers Michael Littman John Yaros.

A Whist AI Jason Fong CS261A, Spring What is Whist? Old card game, driven into obscurity by Bridge Similar to other trick taking games Bridge, Spades,

Halves Practice – Quick Play In & Around The Box

Lecture 5 Memory Management Part I. Lecture Highlights  Introduction to Memory Management  What is memory management  Related Problems of Redundancy,

CASPA Comparison and Analysis of Special Pupil Attainment SGA Systems SGA Systems Limited A brief overview of CASPA's graphs and reports.

Modeling Maze Navigation Consider the case of a stationary robot and a mobile robot moving towards a goal in a maze. We can model the utility of sharing.

Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)

An Introduction to... Evolutionary Game Theory

© 2015 McGraw-Hill Education. All rights reserved. Chapter 15 Game Theory.

Evolving Cooperative Strategies in Multi-Agent Systems Using a Coevolutionary Algorithm Cesario C. Julaton III, Ramanathan S. Thinniyam, Una-May O’Reilly.

Table of Contents Why Play Chess? Setting Up the Board Get to Know the Pieces Check and Checkmate What the Chess Pieces Are Worth Opening Goals Endgame.

Module 3 Chess 101 Strategy Strategy refers to an overall plan to achieve a goal In every game you play your goal should be to checkmate your opponent,

Java Risk game Slide 1 The rules of RISK Simon Forey.

CIS 700 Programming & Problem Solving Fall Instruction Staff Instructor: Chris Murphy –PhD Computer Science, Columbia Univ –Seven years professional.

Intro to Game Theory Revisiting the territory we have covered.

CompSci Recursion & Minimax Playing Against the Computer Recursion & the Minimax Algorithm Key to Acing Computer Science If you understand everything,

Online Performance Auditing Using Hot Optimizations Without Getting Burned Jeremy Lau (UCSD, IBM) Matthew Arnold (IBM) Michael Hind (IBM) Brad Calder (UCSD)

A Game Of Strategy … Or Luck? Serene Li Hui Heng Xiaojun Jiang Cheewei Ng Li Xue Alison Then Team 5, MS&E220 Autumn 2008.

Lord of Fries Team: Order of Fries. Team Members Carson Lee - Documentator Daniel McCue - Coder Franchesca Chung - Tester Michael Zhu - Coder James Sheldon.

6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.

Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.

RULES Each player begins the game with twelve normal pieces (either white or black). The pieces are automatically set in their proper positions. The object.

Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.

Introduction What is this ? What is this ? This project is a part of a scientific research in machine learning, whose objective is to develop a system,

Introduction to Game Theory Yale Braunstein Spring 2007.

Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.

1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory.

Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.

GoogolHex CS4701 Final Presentation Anand Bheemarajaiah Chet Mancini Felipe Osterling.

Marcus Gallagher and Mark Ledwich School of Information Technology and Electrical Engineering University of Queensland, Australia Sumaira Saeed Evolving.

Protein Structure Alignment by Incremental Combinatorial Extension (CE) of the Optimal Path Ilya N. Shindyalov, Philip E. Bourne.

Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)

Unit 1.4 Recurrence Relations

Chapter 12 & Module E Decision Theory & Game Theory.

Analysis and Visualization Approaches to Assess UDU Capability Presented at MBSW May 2015 Jeff Hofer, Adam Rauk 1.

Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.

Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.

Key Question Do you know how to play Rock Paper Scissors? Two volunteers to demonstrate.

Learning to Play KardKuro Goals: Have Fun while Practicing Addition and Subtraction. Improve Social Learning Opportunities with Classmates. Become familiar.

Created By: Kevin Jiang, Cullen Wong, Stephen Halter.

Game Playing. Towards Intelligence? Many researchers attacked “intelligent behavior” by looking to strategy games involving deep thought. Many researchers.

Connect Four AI Robert Burns and Brett Crawford. Connect Four  A board with at least six rows and seven columns  Two players: one with red discs and.

TEACHER EFFECTIVENESS INITIATIVE VALUE-ADDED TRAINING Value-Added Research Center (VARC)

Top level learning Pass selection using TPOT-RL. DT receiver choice function DT is trained off-line in artificial situation DT used in a heuristic, hand-coded.

Neural Network Implementation of Poker AI

Coaching Pack 9 – 11 Years. What Am I Coaching Today? What Might the Players Learn or Get Better at? TechnicalPsychological example PhysicalSocial example.

Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.

Sample: Probability “Fair Game” Project (borrowed from Intel®, then adjusted)

Part 3 Linear Programming

Southern Regional Education Board HSTW MMGW The Power of the “I” Teaching and Learning to Standards: Eliminating Zeros and Getting More Students to Complete.

Offensive Strategy BASKETBALL.

CMSC 100 Multi-Agent Game Day Professor Marie desJardins Tuesday, November 20, 2012 Tue 11/20/12 1 Multi-Agent Game Day.

Structures, Strategies and Compositions Lesson 10.

Mix networks with restricted routes PET 2003 Mix Networks with Restricted Routes George Danezis University of Cambridge Computer Laboratory Privacy Enhancing.

Towards a Scalable and Robust DHT Baruch Awerbuch Johns Hopkins University Christian Scheideler Technical University of Munich.

CSC490 – Effect of Internship Experience on Technical Knowledge of Graduating CS Students By Tong Zou.

Modeling Changes in Exploitative vs. Protective Behavior Joseph Blass Motivation and Questions Humans exploit others for selfish reasons Humans also protect.

Understanding AI of 2 Player Games. Motivation Not much experience in AI (first AI project) and no specific interests/passion that I wanted to explore.

Grade Three: Fractions Unit 7 Finding Fair Shares.

Evolutionary Computing Systems Lab (ECSL), University of Nevada, Reno 1 Authors : Siming Liu, Christopher Ballinger, Sushil Louis

Four in a Line activities

Break-Away Strategy Game A turn based strategy game used to simulate Break-Away Brought to you by Team 33.

Key Rule: Each Number has only one valid combination of Prime Factors.

A Sampling of Chess and Chip Games

Checkers Move Prediction Algorithms

Presentation transcript:

Effect of Information on Collusion Strategies in Single winner, multi-agent games December 2, 2010 Nick Gramsky Ken Knudsen

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

Motivation Explicit Collusions Alliances Survival Truces Implicit Collusions Minimax against strongest player Tit-for-tat Reasons to Collude Improve position relative to other agent(s) Self-preservation / Survival

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

Identification Find course grained collusive behavior 1.Offensive-based collusion Multiple agents attacking a single agent for a fixed number of rounds In our examples, we limited this to 1 round. 2. Defensive-based collusion Multiple agents not attacking each other over a fixed number of rounds. In our examples, we limited this to 2 rounds.

Identification Offensive based coalitions

Identification Defensive based coalitions

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

1. Socially inclined behavior For some predefined time, if target satisfies the following, then we define the actions of the attacking players as being 'socially oriented‘ h(x) is a heuristic function for any adversary.  vh(x) when dealing with different layers of fog 2. Else: Some other collusive behavior Classification Offensive based behaviors

Classification Offensive based algorithm

Classification Defensive based algorithm

Classification Missed opportunities Classify a missed opportunity by finding players that: for a predefined period were not attacked above a certain percentage and… satisfy either their power heuristic or visual heuristic (below) threshold

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

Implementation Used Warfish to play games of Risk. Free website warfish.net Risk is a zero-sum game where players seek (simulated) world domination! Only one winner, the last remaining contestant. Attacks are made via dice (random number generator) Amass armies, grow in power, rule the world! Or at least the world represented on a board...

Implementation Environment Reduced resource strategies Randomized players Set card trade-in values to be constant (5) Disabled card capture on elimination Multiple map types Larger than original Risk board Reduces board specific strategies in analysis

Implementation World Map

Implementation Europe Map

Implementation Fog of War Varied amount of information available to all agents via different levels of 'fog of war'. 6 different levels of fog available in game Level 0: No fog (perfect information) Level 1: See all occupations, neighboring units only Level 2: See all occupations (no units) Level 3: Only see neighboring occupations and units Level 4: See only neighboring occupations Level 5: Complete fog (only know about self) Tested with 3 levels of fog {0,1,3}

Implementation Oracles Participants who annotated their strategies and behaviors as games were played Compared oracle annotations to game data Spot-check that analysis found collusion Though noisy, analysis and annotations were inline with game history.

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

Results Collusion vs Game length x-axis: Number of turns y-axis: Number of "interesting" windows θ h = 1.3 per 1 turn window

Results Offensive 1. Players all gang up on Yellow. 2. Validated by Oracle annotations. Game: Map: World Fog Level: 1

Results Offensive 1. Minmax against Blue 2. Confirmed by reading through the transcript. 1. Blue quickly gained power 2. Challenged remaining players to team up against him Game: Map: Europe Fog Level: 0 “Right now (Yellow) knows that if he does not get both you (Red) and (Green) on his side, this game will be won by me”

Results Offensive x-axis: Number of turns y-axis: Number of "interesting" windows θ h = 1.3 / 1 turn window Games (left) and (right)

Results Offensive & Defensive 1. Minimax against strongest player 2. Towards the end of the game, explicit truce between top 2 players Game: Map: Europe Fog Level: 0

Scatter plot of number of windows classified as defensive-oriented for all games. x-axis: number of turns y-axis: number of interesting windows θ = 0.05 *Game: Results Defensive

Results Oracle 1. Oracle self-interest annotations (Blue) Game: Map: World Fog Level: 1 x-axis: Number of turns y-axis: Number of "interesting" windows θ h = 1.3 / 1 turn window

Results Fog Level 3 1. Typical of the layer 3 games. 2. Everything breaks down. Players can’t figure out who is in the lead until it is too late. Game: Map: Europe Fog Level: 3

Results Collusion % is percentage of available windows where remaining players direct more than 75% of attacks towards target. Social % is percentage of available windows with same criteria as above BUT the target satisfies heuristic thresholds from earlier θ h = 1.3 / 1 turn window Target’s residual power 43.3% (4-player) 65% (3 player) θ h = 1.6 / 1 turn window Target’s residual power 53.3% (4-player) 80% (3 player)

Results Europe Map θ h = 1.3 θ h = 1.6

Results World Map θ h = 1.3 θ h = 1.6

Contents 1. Motivation 2. Identification of Collusion 3. Classification of Coalitions 4. Implementation 5. Results 6. Conclusions

Conclusions Presented a basic algorithm to identify and classify collusion Games with unusually large number of collusive behaviors tended to prolong games beyond the average. As fog increased (information decreased), collusive behaviors diminished. Results were consistent across maps. Level 0 data was consistent between our volunteers and the public. Analysis supported by Oracle annotations and in-game conversations.

Conclusions Visual heuristic does not hold well for fog games Based on a knowledge of territories and bonuses Limited data sets Time limitation Short time-frame for project Games averaged 20 days to complete Require more experiments with fog levels Data integrity Games had large variance in player abilities Players were involved in multiple simultaneous games May have forgotten strategy Players may have a predefined disposition towards other players (Social Value Orientation)

Conclusions Future Work Investigate possible equilibrium in collusions versus game length. Lag response for social orientation. Once the strongest player is removed from power, it can take a few rounds for the coalition to change strategies. As information decreases, agents tend to collude less. Why? fairness poor assessment of board Mix socially oriented bots with human players