Mohsen Afsharchi Multiagent Interaction. What are Multiagent Systems?

Slides:



Advertisements
Similar presentations
Concepts of Game Theory I
Advertisements

Concepts of Game Theory II. 2 The prisioners reasoning… Put yourself in the place of prisoner i (or j)… Reason as follows: –Suppose I cooperate… If j.
The Basics of Game Theory
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
SIF8072 Distributed Artificial Intelligence and Intelligent Agents
Game theory (Sections )
Multiagent Interactions
Game Theory “Доверяй, Но Проверяй” - Russian Proverb (Trust, but Verify) - Ronald Reagan Mike Shor Lecture 6.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
The basics of Game Theory Understanding strategic behaviour.
1 Game Theory. By the end of this section, you should be able to…. ► In a simultaneous game played only once, find and define:  the Nash equilibrium.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
An Introduction to... Evolutionary Game Theory
 1. Introduction to game theory and its solutions.  2. Relate Cryptography with game theory problem by introducing an example.  3. Open questions and.
Coye Cheshire & Andrew Fiore March 21, 2012 // Computer-Mediated Communication Collective Action and CMC: Game Theory Approaches and Applications.
Multi-player, non-zero-sum games
Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.
Economics 202: Intermediate Microeconomic Theory 1.HW #6 on website. Due Thursday. 2.No new reading for Thursday, should be done with Ch 8, up to page.
Prisoner’s Dilemma. The scenario In the Prisoner’s Dilemma, you and Lucifer are picked up by the police and interrogated in separate cells without the.
An Introduction to Game Theory Part I: Strategic Games
Game Theory Part 5: Nash’s Theorem.
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
5/16/20151 Game Theory Game theory was developed by John Von Neumann and Oscar Morgenstern in Economists! One of the fundamental principles of.
Chapter 11 What is Utility? A way of representing preferences Utility is not money (but it is a useful analogy) Typical relationship between utility &
GAME THEORY By Ben Cutting & Rohit Venkat. Game Theory: General Definition  Mathematical decision making tool  Used to analyze a competitive situation.
Slide 1 of 13 So... What’s Game Theory? Game theory refers to a branch of applied math that deals with the strategic interactions between various ‘agents’,
Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.
Review: Game theory Dominant strategy Nash equilibrium
Static Games of Complete Information: Equilibrium Concepts
APEC 8205: Applied Game Theory Fall 2007
Design of Multi-Agent Systems Teacher Bart Verheij Student assistants Albert Hankel Elske van der Vaart Web site
An introduction to game theory Today: The fundamentals of game theory, including Nash equilibrium.
QR 38, 2/22/07 Strategic form: dominant strategies I.Strategic form II.Finding Nash equilibria III.Strategic form games in IR.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Today: Some classic games in game theory
Chapter 12 Choices Involving Strategy Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written.
A Game-Theoretic Approach to Strategic Behavior. Chapter Outline ©2015 McGraw-Hill Education. All Rights Reserved. 2 The Prisoner’s Dilemma: An Introduction.
Nash equilibrium Nash equilibrium is defined in terms of strategies, not payoffs Every player is best responding simultaneously (everyone optimizes) This.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
THE “CLASSIC” 2 x 2 SIMULTANEOUS CHOICE GAMES Topic #4.
LECTURE 6: MULTIAGENT INTERACTIONS
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.
Section 2 – Ec1818 Jeremy Barofsky
Mixed Strategies and Repeated Games
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Normal Form Games, Normal Form Games, Rationality and Iterated Rationality and Iterated Deletion of Dominated Strategies Deletion of Dominated Strategies.
Game theory (Sections )
1 Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. Franz J. Kurfess CPE/CSC 580: Intelligent Agents 1.
Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.
Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.
Game Theory By Ben Cutting & Rohit Venkat.
Game theory Chapter 28 and 29
LECTURE 6: MULTIAGENT INTERACTIONS
Chapter 15: Game Theory: The Mathematics Lesson Plan of Competition
LECTURE 6: MULTIAGENT INTERACTIONS
Introduction to Game Theory
Game theory Chapter 28 and 29
Decision Theory and Game Theory
Multiagent Systems Game Theory © Manfred Huber 2018.
LECTURE 6: MULTIAGENT INTERACTIONS
Chapter 15: Game Theory: The Mathematics Lesson Plan of Competition
Presentation transcript:

Mohsen Afsharchi Multiagent Interaction

What are Multiagent Systems?

We are…  Self-interested agents  It does not mean that:  they want to harm other agents  they only care about things that benefit them  It means that  the agent has its own description of states of the world that it likes, and that its actions are motivated by this description

Utilities and Preferences  Assume we have just two agents: Ag = {i, j}  Agents are assumed to be self-interested: they have preferences over how the environment is  Assume  {  1  2  …} is the set of “outcomes” that agents have preferences over  We capture preferences by utility functions: u i =   R u j =   R  Utility functions lead to preference orderings over outcomes:   ’ means u i (  ) > u i (  ’)

What is Utility?  Utility is not money (but it is a useful analogy)  Typical relationship between utility & money:

Multiagent Encounters  We need a model of the environment in which these agents will act…  agents simultaneously choose an action to perform, and as a result of the actions they select, an outcome in  will result  the actual outcome depends on the combination of actions  assume each agent has just two possible actions that it can perform, C (“cooperate”) and D (“defect”)  Environment behavior given by state transformer function:

Multiagent Encounters  Here is a state transformer function: (This environment is sensitive to actions of both agents.)  Here is another: (Neither agent has any influence in this environment.)  And here is another: (This environment is controlled by j.)

Rational Action  Suppose we have the case where both agents can influence the outcome, and they have utility functions as follows:  With a bit of abuse of notation:  Then agent i’s preferences are:  “C” is the rational choice for i. (Because i prefers all outcomes that arise through C over all outcomes that arise through D.)

Payoff Matrices  We can characterize the previous scenario in a payoff matrix:  Do we need to engage in strategic thinking?  Pay-off matrix in fact defines a game (technically strategic game)

Game Theory…  The study of games!  Bluffing in poker  What move to make in chess  How to play Rock-Scissors-Paper  Also  Also study of auction design, strategic deterrence, election laws, coaching decisions, routing protocols,...

Game theory…  Game theory is a formal way to analyze interactions among a group of rational agents who behave strategically.  Group: Must have more than one decision maker  Otherwise you have a decision problem, not a game  Interaction: What one agent does directly affects at least one other agent  Strategic: Agents take into account that their actions influence the game  Rational: An agent chooses its best action (maximizes its expected utility)

Game in Normal Form

Game in Normal Form…

Strategy

Strategy…

Solution Concepts

Dominant Strategies  Given any particular strategy (either C or D) of agent i, there will be a number of possible outcomes  We say s 1 dominates s 2 if every outcome possible by i playing s 1 is preferred over every outcome possible by i playing s 2  A rational agent will never play a dominated strategy  So in deciding what to do, we can delete dominated strategies  Unfortunately, there isn’t always a unique undominated strategy

Dominant Strategies…  Iterated elimination of strictly dominated actions (IESDA) is a solution technique that iteratively eliminates strictly dominated actions from all agents, until no more actions are strictly dominated.

Nash Equilibrium  In general, we will say that two strategies s 1 and s 2 are in Nash equilibrium if: 1. under the assumption that agent i plays s 1, agent j can do no better than play s 2 ; and 2. under the assumption that agent j plays s 2, agent i can do no better than play s 1.  Neither agent has any incentive to deviate from a Nash equilibrium  Unfortunately: 1. Not every interaction scenario has a Nash equilibrium 2. Some interaction scenarios have more than one Nash equilibrium

Nash Equilibrium There is no dominant strategy. Moreover there is no Nash Equilibrium. Whichever cell you choose, at least one of you would have preferred to make another choice, assuming the other player did not changed their choice.

Mixed Strategy  Introducing Randomness…  If player has k possible choices, s 1, s 2,…, s k then the mixed strategy over these choices takes the form  Play s 1 with probability p 1  Play s 2 with probability p 2  …  Play s k with probability p k  Where p 1 + p 2 +…+ p k =1  Every game in which every player has a finite set of possible strategies has a Nash equilibrium in mixed strategies

Matching Pennies game  Each of two agents chooses either Head or Tail. If the choices differ, agent 1 pays agent 2 a cent; if they are the same, agent 2 pays agent 1 a cent.  There is no (pure strategy) Nash equilibrium in this game. If we play this game, we should be “unpredictable.”  That is, we should randomize (or mix) between strategies so that we do not get exploited.

Matching Pennies game…  But not any randomness will do: Suppose Player 1 plays.75 Heads and.25 Tails (that is, Heads with 75% chance and Tails with 25% chance). Then Player 2 by choosing Tails (with 100% chance) can get an expected payoff of 0.75× ×(-1) =0.5. But that cannot happen at equilibrium since Player 1 then wants to play Tails (with 100% chance) deviating from the original mixed strategy.  Since this game is completely symmetric it is easy to see that at mixed strategy Nash equilibrium both players will choose Heads with 50% chance and Tails with 50% chance.  In this case the expected payoff to both players is 0.5× ×(-1) = 0 and neither can do better by deviating to another strategy.

Pareto Efficiency  Is not really a solution concept.  An outcome is Pareto efficient if there is no other outcome that improves one player’s utility without making somebody else worse off.  Battle of the sexes

Maximizing Social Welfare  We measure how much utility is created by an outcome in total. SW(  ) Becomes relevant if all the agents within a system have the same owner

Competitive and Zero-Sum Interactions  Where preferences of agents are diametrically opposed we have strictly competitive scenarios  Zero-sum encounters are those where utilities sum to zero: u i (  ) + u j (  ) = 0 for all    Zero sum implies strictly competitive  Zero sum encounters in real life are very rare … but people tend to act in many scenarios as if they were zero sum

The Prisoner’s Dilemma  Two men are collectively charged with a crime and held in separate cells, with no way of meeting or communicating. They are told that:  if one confesses and the other does not, the confessor will be freed, and the other will be jailed for three years  if both confess, then each will be jailed for two years  Both prisoners know that if neither confesses, then they will each be jailed for one year

The Prisoner’s Dilemma  Payoff matrix for prisoner’s dilemma:  Top left: If both defect, then both get punishment for mutual defection  Top right: If i cooperates and j defects, i gets sucker’s payoff of 1, while j gets 4  Bottom left: If j cooperates and i defects, j gets sucker’s payoff of 1, while i gets 4  Bottom right: Reward for mutual cooperation

The Prisoner’s Dilemma  The individual rational action is defect This guarantees a payoff of no worse than 2, whereas cooperating guarantees a payoff of at most 1  So defection is the best response to all possible strategies: both agents defect, and get payoff = 2  But intuition says this is not the best outcome: Surely they should both cooperate and each get payoff of 3!  We have better choice (?) and it is not rational. Why?

The Prisoner’s Dilemma  This apparent paradox is the fundamental problem of multi- agent interactions. It appears to imply that cooperation will not occur in societies of self-interested agents.  Real world examples:  nuclear arms reduction (“why don’t I keep mine... ”)  free rider systems — public transport;  The prisoner’s dilemma is ubiquitous.  Can we recover cooperation?

Arguments for Recovering Cooperation  Conclusions that some have drawn from this analysis:  the game theory notion of rational action is wrong!  somehow the dilemma is being formulated wrongly  Arguments to recover cooperation:  We are not all Machiavelli! and not all games are PD  Yes but there are always punishments !!!  The other prisoner is my twin!  Then I am damn fool to behave any other way (Because it is not a game)  The shadow of the future…

The Iterated Prisoner’s Dilemma  One answer: play the game more than once  If you know you will be meeting your opponent again, then the incentive to defect appears to evaporate  Cooperation is the rational choice in the infinititely repeated prisoner’s dilemma (Hurrah!)

Backwards Induction  But…suppose you both know that you will play the game exactly n times  On round n - 1, you have an incentive to defect, to gain that extra bit of payoff…  But this makes round n – 2 the last “real”, and so you have an incentive to defect there, too.  This is the backwards induction problem.  Playing the prisoner’s dilemma with a fixed, finite, pre- determined, commonly known number of rounds, defection is the best strategy

Axelrod’s Tournament  Suppose you play iterated prisoner’s dilemma against a range of opponents… What strategy should you choose, so as to maximize your overall payoff?  Axelrod (1984) investigated this problem, with a computer tournament for programs playing the prisoner’s dilemma

Strategies in Axelrod’s Tournament  ALLD:  “Always defect” — the hawk strategy;  TIT-FOR-TAT: 1. On round u = 0, cooperate 2. On round u > 0, do what your opponent did on round u – 1  TESTER:  On 1st round, defect. If the opponent retaliated, then play TIT- FOR-TAT. Otherwise intersperse cooperation and defection.  JOSS:  As TIT-FOR-TAT, except periodically defect

Recipes for Success in Axelrod’s Tournament  Axelrod suggests the following rules for succeeding in his tournament:  Don’t be envious: Don’t play as if it were zero sum!  Be nice: Start by cooperating, and reciprocate cooperation  Retaliate appropriately: Always punish defection immediately, but use “measured” force — don’t overdo it  Don’t hold grudges: Always reciprocate cooperation immediately

Game of Chicken  The game of chicken: (Swerving = coop, Driving straight = defect.)  Difference to prisoner’s dilemma: Mutual defection is most feared outcome. (Whereas sucker’s payoff is most feared in prisoner’s dilemma.)  Strategies (C,S) and (S,C) are in Nash equilibrium

Other Symmetric 2 x 2 Games  Given the 4 possible outcomes of (symmetric) cooperate/defect games, there are 24 possible orderings on outcomes  CC š i CD š i DC š i DD Cooperation dominates  DC š i DD š i CC š i CD Deadlock. You will always do best by defecting  DC š i CC š i DD š i CD Prisoner’s dilemma  DC š i CC š i CD š i DD Chicken  CC š i DC š i DD š i CD Stag hunt