EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Slides:



Advertisements
Similar presentations
Mixed Strategies.
Advertisements

Game Theory Lecture 4 Game Theory Lecture 4.
Some Problems from Chapt 13
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
Introduction to Game theory Presented by: George Fortetsanakis.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
ECO290E: Game Theory Lecture 5 Mixed Strategy Equilibrium.
1 Chapter 14 – Game Theory 14.1 Nash Equilibrium 14.2 Repeated Prisoners’ Dilemma 14.3 Sequential-Move Games and Strategic Moves.
The basics of Game Theory Understanding strategic behaviour.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
For any player i, a strategy weakly dominates another strategy if (With at least one S -i that gives a strict inequality) strictly dominates if where.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Non-cooperative Games Elon Kohlberg February 2, 2015.
Learning in games Vincent Conitzer
What is a game?. Game: a contest between players with rules to determine a winner. Strategy: a long term plan of action designed to achieve a particular.
Network Theory and Dynamic Systems Game Theory: Mixed Strategies
Part 3: The Minimax Theorem
Working Some Problems. Telephone Game How about xexed strategies? Let Winnie call with probability p and wait with probability 1-p. For what values of.
An Introduction to Game Theory Part I: Strategic Games
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
Nash Equilibrium: Theory. Strategic or Simultaneous-move Games Definition: A simultaneous-move game consists of: A set of players For each player, a set.
EC941 - Game Theory Prof. Francesco Squintani Lecture 8 1.
Eponine Lupo.  Game Theory is a mathematical theory that deals with models of conflict and cooperation.  It is a precise and logical description of.
EC102: Class 9 Christina Ammon.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Intro to Game Theory Revisiting the territory we have covered.
Game Theory Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents. Been used to study –Company.
An Introduction to Game Theory Part IV: Games with Imperfect Information Bernhard Nebel.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
Game Theory Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents. Been used to study –Market.
Nash Equilibrium - definition A mixed-strategy profile σ * is a Nash equilibrium (NE) if for every player i we have u i (σ * i, σ * -i ) ≥ u i (s i, σ.
Game Applications Chapter 29. Nash Equilibrium In any Nash equilibrium (NE) each player chooses a “best” response to the choices made by all of the other.
DANSS Colloquium By Prof. Danny Dolev Presented by Rica Gonen
EC941 - Game Theory Francesco Squintani Lecture 3 1.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
EC941 - Game Theory Prof. Francesco Squintani Lecture 2 1.
CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
Social Choice Session 7 Carmen Pasca and John Hey.
© 2009 Institute of Information Management National Chiao Tung University Lecture Note II-3 Static Games of Incomplete Information Static Bayesian Game.
ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.
Learning in Multiagent systems
Nash equilibrium Nash equilibrium is defined in terms of strategies, not payoffs Every player is best responding simultaneously (everyone optimizes) This.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
Introduction to Game Theory application to networks Joy Ghosh CSE 716, 25 th April, 2003.
THE “CLASSIC” 2 x 2 SIMULTANEOUS CHOICE GAMES Topic #4.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
QR 38, 3/1/07 and 3/8/07 Mixed strategies I.Keeping the opponent indifferent II.Finding a mixed strategy equilibrium III.Odds ratios IV.Mixing in practice.
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.
Mixed Strategies and Repeated Games
Strategic Behavior in Business and Econ Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies.
Vincent Conitzer CPS Learning in games Vincent Conitzer
5.1.Static Games of Incomplete Information
1 The Volunteer’s Dilemma (Mixed Strategies). 2 The Volunteer Dilemma Game Simultaneously and independently, players have to decide if they wish to volunteer.
Topics to be Discussed Gaming and Strategic Decisions
Games of pure conflict two-person constant sum games.
Lec 23 Chapter 28 Game Theory.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Working Some Problems.
Games of pure conflict two person constant sum
LECTURE 2 MIXED STRATEGY GAME
Presentation transcript:

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium Reading Osborne Chapter 4.1 to 4.10 By the end of this week you should be able to: find a mixed strategy Nash Equilibrium of a game explain why mixed strategies can be important in applications

Example: Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 Matching pennies does not have a Nash equilibrium (in the game with ordinal preferences), i.e. there is no steady state where all players 1 choose the same action and all players 2 choose the same action and nobody wants to change

Example: Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 Matching pennies does not have a Nash equilibrium (in the game with ordinal preferences), i.e. there is no steady state where all players 1 choose the same action and all players 2 choose the same action and nobody wants to change But: we can assume that if a player faces getting 1 with probability p and -1 with probability 1-p, then the higher p the better

Example: Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 Matching pennies does not have a Nash equilibrium (in the game with ordinal preferences), i.e. there is no steady state where all players 1 choose the same action and all players 2 choose the same action and nobody wants to change But: we can assume that if a player faces getting 1 with probability p and -1 with probability 1-p, then the higher p the better Now if player 2 chooses H with probability ½, then both H and T yield the same lottery to player 1 (namely getting 1 with probability ½ and -1 with ½)

Example: Matching Pennies Thus no matter which strategy player 1 chooses, she has no incentive to change it The same holds for player 2: if player 1 chooses H with probability ½ and T with probability ½, both H and T yield the same lottery for him Thus player 2 has no incentive to change his strategy Therefore, if both players choose H with probability ½ and T with probability ½, no one has an incentive to change Thus we have a steady state This can be interpreted either as a state where each player randomizes with equal probability between H and T or as game between populations where half of the players choose H and the other half choose T

Generalization: Preliminaries Ordinal payoff functions are generally not sufficient in order to express preferences over lotteries For example in this case we need to know how much worse is -2 than -1 compared to the difference of 1 and -1 Player 2 Head Tail Player 1 1,-1 -1,1 -2,2 Note: In matching pennies ordinal preferences are enough (together with the simple assumption that getting 1 with a higher probability is better) because for both actions the possible outcomes are either 1 or -1, so that we can rank lotteries by whichever gives the higher probability to 1

Generalization: Preliminaries Ordinal payoff functions are generally not sufficient in order to express preferences over lotteries For example in this case we need to know how much worse is -2 than -1 compared to the difference of 1 and -1 Player 2 Head Tail Player 1 1,-1 -1,1 -2,2 Thus we need a cardinal payoff function, where differences have meaning von Neumann – Morgenstern (vNM) preferences: Preferences over lotteries represented by the expected value of a payoff function over the deterministic outcomes (“Bernoulli payoff function”) Note: In matching pennies ordinal preferences are enough (together with the simple assumption that getting 1 with a higher probability is better) because for both actions the possible outcomes are either 1 or -1, so that we can rank lotteries by whichever gives the higher probability to 1

Definitions A strategic game (with vNM preferences) consists of a set of players for each player, a set of actions for each player, preferences regarding lotteries over action profiles that may be represented by the expected value of a (“Bernoulli”) payoff function over action profiles Definition: A mixed strategy of a player is a probability distribution over the player’s actions can denote this by vector of probabilities (p1,…, pn) if Ai is a set of n actions {a1,…, an} Let Δ(Ai) be the set of mixed strategies over Ai Pure strategy: mixed strategy that puts probability 1 on a single action, i.e. a deterministic action

Mixed-strategy Nash Equilibrium Let α* be a mixed strategy profile in a strategic game with vNM preferences Then α* is a (mixed-strategy) Nash equilibrium if for every player i and every mixed strategy αiΔ(Ai): Ui(α*)  Ui (αi, α-i*) where Ui(α) is i’s expected payoff given α Proposition: every game with vNM preferences with each player having finitely many available actions has a (mixed-strategy) equilibrium Note: this does not mean that there is no equilibrium if players have infinitely many actions. There just might be no equilibrium

Best Response Functions Best response functions are defined as for deterministic actions Best response function of i: Bi(α-i) = {αi: Ui(αi,α-i ) ≥ Ui(αi',α-i ) for all mixed strategies αi'Δ(Ai)} Set-valued, each member of Bi(α-i) is a best response to α-i Again, we get: Proposition: α* is a Nash equilibrium if and only if αi*  Bi(α-i*) for every i

Best Response Functions for Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 p: Probability that 1 chooses Head q: Probability that 2 chooses Head Note: if a player has only two available actions, then one probability alone completely characterizes his mixed strategy, so we can write the equilibrium just as a profile of probabilities, one per player

Best Response Functions for Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 p: Probability that 1 chooses Head q: Probability that 2 chooses Head B1(q) = 1 if q >1/2; = 0 if q <1/2; =[0,1] if q =1/2 B2(p) = 0 if p >1/2; = 1 if p <1/2; =[0,1] if p =1/2 Note: if a player has only two available actions, then one probability alone completely characterizes his mixed strategy, so we can write the equilibrium just as a profile of probabilities, one per player

Best Response Functions for Matching Pennies Player 2 Head Tail Player 1 1,-1 -1,1 p: Probability that 1 chooses Head q: Probability that 2 chooses Head B1(q) = 1 if q >1/2; = 0 if q <1/2; =[0,1] if q =1/2 B2(p) = 0 if p >1/2; = 1 if p <1/2; =[0,1] if p =1/2 Equilibrium: (p, q) = (1/2,1/2) Note: if a player has only two available actions, then one probability alone completely characterizes his mixed strategy, so we can write the equilibrium just as a profile of probabilities, one per player

Best response function for Battle of the Sexes Player 2 Ball Theatre Player 1 2,1 0,0 1,2 p: Probability that 1 chooses Ball q: Probability that 2 chooses Ball

Best response function for Battle of the Sexes Player 2 Ball Theatre Player 1 2,1 0,0 1,2 p: Probability that 1 chooses Ball q: Probability that 2 chooses Ball B1(q) = 1 if q >1/3; = 0 if q <1/3; =[0,1] if q =1/3 B2(p) = 1 if p >2/3; = 0 if p <2/3; =[0,1] if p =2/3

Best response function for Battle of the Sexes Player 2 Ball Theatre Player 1 2,1 0,0 1,2 p: Probability that 1 chooses Ball q: Probability that 2 chooses Ball B1(q) = 1 if q >1/3; = 0 if q <1/3; =[0,1] if q =1/3 B2(p) = 1 if p >2/3; = 0 if p <2/3; =[0,1] if p =2/3 Equilibria: (p, q) = (1,1) = (Ball, Ball), (p, q) = (0,0) = (Theatre, Theatre) (p, q) = (2/3,1/3)

Pure-Strategy Equilibria survive allowing for mixing In battle of the sexes, the 2 equilibria identified in the game with ordinal preferences survive if we interpret the game as having vNM preferences and allow for mixing This is always true and vice versa for pure-strategy equilibria

Pure-Strategy Equilibria survive allowing for mixing Proposition: any equilibrium of a game with ordinal preference is a pure-strategy equilibrium of the corresponding game with vNM preferences and any pure-strategy equilibrium of a game with vNM preferences is an equilibrium of the corresponding game with ordinal preferences So all equilibria of the games we encountered so far stay equilibria if we allow for mixing and interpret the payoff functions as Bernoulli payoff functions

Characterization of mixed-strategy equilibria Proposition: let α* be a mixed-strategy equilibrium. Then each action ai that is played by i with positive probability according to αi* yields the same expected payoff to i as strategy αi* every action ai' that is played by i with probability 0 according to αi* yields at most the same expected payoff to i as strategy αi* This is useful for finding mixed-strategy equilibria: each player has to be indifferent between all the strategies she plays so for a given game and each player i, we can consider subsets of strategies and see whether we find solutions for the other players’ mixed strategies, such that i is indifferent among all actions in the subset

Player 2 L R T 2,3 5,0 Player 1 M 3,2 1,4 B 1,5 4,1 No equilibrium where either player plays a pure strategy (easy)

Player 2 L R T 2,3 5,0 Player 1 M 3,2 1,4 B 1,5 4,1 No equilibrium where either player plays a pure strategy (easy) Is there one where player 1 chooses T and B, M and B or all three actions with positive probability? No: T strictly dominates B, so whatever player 2 does, 1 can increase expected payoff by playing T instead of B (so b = Pr(B) = 0)

Player 2 L R T 2,3 5,0 Player 1 M 3,2 1,4 B 1,5 4,1 No equilibrium where either player plays a pure strategy (easy) Is there one where player 1 chooses T and B, M and B or all three actions with positive probability? No: T strictly dominates B, so whatever player 2 does, 1 can increase expected payoff by playing T instead of B (so b = Pr(B) = 0) That leaves 1 choosing T and M. This requires for l=Pr(L) 2l + 5(1 – l) = 3l + 1(1 – l), or 5 – 3l = 1 + 2l or l = 4/5 mixing of 2 between L and R requires for t=Pr(T) 3t + 2(1 – t) = 4(1 – t), or 2 + t = 4 – 4t or t = 2/5

Player 2 L R T 2,3 5,0 Player 1 M 3,2 1,4 B 1,5 4,1 No equilibrium where either player plays a pure strategy (easy) Is there one where player 1 chooses T and B, M and B or all three actions with positive probability? No: T strictly dominates B, so whatever player 2 does, 1 can increase expected payoff by playing T instead of B (so b = Pr(B) = 0) That leaves 1 choosing T and M. This requires for l=Pr(L) 2l + 5(1 – l) = 3l + 1(1 – l), or 5 – 3l = 1 + 2l or l = 4/5 mixing of 2 between L and R requires for t=Pr(T) 3t + 2(1 – t) = 4(1 – t), or 2 + t = 4 – 4t or t = 2/5 Equilibrium: ((t,m,b),(l,1 – l)) = ((2/5,3/5,0),(4/5,1/5))

Dominated strategies An action can be dominated by a mixed strategy: The mixed strategy αi strictly dominates action ai' if for all profiles a-i of the other players’ actions the expected payoff to αi is strictly larger than the payoff to ai' , i.e. Ui(αi,a-i ) > ui(ai',a-i ) for all a-i ai' is then strictly dominated

Dominated strategies An action can be dominated by a mixed strategy: The mixed strategy αi strictly dominates action ai' if for all profiles a-i of the other players’ actions the expected payoff to αi is strictly larger than the payoff to ai' , i.e. Ui(αi,a-i ) > ui(ai',a-i ) for all a-i ai' is then strictly dominated Example: T is strictly dominated by a mixed strategy that plays M and B both with probability ½. Player 2 L R T 1,3 1,0 Player 1 M 4,2 0,4 B 0,5 3,1

Symmetric games Definitions of symmetric games and symmetric equilibria carry over in obvious way We had seen that symmetric game with ordinal preferences does not necessarily have a symmetric equilibrium But a symmetric game with vNM preferences and a finite set of actions always has at least one symmetric equilibrium (pure or mixed).

Application: Preparing questions for a seminar n students, all identical, decide independently benefit b > 0 to each if at least one is prepared, 0 otherwise cost c with 0 < c < b of preparing the question There are n asymmetric equilibria where exactly one is prepared Symmetric equilibrium? Has to be mixed. p: probability to prepare need U(prepare) = U(not prepare) b – c = 0 Pr(0 others prepare) + b Pr(>0 others prepare) b – c = b (1 – Pr(0 others prepare)) c/b = Pr(0 others prepare) = (1 – p)n-1 p = 1 – (c/b)1/(n-1) Note: p is decreasing in n. But also probability that at least one is prepared is decreasing in n, because probability that no one is prepared is (1 – p) c/b, which is increasing in n. This is essentially identical to the reporting crime issue

More on mixed-strategy equilibria It appears a bit odd that players choose their strategies in order to keep the others indifferent. It is not claimed that players intentionally do this, but is just an equilibrium requirement However, think of repetition of games that have matching pennies structure (e.g. goalie vs. penalty kicker): the players actually would like to keep the other indifferent. Why?

More on mixed-strategy equilibria How could players arrive at their beliefs? Deleting dominated strategies Best-response dynamics (play best response to previous action of opponent) Learning in repeated games This illustrates that there are stable mixed equilibria (e.g. in matching pennies) and unstable ones (e.g. in BoS) concerning learning and stable/unstable equilibria: if players through repeated play learn the average behavior of the population, this will eventually converge to an equilibrium. But in matching pennies, if the equilibrium is disrupted (e.g. through immigration) we get back after a while, whereas in BoS, we would get to one of the pure equilibria.

Problem set #04 NOTE: I expect that you have tried to solve the exercises before the seminar Osborne, Ex 106.2 Osborne, Ex 114.2 (Osborne, Ex 114.3) (Osborne, Ex 118.2) Osborne, Ex 121.2 Osborne, Ex 141.1 Osborne, Ex 142.1 Show the following: a 2x2 game with two pure strict Nash equilibria always has a mixed-strategy equilibrium that is not a pure strategy equilibrium.