Download presentation
Presentation is loading. Please wait.
1
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review Midterm3/19 2/20
2
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem
3
The Essentials of a Game 1. Players: We require at least 2 players (Players choose actions and receive payoffs.) 2. Actions: Player i chooses from a finite set of actions, S = {s 1,s 2,…..,s n }. Player j chooses from a finite set of actions T = {t 1,t 2,……,t m }. 3. Payoffs: We define P i (s,t) as the payoff to player i, if Player i chooses s and player j chooses t. We require that P i (s,t) + P j (s,t) = 0 for all combinations of s and t. 4. Information: What players know (believe) when choosing actions. ZERO-SUM
4
The Essentials of a Game 4. Information: What players know (believe) when choosing actions. Perfect Information: Players know their own payoffs other player(s) payoffs the history of the game, including other(s) current action* *Actions are sequential (e.g., chess, tic-tac-toe). Common Knowledge
5
The Essentials of a Game 4. Information: What players know (believe) when choosing actions. Complete Information: Players know their own payoffs other player(s) payoffs the history of the game, excluding other(s) current action* *Actions are simultaneous (e.g., matrix games). Common Knowledge
6
Extensive Game Player 1 chooses a = {1, 2 or 3} Player 2 b = {1 or 2} Player 1 c = {1, 2 or 3} Payoffs = a 2 + b 2 + c 2 if /4 leaves remainder of 0 or 1. -(a 2 + b 2 + c 2 ) if /4 leaves remainder of 2 or 3. Player1’s decision nodes -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. “Square the Diagonal” (Rapoport: 48-9) Player 2’s decision nodes 1 3 2 1 2 1 2 3
7
Extensive Game How should the game be played? Solution: a set of “advisable” strategies, one for each player. Strategy: a complete plan of action for every possible decision node of the game, including nodes that could only be reached by a mistake at an earlier node. Player1‘s advisable Strategy in red -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 3 2 1 2 1 2 3 Start at the final decision nodes (in red) Backwards-induction
8
Extensive Game How should the game be played? Solution: a set of “advisable” strategies, one for each player. Strategy: a complete plan of action for every possible decision node of the game, including nodes that could only be reached by a mistake at an earlier node. Player1‘s advisable Strategy in red -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. Player2’s advisable strategy in green 1 3 2 1 2 1 2 3 Player1’s advisable strategy in red
9
Extensive Game How should the game be played? If both player’s choose their advisable (prudent) strategies, Player1 will start with 2, Player2 will choose 1, then Player1 will choose 2. The outcome will be 9 for Player1 (-9 for Player2). If a player makes a mistake, or deviates, her payoff will be less. -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 3 2 1 2 1 2 3
10
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 2 1 1 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2
11
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2 1
12
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2 1
13
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2 2
14
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2, 1, 1 2, 1, 2 2, 2, 1 2, 2, 2 1 1 2
15
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 2 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2, 1, 1 2, 1, 2 2, 2, 1 2, 2, 2 2 1
16
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 2 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2, 1, 1 2, 1, 2 2, 2, 1 2, 2, 2 2 1
17
Extensive Game Enumerating Strategies: In the extensive form game with perfect information (Game 1), Player 2 has 8 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 2 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 2, 1, 1 2, 1, 2 2, 2, 1 2, 2, 2 2 2
18
Extensive Game A Clarification: Rapoport (pp. 49-53) claims Player 1 has 27 strategies. However, if we consider inconsistent strategies, the actual number of strategies available to Player 1 is 3 7 = 2187. An inconsistent strategy includes actions at decision nodes that would not be reached by correct implementation at earlier nodes, i.e., could only be reached by mistake. Since we can think of a strategy as a set of instructions (or program) given to an agent or referee (or machine) to implement, a complete strategy must include instructions for what to do after a mistake is made. This greatly expands the number of strategies available, though the essence of Rapoport’s analysis is correct.
19
Extensive Game Complete Information: Players know their own payoffs; other player(s) payoffs; history of the game excluding other(s) current action* *Actions are simultaneous -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 3 2 1 2 1 2 3 Information Sets
20
Extensive Game In the extensive form game with complete information, Player 2 has only 2 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 1 1 1 1, 1, 1 1, 1, 2 1, 2, 1 1, 2, 2 T 1 (Always choose 1)
21
Extensive Game In the extensive form game with complete information, Player 2 has only 2 strategies: -3 -6 -11 -6 9 -14 -6 9 -14 9 12 17 -11 -14 -19 -14 17 -22 GAME 1. 2 2, 1, 1 2, 1, 2 2, 2, 1 2, 2, 2 2 2 … and T 2 (Always choose 2)
22
Matrix Game T1T2 T1T2 Also called “Normal Form” or “Strategic Game” Solution = {S 22, T 1 } S 11 S 12 S 13 S 21 S 22 S 23 S 31 S 32 S 33
23
Dominant Strategies Dominant Strategy: a strategy that is best no matter what the opponent(s) choose(s). T 1 T 2 T 3 -3 0 -10 -1 5 2 -2 -4 0 -3 0 1 -1 5 2 -2 2 0 S1S2S3S1S2S3 S1S2S3S1S2S3
24
Dominant Strategies Dominant Strategy: a strategy that is best no matter what the opponent(s) choose(s). T 1 T 2 T 3 Sure Thing Principle: If you have a dominant strategy, use it! -3 0 -10 -1 5 2 -2 -4 0 -3 0 1 -1 5 2 -2 2 0 S1S2S3S1S2S3 S1S2S3S1S2S3
25
Prudent Strategies T 1 T 2 T 3 -3 1 -20 -1 5 2 -2 -4 15 S1S2S3S1S2S3 Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i. Also called, Maximin Strategy
26
Prudent Strategies T 1 T 2 T 3 Player 1’s worst payoffs for each strategy are in red. -3 1 -20 -1 5 2 -2 -4 15 S1S2S3S1S2S3 Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i.
27
Prudent Strategies T 1 T 2 T 3 Player 2’s worst payoffs for each strategy are in green. -3 1 -20 -1 5 2 -2 -4 15 S1S2S3S1S2S3 Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i.
28
Prudent Strategies T 1 T 2 T 3 -3 1 -20 -1 5 2 -2 -4 15 S1S2S3S1S2S3 Prudent Strategy: A prudent strategy for player i maximizes the minimum payoff she can get from playing different strategies. Such a strategy is simply max s min t P(s,t) for player i. Saddlepoint: A set of prudent strategies (one for each player), s. t. (s’, t’) is a saddlepoint, iff maxmin = minmax. We call the solution {S 2, T 1 } a saddlepoint
29
Prudent Strategies -3 1 -20 -1 5 2 -2 -4 15 Saddlepoint: A set of prudent strategies (one for each player), s. t. (s’, t’) is a saddlepoint, iff maxmin = minmax.
30
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button (Rapoport: 65-73.) Player 1 hides a button in his Left or Right hand. Player 2 observes Player 1’s choice and then picks either Left or Right. Draw the game in matrix form.
31
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button Player 1 has 2 strategies: L or R. -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
32
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button Player 2 has 4 strategies: -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
33
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button Player 2 has 4 strategies: -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
34
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button Player 2 has 4 strategies: -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
35
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button Player 2 has 4 strategies: -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
36
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button The game can be solve by backwards-induction. Player 2 will … -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
37
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 GAME 2: Button-Button The game can be solve by backwards-induction. … therefore, Player 1 will: -2 4 -2 4 2 -1 -1 2 LRLR LL RR LR RL
38
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 What would happen if Player 2 cannot observe Player 1’s choice? GAME 2: Button-Button
39
Mixed Strategies Left Right L R L R -2 4 2 -1 Player 1 Player 2 -2 4 2 -1 L R L R GAME 2: Button-Button
40
Solving the Zero-sum Game GAME 2. -2 4 2 -1 Mixed Strategy: A mixed strategy for player i is a probability distribution over all strategies available to player i. Let (p, 1-p) = prob. Player I chooses L, R. (q, 1-q) = prob. Player 2 chooses L, R. L R LRLR
41
Solving the Zero-sum Game GAME 2. -2 4 2 -1 Then Player 1’s expected payoffs are: EP 1 (L|p) = -2(p) + 2(1-p) = 2 – 4p EP 1 (R|p) = 4(p) – 1(1-p) = 5p – 1 L R LRLR (p) (1-p) (q) (1-q) 01 p EP p*=1/3 2 4 -2 EP 1 (L|p) = 2 – 4p EP 1 (R|p) = 5p – 1
42
Solving the Zero-sum Game GAME 2. -2 4 2 -1 Player 2’s expected payoffs are: EP 2 (L|q) = 2(q) – 4(1-q) = 6q – 4 EP 2 (R|q) = -2(q) + 1(1-q) = -3q + 1 EP 2 (L|q) = EP 2 (R|q) => q* = 5/9 L R LRLR (p) (1-p) (q) (1-q)
43
Solving the Zero-sum Game Player 1 EP 1 (L|p) = -2(p) + 2(1-p) = 2 – 4p EP 1 (R|p) = 4(p) – 1(1-p) = 5p – 1 0 p 1 q -EP 2 p*=1/3 2 4 -2 EP 1 -4 2 -2 2 q*= 5/9 Player 2 EP 2 (L|q) = 2(q) – 4(1-q) = 6q – 4 EP 2 (R|q) = -2(q) + 1(1-q) = -3q + 1 2/3 = EP 1 * = - EP 2 * =-2/3 This is the Value of the game.
44
Solving the Zero-sum Game GAME 3. -2 4 2 -1 Then Player 1’s expected payoffs are: EP(T 1 ) = -2(p) + 2(1-p) EP(T 2 ) = 4(p) – 1(1-p) EP(T 1 ) = EP(T 2 ) => p* = 1/3 And Player 2’s expected payoffs are: (V)alue = 2/3 L R LRLR (p) (1-p) (q) (1-q) (Security) Value: the expected payoff when both (all) players play prudent strategies. Any deviation by an opponent leads to an equal or greater payoff.
45
The Minimax Theorem Von Neumann (1928) Every zero sum game has a saddlepoint (in pure or mixed strategies), s.t., there exists a unique value, i.e., an outcome of the game where maxmin = minmax.
46
Nonzero-sum Games Examples: Bargaining Duopoly International Trade
47
Nonzero-sum Games The Essentials of a Game Eliminating Dominated Strategies Best Response Nash Equilibrium Duopoly: An Application Solving the Game Existence of Nash Equilibrium Properties and Problems See: Gibbons, Game Theory for Applied Economists (1992): 1-51.
48
The Essentials of a Game 1. Players: We require at least 2 players (Players choose actions and receive payoffs.) 2. Actions: Player i chooses from a finite set of actions, S = {s 1,s 2,…..,s n }. Player j chooses from a finite set of actions T = {t 1,t 2,……,t m }. 3A. Payoffs: We define P i (s,t) as the payoff to player i, if Player i chooses s and player j chooses t. We allow that P i (s,t) + P j (s,t) = 0. 4. Information: What players know (believe) when choosing actions. NONZERO-SUM
49
Eliminating Dominated Strategies 1,0 1,20,1 0,3 0,12,0 1,0 1,2 0,3 0,1 1,0 1,2 L M R TBTB R is strictly dominated by M, so the game can be reduced to Now, B is strictly dominated by T... TBTB T (T, M)
50
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 4,4 S1S2S3S1S2S3 T 1 T 2 T 3 A dominated strategy will never be played by a rational player.
51
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 4,4 S1S2S3S1S2S3 T 1 T 2 T 3 A strategy can be dominated by a mixture of other strategies … T 3 is dominated by a mixture of T 1 and T 2.
52
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 4,4 S1S2S3S1S2S3 T 1 T 2 T 3 A strategy can be dominated by a mixture of other strategies … T 3 is dominated by a mixture of T 1 and T 2.
53
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 4,4 S1S2S3S1S2S3 T 1 T 2 T 3 A strategy can be dominated by a mixture of other strategies … Now S 3 is dominated by a mixture of S 1 and S 2.
54
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 4,4 S1S2S3S1S2S3 T 1 T 2 T 3 A strategy can be dominated by a mixture of other strategies … Now S 3 is dominated by a mixture of S 1 and S 2.
55
Eliminating Dominated Strategies Definition Best Response Strategy: a strategy, s’, is a best response strategy iff P i (s’,t) > P i (s,t) for all s. -3 0 -10 -1 5 2 -2 -4 0 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 6,6 S1S2S3S1S2S3 S1S2S3S1S2S3 Does either player have a dominant strategy? A dominated strategy? T 1 T 2 T 3
56
Best Response Best Response Strategy: a strategy, s*, is a best response strategy, iff the payoff to (s*,t) is at least as great as the payoff to (s,t) for all s. -3 0 -10 -1 5 2 -2 -4 0 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 6,6 S1S2S3S1S2S3 S1S2S3S1S2S3 T 1 T 2 T 3
57
Best Response Best Response Strategy: a strategy, s*, is a best response strategy, iff the payoff to (s*,t) is at least as great as the payoff to (s,t) for all s. -3 0 -10 -1 5 2 -2 -4 0 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 6,6 S1S2S3S1S2S3 S1S2S3S1S2S3 T 1 T 2 T 3
58
Nash Equilibrium Best Response Strategy: a strategy, s*, is a best response strategy, iff the payoff to (s*,t) is at least as great as the payoff to (s,t) for all s. -3 0 -10 -1 5 2 -2 -4 0 0,4 4,0 5,3 4,0 0,4 5,3 3,5 3,5 6,6 S1S2S3S1S2S3 S1S2S3S1S2S3 T 1 T 2 T 3 Nash Equilibrium: a set of best response strategies (one for each player), (s*, t*) such that s* is a best response to t* and t* is a b.r. to s*. (S 3,T 3 )
59
Nash Equilibrium -3 0 -10 -1 5 2 -2 -4 0 4,4 2,3 1,5 3,2 1,1 0,0 5,1 0,0 3,3 S1S2S3S1S2S3 S1S2S3S1S2S3 Nash equilibrium need not be efficient. T 1 T 2 T 3
60
Nash Equilibrium -3 0 -10 -1 5 2 -2 -4 0 1,1 0,0 0,0 0,0 1,1 0,0 0,0 0,0 1,1 S1S2S3S1S2S3 S1S2S3S1S2S3 Nash equilibrium need not be unique. A COORDINATION PROBLEM T 1 T 2 T 3
61
Nash Equilibrium -3 0 -10 -1 5 2 -2 -4 0 1,1 0,0 0,0 0,0 1,1 0,0 0,0 0,0 1,1 S1S2S3S1S2S3 S1S2S3S1S2S3 Nash equilibrium need not be unique. What is the effect of repeated play? T 1 T 2 T 3
62
Nash Equilibrium -3 0 -10 -1 5 2 -2 -4 0 1,1 0,0 0,0 0,0 1,1 0,0 0,0 0,0 3,3 S1S2S3S1S2S3 S1S2S3S1S2S3 Inefficient and multiple Nash equilibrium. T 1 T 2 T 3
63
Duopoly Consider a market in which two identical firms can produce a good with marginal cost = $2 per unit (assume no fixed cost). Market demand can be described by: P(rice) = 8 – Q(uantity) Where Q is total industry output (Q = q 1 + q 2 )
64
Duopoly Consider a market in which two identical firms can produce a good with marginal cost = $2 per unit (assume no fixed cost). For each firm, Profit ( ) = Total Revenue – Total Cost = Pq – 2q Each firm will choose a level of output q i, to maximize its profit, taking into account what it expects the other firm to produce.
65
Duopoly Consider a market in which two identical firms can produce a good with marginal cost = $2 per unit (assume no fixed cost). For each firm, Profit ( ) = Total Revenue – Total Cost = Pq – 2q Each firm will choose a level of output q, to maximize its profit, taking into account what it expects the other firm to produce.
66
Duopoly Each firm’s profit maximizing level of output, q*, is a function of the other firm’s output. q13 q13 6 q 2 P = 8 - Q Demand Condition 1 = Total Revenue – Total Costq 1 - 2q 1 = 6q 1 - q 1 2 - q 2 q 1 FOC: 6 - 2q 1 - q 2 = 0 q q 1 * = 3 – 1/2q 2
67
Duopoly Each firm’s profit maximizing level of output, q*, is a function of the other firm’s output. q13 q13 6 q 2 P = 8 - (q 1 +q 2 ) 1 = Pq 1 - 2q 1 = [8 - (q 1 +q 2 )]q 1 - 2q 1 = 6q 1 - q 1 2 - q 2 q 1 = 6 - 2q 1 - q 2 = 0 q q 1 * = 3 – 1/2q 2
68
Duopoly Each firm’s profit maximizing level of output, q*, is a function of the other firm’s output. q13 q13 6 q 2 P = 8 - (q 1 +q 2 ) 1 = Pq 1 - 2q 1 = [8 - (q 1 +q 2 )]q 1 - 2q 1 = 6q 1 - q 1 2 - q 2 q 1 = 6 - 2q 1 - q 2 = 0 q q 1 * = 3 – 1/2q 2 d1d1 dq1dq1 FOC:
69
Duopoly A Nash Equilibrium is a pair of “best responses,” such that q 1 * is a best response to q 2 * and q 2 * is a best response to q 1 *. q 1 q 1 * = 2 q 2 * = 2q 2 q 2 * = 3 - 1/2q 1 q 1 * = 3 - 1/2q 2
70
Duopoly A Nash Equilibrium is a pair of “best responses,” such that q 1 * is a best response to q 2 * and q 2 * is a best response to q 1 *. q1q1* q1q1* q 2 * q 2 Is this the best they can do?
71
Next Time 2/27Nash Eq: problems & properties Schelling, Strategy of Conflict: 81-172.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.