Mixed Strategies For Managers

Slides:

Advertisements

Similar presentations

Crime, Punishment, and Forgiveness

Advertisements

Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.

Stackelberg -leader/follower game 2 firms choose quantities sequentially (1) chooses its output; then (2) chooses it output; then the market clears This.

Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.

Simultaneous- Move Games with Mixed Strategies Zero-sum Games.

Game Theory “I Used to Think I Was Indecisive - But Now I’m Not So Sure” - Anonymous Mike Shor Lecture 5.

Chapter 10 Game Theory and Strategic Behavior

1 Chapter 14 – Game Theory 14.1 Nash Equilibrium 14.2 Repeated Prisoners’ Dilemma 14.3 Sequential-Move Games and Strategic Moves.

Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.

1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang.

Eponine Lupo.  Questions from last time  3 player games  Games larger than 2x2—rock, paper, scissors  Review/explain Nash Equilibrium  Nash Equilibrium.

For any player i, a strategy weakly dominates another strategy if (With at least one S -i that gives a strict inequality) strictly dominates if where.

MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

Game Theory Advertising Example 1. Game Theory What is the optimal strategy for Firm A if Firm B chooses to advertise? 2.

OLIGOPOLY A market structure in which there are few firms, each of which is large relative to the total industry. Key idea: Decision of firms are interdependent.

Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)

ECO290E: Game Theory Lecture 4 Applications in Industrial Organization.

Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.

A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.

Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.

Todd and Steven Divide the Estate Problem Bargaining over 100 pounds of gold Round 1: Todd makes offer of Division. Steven accepts or rejects. Round.

Yale 9&10 Mixed Strategies in Theory and Tennis. Overview As I randomize the strategies, the expected payoff is a weighted average of the pure strategies.

Games of pure conflict two person constant sum. Two-person constant sum game Sometimes called zero-sum game. The sum of the players’ payoffs is the same,

Review: Game theory Dominant strategy Nash equilibrium

Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.

Lectures in Microeconomics-Charles W. Upton Game Theory.

© 2008 Pearson Addison Wesley. All rights reserved Chapter Fourteen Game Theory.

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/19 3/12.

6.1 Consider a simultaneous game in which player A chooses one of two actions (Up or Down), and B chooses one of two actions (Left or Right). The game.

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.

ECON 1001 Tutorial 10.

Game Theoretic Analysis of Oligopoly lr L R 0000 L R 1 22 The Lane Selection Game Rational Play is indicated by the black arrows.

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/23 3/9.

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.

UNIT III: COMPETITIVE STRATEGY

UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.

© 2009 Institute of Information Management National Chiao Tung University Lecture Notes II-2 Dynamic Games of Complete Information Extensive Form Representation.

Game Theory “I used to think I was indecisive – but now I’m not so sure.” - Anonymous Topic 4 Mixed Strategies.

Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer

Exam Questions. Fred and Elmer No Price War Price War.

CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

1 Applications Here we look at several applications. We will see a classic example of a dilemma that can arise in such games.

1 1 BA 210 Lesson III.5 Strategic Uncertainty when Interests ConflictOverviewOverview.

Games of Strategy (Game Theory) Topic 1 – Part IV.

Games People Play. 4. Mixed strategies In this section we shall learn How to not lose a game when it appears your opponent has a counter to all your moves.

Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -

Game theory & Linear Programming Steve Gu Mar 28, 2008.

Chapter 12 - Imperfect Competition: A Game-Theoretic Approach Copyright © 2015 The McGraw-Hill Companies, Inc. All rights reserved.

The Science of Networks 6.1 Today’s topics Game Theory Normal-form games Dominating strategies Nash equilibria Acknowledgements Vincent Conitzer, Michael.

3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:

Empirical Aspects of Plurality Elections David R. M. Thompson, Omer Lev, Kevin Leyton-Brown & Jeffrey S. Rosenschein COMSOC 2012 Kraków, Poland.

1 The Volunteer’s Dilemma (Mixed Strategies). 2 The Volunteer Dilemma Game Simultaneously and independently, players have to decide if they wish to volunteer.

Lec 23 Chapter 28 Game Theory.

Entry Deterrence Players Two firms, entrant and incumbent Order of play Entrant decides to enter or stay out. If entrant enters, incumbent decides to fight.

By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.

Arguments for Recovering Cooperation Conclusions that some have drawn from analysis of prisoner’s dilemma: – the game theory notion of rational action.

Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.

Mixed Strategies Keep ‘em guessing.

Microeconomics Course E

Teoria dei giochi e Oligopolio

Simultaneous-Move Games: Mixed Strategies

Chapter 12 - Imperfect Competition: A Game-Theoretic Approach

Managerial Economics Kyle Anderson

Game Theory Fall Mike Shor Topic 3.

LECTURE 2 MIXED STRATEGY GAME

Strategic Decision Making in Oligopoly Markets

Game Theory and Strategic Play

Molly W. Dahl Georgetown University Econ 101 – Spring 2009

Lecture Game Theory.

Presentation transcript:

Mixed Strategies For Managers

Overview Dominant and dominated strategies Dominant strategy equilibrium Prisoners’ dilemma Nash equilibrium in pure strategies Games with multiple Nash equilibria Equilibrium selection Games with no pure strategy Nash equilibria Mixed strategy Nash equilibrium

Outline Games with no pure strategy Nash equilibrium Mixed Strategies What is the idea? How do we compute them? Mixed strategies in practice Examples Evidence from football penalty kicks Minimax strategies in zero-sum games

costly auditing (waste) Mixed strategies are strategies that involve randomization. Example: filing taxes Fiscal Authority Audit Don’t audit pays low taxes gets punished pays low taxes Cheat Fiscal Authorithy low tax revenue costly auditing Taxpayer pays high taxes Don’t cheat pays high taxes costly auditing (waste) high tax revenue

Best response of the taxpayer Fiscal Authority Audit Don’t audit pays low taxes gets punished pays low taxes Cheat Fiscal Authority low tax revenue costly auditing Taxpayer pays high taxes Don’t cheat pays high taxes costly auditing (waste) high tax revenue

Best response of the Fiscal Authorithy Fiscal Authority Audit Don’t audit pays low taxes gets punished pays low taxes Cheat Fiscal Authority low tax revenue costly auditing Taxpayer pays high taxes Don’t cheat pays high taxes costly auditing (waste) high tax revenue

Best responses do not coincide: No Nash equilibrium in pure strategies Fiscal Authority Audit Don’t audit pays low taxes gets punished pays low taxes Cheat Fiscal Authority low tax revenue costly auditing Taxpayer pays high taxes Don’t cheat pays high taxes costly auditing (waste) high tax revenue

Example: to work or not to work… Players Employee Work Shirk Manager Monitor Do not monitor

Payoffs The employee Salary: $100K unless caught shirking Cost of effort: $50K The manager Value of the employee output: $200K Profit if the employee doesn’t work: $0 Cost of monitoring: $10K

The payoff matrix Manager Employee Monitor No monitor Monitor Work Employee Shirk

The payoff matrix Manager Employee Monitor No monitor 50 , 90 50 , 100 0 , -10 100 , -100 Manager Monitor No Monitor Work Employee Shirk

Best response of the employee Monitor No monitor 50 , 90 50 , 100 0 , -10 100 , -100 Manager Monitor No Monitor Work Employee Shirk

Best response of the manager Monitor No monitor 50 , 90 50 , 100 0 , -10 100 , -100 Manager Monitor No Monitor Work Employee Shirk

No Nash equilibrium in pure strategies Monitor No monitor 50 , 90 50 , 100 0 , -10 100 , -100 Manager Monitor No Monitor Work Employee Shirk

Mixed Strategies (1) What is the idea? (2) How do we compute mixed strategies?

The Idea Mixed Strategies The idea is to prevent the other player to anticipate my strategy. Randomizing “just right” takes away any ability to be taken advantage of. Just right: Making other player indifferent to her strategies.

Computing mixed strategies Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 q 1q p 1p Suppose that: The employee chooses to work with probability p (and shirk with 1p) The manager chooses to monitor with probability q (and no monitor with 1q)

The manager’s perspective: how can I avoid shirking? Mixed Strategies Calculate the employee’s expected payoff. Find out his best response to each possible strategy of the manager.

1. Expected payoff of the employee Mixed Strategies Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 q 1q Expected payoff from working: Expected payoff from shirking: (50 x q) + (50 x (1q))= 50 (0 x q) + (100 x (1q))= 100100q

2. The employee’s best response Mixed Strategies What is the employee’s best response for all possible strategies of the manager? The manager’s possible strategies: q=0, q=0.1, …, q=0.5, ..., q=1 Technically, q[0,1]

2. The employee’s best response Expected payoff from working: 50 Expected payoff from shirking:100100q Recap: E. P. working > E.P. of shirking 50 > 100 – 100q if q >1/2 E. P. working < E.P. of shirking 50 < 100 – 100q if q <1/2 E. P. working = E.P. of shirking if q =1/2

2. The employee’s best response Mixed Strategies Best response to all q >1/2 : Work Best response to all q <1/2 : Shirk Best response to q=1/2 : Work or Shirk (i.e., the employee is indifferent) If you want to keep the employee from shirking, you should set q >1/2 (i.e., monitor more than half of the time).

Not done yet… Mixed Strategies All this was from the Manager’s perspective; she wants to determine the best q to induce the Employee not to shirk. To do so, she tried to figure out how the employee would respond to different q. Now look at things from the Employee’s perspective. The employee will also try to determine the best p.

The employee’s perspective: follow the same steps Mixed Strategies Calculate the manager’s expected payoff. Find out her best response to each possible strategy of the employee.

1. Expected payoff of the manager Mixed Strategies Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 p 1p Expected payoff from monitoring: Expected payoff from not monitoring: (90 x p) + (-10 x (1p))= 100p 10 (100 x p) + (-100 x (1p))= 200p100

2. The manager’s best response Mixed Strategies What is the manager’s best response for all possible strategies of the employee? The employee’s possible strategies: p=0, p=0.1, …, p=0.5, ..., p=1 Technically, p[0,1]

2. The manager’s best response Expected payoff from monitoring: 100p 10 Expected payoff from not monitoring:200p100 Recap: E. P. of monitoring > E.P. of no monitoring 100p-10 > 200p – 100 if p <9/10 E. P. of monitoring < E.P. of no monitoring if p >9/10 E. P. of monitoring = E.P. of no monitoring if p =9/10

2. The manager’s best response Mixed Strategies Best response to all p <9/10: Monitor Best response to all p >9/10: No monitor Best response to p=9/10 : Monitor or No Monitor (i.e., the manager is indifferent) If you want keep the manager from monitoring, you should set p > 9/10 (work “most of the time”).

Nash equilibrium in mixed strategies The employer works with probability 9/10 and shirks with probability 1/10. The manager monitors with probability ½ and does not monitor with probability ½.

Nash equilibrium in mixed strategies 1 p Probability of working Can this be an equilibrium? 1/3 1/4 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 p What is the employee’s best response to q =1/4? Probability of working Shirk! 1/3 ( Shirk if q <1/2 ) 1/4 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 p Probability of working Can this be an equilibrium? 1/4 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 p Probability of working What is the manager’s best response to p =0 (shirk)? Monitor! ( Monitor if p <9/10 ) 1/4 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 p Probability of working Can this be an equilibrium? 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 shirk work p Probability of working 1/2 1 q Probability of monitoring

Nash equilibrium in mixed strategies 1 no monitor 9/10 p Probability of working monitor 1 q Probability of monitoring

Nash equilibrium in mixed strategies The employee is Indifferent between “work” and “shirk” 1 The manager is Indifferent between “monitor” and “no monitor” no monitor 9/10 Unique N.E. in mixed strategies shirk work p Probability of working monitor 1/2 1 q Probability of monitoring

Equilibrium Payoffs: the employee Mixed Strategies Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 1/2 1/2 9/10 1/10 Expected payoff from working: (50 x ½ ) + (50 x ½ ) = 50 Expected payoff from shirking: (0 x ½ ) + (100 x ½ ) = 50 Gets (50 x 9/10) + (50 x 1/10) = 50

Equilibrium Payoffs: the manager Mixed Strategies Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 1/2 1/2 9/10 1/10 Expected payoff from monitoring: (90 x 9/10 ) + (-10 x 1/10) = 80 Expected payoff from no monitoring: (100 x 9/10 ) + (-100 x 1/10 ) = 80 Gets (80 x 1/2) + (80 x 1/2) = 80

What if cost of monitoring was 50 (instead of 10)? Mixed Strategies Initial Payoff Matrix Manager Monitor No monitor Employee Work 50 , 90 50 , 100 Shirk 0 , -10 100 , -100 New Payoff Matrix Manager Monitor No monitor Employee Work 50 , . . . 50 , 100 Shirk 0 , . . . 100 , -100 50 -50

A change in the manager’s payoffs Mixed Strategies New Payoff Matrix Manager Monitor No monitor Employee Work 50 , 50 50 , 100 Shirk 0 , -50 100 , -100 Which player’s equilibrium strategy will change? The employee’s equilibrium strategy: “Work with probability ½ and shirk with probability ½” (As opposed to “work with probability 9/10 …” with a less expensive monitoring technology)

Properties of mixed strategy equilibria Mixed Strategies A player chooses his strategy so as to make his rival indifferent. As a player, you want to prevent others from exploiting any systematic behavior of yours. A player earns the same expected payoff for each pure strategy chosen with positive probability. When a player’s own payoff from a pure strategy changes (e.g., more costly monitoring), his mixture does not change but his opponent’s does.