1 a1a1 A1A1 a2a2 a3a3 A2A2. 2 2.4 Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.

Slides:



Advertisements
Similar presentations
Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.
Advertisements

Introduction to Game Theory
NON - zero sum games.
GAME THEORY.
Two Player Zero-Sum Games
Module 4 Game Theory To accompany Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna Power Point slides created by Jeff Heyl.
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
MS&E 211 Quadratic Programming Ashish Goel. A simple quadratic program Minimize (x 1 ) 2 Subject to: -x 1 + x 2 ≥ 3 -x 1 – x 2 ≥ -2.
Totally Unimodular Matrices
Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.
15 THEORY OF GAMES CHAPTER.
APPENDIX An Alternative View of the Payoff Matrix n Assume total maximum profits of all oligopolists is constant at 200 units. n Alternative policies.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
For any player i, a strategy weakly dominates another strategy if (With at least one S -i that gives a strict inequality) strictly dominates if where.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game theory.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
1. 2 APPENDIX B A PROCEDURE FOR GENERATING AN EQUILIBRIUM POINT FOR 2-PERSON GAMES (That sometimes works!)
Part 3: The Minimax Theorem
Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.
Chapter 14 Game Theory to accompany Operations Research: Applications and Algorithms 4th edition by Wayne L. Winston Copyright (c) 2004 Brooks/Cole, a.
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
Matrix Games Mahesh Arumugam Borzoo Bonakdarpour Ali Ebnenasir CSE 960: Selected Topics in Algorithms and Complexity Instructor: Dr. Torng.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
To accompany Quantitative Analysis for Management, 8e by Render/Stair/Hanna S-1 © 2003 by Prentice Hall, Inc. Upper Saddle River, NJ Supplement 1.
Game Theory.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
The Multiplicative Weights Update Method Based on Arora, Hazan & Kale (2005) Mashor Housh Oded Cats Advanced simulation methods Prof. Rubinstein.
1. 2 Non-Cooperative games Player I Player II I want the maximum payoff to Player I I want the maximum payoff to Player II.
C&O 355 Mathematical Programming Fall 2010 Lecture 4 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A.
Simplex method (algebraic interpretation)
3.4 Linear Programming p Optimization - Finding the minimum or maximum value of some quantity. Linear programming is a form of optimization where.
Chapter 11 Game Theory Math Game Theory What is it? – a way to model conflict and competition – one or more "players" make simultaneous decisions.
Dominance Since Player I is maximizing her security level, she prefers “large” payoffs. If one row is smaller (element- wise) than another,
Chapter 3 Determinants Linear Algebra. Ch03_2 3.1 Introduction to Determinants Definition The determinant of a 2  2 matrix A is denoted |A| and is given.
Chapter 4 Linear Programming: The Simplex Method
Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Speaker: Dr. Michael Schapira Topic: Dynamics in Games (Part III) (Some slides.
When dealing with a model, we use the letter  for the mean. We write or, more often, replacing p by, Instead of , we can also write E(X ). ( Think of.
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
X y x-y · 4 -y-2x · 5 -3x+y · 6 x+y · 3 Given x, for what values of y is (x,y) feasible? Need: y · 3x+6, y · -x+3, y ¸ -2x-5, and y ¸ x-4 Consider the.
Algorithms for solving two-player normal form games
1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.
OR Chapter 7. The Revised Simplex Method  Recall Theorem 3.1, same basis  same dictionary Entire dictionary can be constructed as long as we.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
Hon Wai Leong, NUS (CS6234, Spring 2009) Page 1 Copyright © 2009 by Leong Hon Wai CS6234: Lecture 4  Linear Programming  LP and Simplex Algorithm [PS82]-Ch2.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
Example 31: Solve the following game by equal gains method: Y I II I II I X II II
GAME THEORY Day 5. Minimax and Maximin Step 1. Write down the minimum entry in each row. Which one is the largest? Maximin Step 2. Write down the maximum.
Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, Stefan Kratsch, and Magnus Wahlstrom..
Table of Contents Matrices - Definition and Notation A matrix is a rectangular array of numbers. Consider the following matrix: Matrix B has 3 rows and.
9.2 Mixed Strategy Games In this section, we look at non-strictly determined games. For these type of games the payoff matrix has no saddle points.
Linear Programming Chap 2. The Geometry of LP  In the text, polyhedron is defined as P = { x  R n : Ax  b }. So some of our earlier results should.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
2.5 The Fundamental Theorem of Game Theory For any 2-person zero-sum game there exists a pair (x*,y*) in S  T such that min {x*V. j : j=1,...,n} =
1 Chapter 4 Geometry of Linear Programming  There are strong relationships between the geometrical and algebraic features of LP problems  Convenient.
Game Theory [geym theer-ee] : a mathematical theory that deals with the general features of competitive situations in a formal abstract way.
Mixed Strategies Keep ‘em guessing.
The Duality Theorem Primal P: Maximize
Tools for Decision Analysis: Analysis of Risky Decisions
Chapter 6 Game Theory (Module 4) 1.
Game Theory.
Solutions Sample Games 1
Game Theory Solutions 1 Find the saddle point for the game having the following payoff table. Use the minimax criterion to find the best strategy for.
Lecture 20 Linear Program Duality
Presentation transcript:

1 a1a1 A1A1 a2a2 a3a3 A2A2

2 2.4 Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that the players use the same basic philosophy and principles as before (that is, to find the “safest” strategy that best protect yourself from loosing). However we now assume that the players can mix up the strategies that they use. Let Player I play strategy a i with probability x i and let Player II play strategy A j with probability y j

3 2.4 Mixed Strategies Player II Player I Relative frequency of application

4 Since x 1, x 2, y 1 and y 2 are probabilities x 1 + x 2 = 1 and y 1 + y 2 = 1 One possibility would be for Player I to use a 1 3/4 of the time and a 2 1/4 of the time whilst Player II to uses A 1 1/2 of the time and A 2 1/2 of the time. BUT, WHAT IS THE BEST COMBO ??? Example

Definition A mixed strategy for Player I is a vector x = (x 1,...,x m ) with x i ≥ 0 for all i and  i x i = 1. Similarly, a mixed strategy for Player II is a vector y = (y 1,...,y n ) with y j ≥ 0 for all j and  j y j = 1. A pure strategy is a vector x, where one component is 1 and all other components are 0. eg. (0, 0, 1, 0, 0). So if a person uses a pure strategy they play the same option all the time. (This is what we do when there is a saddle.)

6 Expected Payoff Let E(x,y) denote the expected value of the payoff to Player I given that she uses strategy x and Player II uses strategy y. By definition then, E(x,y) :=  i,j x i y j v ij = xVy (Convention: in xVy, x is a row vector, y is a column vector, and V is a matrix.) If we play the game repeatedly many times Player 1 expects to get E(x, y) on average.

Example No saddle. E(x,y) = xVy = (x 1,x 2 ) V (y 1,y 2 ) = (x 1, x 2 )(y 1 + 5y 2, 6y 1 + 2y 2 ) = x 1 y 1 + 5x 1 y 2 + 6x 2 y 1 + 2x 2 y 2 For x = (0.5, 0.5) we obtain E(x,y) = 3.5(y 1 + y 2 ) = 3.5 for any y, since y 1 + y 2 = 1. Note that this is better than the optimal security level for Player I (equal to 2) !!!! BUT, CAN WE DO BETTER?

8 Similarly, if y = (0.5, 0.5), we have E(x,y) = 3x 1 + 4x 2 < 4x 1 + 4x 2 = 4 (why?) Note that the optimal security level for Player II is equal to 5 (for pure strategies). Thus, this mixed strategy is (on average) superior to any pure strategy that Player II can use.

9 Notation S := Set of feasible mixed strategies for Player I, ie. S:={(x 1,...,x m ): x i ≥ 0,  i x i = 1} T:= Set of feasible mixed strategies for Player II, ie. T:={(y 1,...,y n ): y j ≥ 0,  j y i = 1} Our aim is to choose “the best” of all the elements in S and T.

Definition The security level of Player I associated with strategy x in S is the minimum feasible expected payoff to Player I given that she uses x (and that Player II is doing sensible things, that is, y in T). We denote the security level for Player I s(x), ie. s(x):= min {E(x,y): y in T}. Similarly, for Player II, let  (y) denote the security level associated with strategy y in T, namely  (y):= max{E(x,y): x in S}.

Theorem There exists the following equalities: s(x) = min{xV.j : j=1,2,...,n} and  (y) = max{V i. y : i=1,2,...,m} In words, if Player I is using a given strategy x, Player II can restrict herself to pure strategies !!!! If Player II is using a given strategy y, Player I can restrict himself to pure strategies!!! An LP-based proof (See Lecture Notes for an alternative direct proof. )

12 LP based Proof Since by definition E(x,y) = xVy, we have s(x) = min {xVy: y in T} Let c: = xV, then s(x) = min {cy : y in T} = min {cy: y y n = 1, y j ≥ 0 } This is a LP problem with one functional constraint. Thus, a basic feasible solution is of the form y=(0, 0,..., 0, 1, 0, 0,... 0), which is in fact a pure strategy! Similarly for  (y). Q: What is the value of j for which y j = 1 ? A: The j that has the least c j value!

Definition The optimal security level for Player I is equal to v 1 := max {s(x): x in S} and the optimal security level for Player II is equal to v 2 := min {  (y): y in T} If v 1 = v 2 we call the common quantity the value of the game.

14 Example (Continued) v 1 = max {s(x): x in S} = max { min{xV.j : j = 1,2,...,n}: x in S} (using Theorem 1.4.1) = max {min{xV.1, xV.2 }: x in S} = max {min{x 1 + 6x 2, 5x 1 + 2x 2 } x in S} = max {min{x – 6x 1, 5x – 2x 1 }: 0 ≤ x 1 ≤ 1} (using x 1 + x 2 = 1 ) = max {min{–5x 1 + 6, 3x 1 + 2}: 0 ≤ x 1 ≤ 1}

x x1x1 –5x 1 + 6

–5 x x x1x1 minimum of the two lines

– 5 x x x1x1 maximum of the minimum function here

18 v 1 = max {min{–5x 1 + 6, 3x 1 + 2}: 0 ≤ x 1 ≤ 1}. x* 1 = 1/2; x* 2 = 1 – x* 1 = 1/2; v 1 = 3x* = 7/ –5x x x1x1

19 For Player II v 2 = min{  (y): y in T} = min{ max {V i. y: i = 1,...,m}: y in T} = min { max {y 1 + 5y 2, 6y 1 + 2y 2 }: y in T} = min { max {–4y 1 + 5, 4y 1 + 2}: 0 ≤ y 1 ≤ 1}

–4y y y1y1

–4y y y1y1 Max of two lines

–4y y y1y1 v 2 = min { max {–4y 1 + 5, 4y 1 + 2}:0 ≤ y 1 ≤ 1} y* 1 = 3/8; y* 2 = 1 – y* 1 = 5/8; v 2 =7/2. Min of max function

23 Is this solution stable? Let us see if Player I is happy with x* given that Player II is using y*. For any feasible strategy x for Player I we thus have: xVy* = (7/2)(x 1 + x 2 ) = 7/2 for all x in S. Thus, given that Player II is using y*, Player I will be happy with x*, in fact she will be as happy with any feasible strategy. Convince yourself that Player II is happy with y* given that Player I is using x*.

Definition A strategy pair (x*,y*) in S  T is said to be in equilibrium if xVy* ≤ x*Vy* ≤ x*Vy for all (x,y) in S  T. Fundamental questions: Do we always have such pairs? How do we construct such pairs if they exist?

25 Example Solve the two person zero sum game whose payoff matrix is See lecture for solution.