1 15.053 Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

3.6 Support Vector Machines
Solving Linear Programming Problems
1 Chapter 14 Making the most of things: Linear Programming.
Advanced Piloting Cruise Plot.
© 2008 Pearson Addison Wesley. All rights reserved Chapter Seven Costs.
Chapter 1 The Study of Body Function Image PowerPoint
Cognitive Radio Communications and Networks: Principles and Practice By A. M. Wyglinski, M. Nekovee, Y. T. Hou (Elsevier, December 2009) 1 Chapter 12 Cross-Layer.
February 21, 2002 Simplex Method Continued
Thursday, April 11 Some more applications of integer
Tuesday, May 7 Integer Programming Formulations Handouts: Lecture Notes.
February 7, 2002 A brief review of Linear Algebra Linear Programming Models Handouts: Lecture Notes.
Thursday, March 14 Introduction to Network Flows
Tuesday, March 5 Duality – The art of obtaining bounds – weak and strong duality Handouts: Lecture Notes.
February 14, 2002 Putting Linear Programs into standard form
Summary of Convergence Tests for Series and Solved Problems
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
1 LP, extended maxflow, TRW OR: How to understand Vladimirs most recent work Ramin Zabih Cornell University.
1 Outline relationship among topics secrets LP with upper bounds by Simplex method basic feasible solution (BFS) by Simplex method for bounded variables.
10.1 Trading Strategies Involving Options Chapter 10.
Discrete Math Recurrence Relations 1.
ABC Technology Project
Another example Max z=5x1+12x2+4x3-MR S.t. x1+2x2+x3+x4=10
Shadow Prices vs. Vickrey Prices in Multipath Routing Parthasarathy Ramanujam, Zongpeng Li and Lisa Higham University of Calgary Presented by Ajay Gopinathan.
Basic Permutations and Combinations
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
VOORBLAD.
1. 2 No lecture on Wed February 8th Thursday 9 th Feb 14: :00 Thursday 9 th Feb 14: :00.
Quadratic Inequalities
Equations of Lines Equations of Lines
Artificial Variables, 2-Phase and Big M Methods
Constant, Linear and Non-Linear Constant, Linear and Non-Linear
Factor P 16 8(8-5ab) 4(d² + 4) 3rs(2r – s) 15cd(1 + 2cd) 8(4a² + 3b²)
The simplex algorithm The simplex algorithm is the classical method for solving linear programs. Its running time is not polynomial in the worst case.
Optimization 1/33 Radford, A D and Gero J S (1988). Design by Optimization in Architecture, Building, and Construction, Van Nostrand Reinhold, New York.
© 2012 National Heart Foundation of Australia. Slide 2.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
25 seconds left…...
Januar MDMDFSSMDMDFSSS
1. 2 No lecture on Wed February 8th Thursday 9 th Feb Friday 27 th Jan Friday 10 th Feb Thursday 14: :00 Friday 16:00 – 19:00 HS N.
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Local Search Jim Little UBC CS 322 – CSP October 3, 2014 Textbook §4.8
CPSC 322, Lecture 14Slide 1 Local Search Computer Science cpsc322, Lecture 14 (Textbook Chpt 4.8) Oct, 5, 2012.
PSSA Preparation.
IEOR 4004 Midterm Review (part I)
Oct 9, 2014 Lirong Xia Hypothesis testing and statistical decision theory.
NON - zero sum games.
EMIS 8374 LP Review: The Ratio Test. 1 Main Steps of the Simplex Method 1.Put the problem in row-0 form. 2.Construct the simplex tableau. 3.Obtain an.
LINEAR PROGRAMMING Modeling a problem is boring --- and a distraction from studying the abstract form! However, modeling is very important: --- for your.
1 LP Duality Lecture 13: Feb Min-Max Theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum.
Totally Unimodular Matrices
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Part 3: The Minimax Theorem
Chapter 6 Linear Programming: The Simplex Method Section 3 The Dual Problem: Minimization with Problem Constraints of the Form ≥
Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
MIT and James Orlin © Chapter 3. The simplex algorithm Putting Linear Programs into standard form Introduction to Simplex Algorithm.
The Duality Theorem Primal P: Maximize
EMGT 6412/MATH 6665 Mathematical Programming Spring 2016
Chap 9. General LP problems: Duality and Infeasibility
Lecture 20 Linear Program Duality
DUALITY THEORY Reference: Chapter 6 in Bazaraa, Jarvis and Sherali.
Presentation transcript:

Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes

2 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 = 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1, x 2, x 3, x 4 0 maximize y 1 + 3y 2 subject to y 1 + 2y 2 y 1 + 3y 2 y 1 + 4y 2 y 1 + 5y 2 Observation 1. The constraint matrix in the primal is the transpose of the constraint matrix in the dual. Observation 2. The RHS coefficients in the primal become the cost coefficients in the dual. DUSL PROBLEM:

3 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 = 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1, x 2, x 3, x 4 0 DUSL PROBLEM: maximize y 1 + 3y 2 subject to y 1 + 2y 2 y 1 + 3y 2 y 1 + 4y 2 y 1 + 5y 2 Observation 3. The cost coefficients in the primal become the RHS coefficients in the dual. Observation 4. The primal (in this case) is a max problem with equality constraints and non-negative variables The dual (in this case) is a minimization problem with constraints and variables unconstrained in sign.

4 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 = 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1, x 2, x 3, x 4 0 DUSL PROBLEM: maximize y 1 + 3y 2 subject to y 1 + 2y 2 y 1 + 3y 2 y 1 + 4y 2 y 1 + 5y 2 Question. How does the dual change if we have inequality constraints? Recall that the optimal dual variables are the shadow prices for the primal

5 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 2x 1 + 3x 2 +4x 3 + 5x 4 x 1, x 2, x 3, x 4 Prices Method. Recall that the optimal dual variables are shadow prices. Suppose we replace the 1 by 1 + Can the optimal objective value go down? Can it go up? Suppose we replace the 3 by 3 + Can the optimal objective value go down? Can it go up? Conclusion: y1 0. Conclusion: y2 0.

6 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 = 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1, x 2, x 3, x 4 0 DUSL PROBLEM: maximize y 1 + 3y 2 subject to y 1 + 2y 2 y 1 + 3y 2 y 1 + 4y 2 y 1 + 5y 2 Now suppose that we permit variables in the primal problem to be either 0 or unconstrained in sign. Recall: reduced costs are the shadow prices of the 0 and 0 constraints.

7 PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 = 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1 0 x 2 0 x 3 0x 4 uis c 1 = 3 – y 1 – 2y 2 Suppose we replace x1 0 by x1. Can the optimal objective value go down? Can it go up? Conclusion: c1 0, and thus y1 + 2y2 3. Prices To do with your partner: figure out the sign on the shadow price for the constraint x3 0. Also, what do we do with x4 uis?

8 Summary for forming the dual of a maximization problem PRIMAL Max Σj aijxj = bi Σj aijxj bi xj 0 xj u.i.s. DUAL Min yi u.i.s. yi 0 j yiaij cj j yiaij = cj

9 Complementary Slackness PRIMAL DUAL Σj aijxj bi yi 0 xj 0 Σj yiaij cj Comp. Slackness yi ( j aijxj - bi ) = 0 xj j yiaij - cj ) = 0

10 Determine the dual PRIMAL PROBLEM: maximize subject to z = 3x 1 + 4x 2 +6x 3 + 8x 4 x 1 + x 2 + x 3 + x 4 1 2x 1 + 3x 2 +4x 3 + 5x 4 = 3 x 1 0, x 2 0 x 3 u.i.s. Determine the dual of the above linear program. Then compare your result with that of your partner.

11 Duals of Minimization Problem The dual of a minimization problem is a maximization problem. The shadow prices for the dual linear program form the optimal solution for the primal problem. The dual of the dual is the primal.

12 2-person 0-sum game theory Person R chooses a row: either 1, 2, or 3 Person C chooses a column: either 1, 2, or 3 This matrix is the payoff matrix for player R. (And player C gets the negative.) e.g., R chooses row 3; C chooses column 1 R gets 1; C gets –1 (zero sum)

13 Some more examples of payoffs R chooses 2, C chooses 3 R gets 0; C gets 0 (zero sum) R chooses row 3; C chooses column 3 R gets -2; C gets +2 (zero sum)

14 Next: 2 volunteers Player R puts out 1, 2 or 3 fingers Player C simultaneously puts out 1, 2, or 3 fingers We will run the game for 5 trials. R tries to maximize his or her total C tries to minimize Rs total.

15 Next: Play the game with your partner (If you dont have one, then watch) Player R puts out 1, 2 or 3 fingers Player C simultaneously puts out 1, 2, or 3 fingers We will run the game for 5 trials. R tries to maximize his or her total C tries to minimize Rs total.

16 Who has the advantage: R or C? Suppose that R and C are both brilliant players and they play a VERY LONG TIME. We will find a lower and upper bound on the payoff to R using linear programming. Will Rs payoff be positive in the long run, or will it be negative, or will it converge to 0?

17 Computing a lower bound Suppose that player R must announce his or her strategy in advance of C making a choice. If R is forced to announce a row, then what row will R select? A strategy that consists of selecting the same row over and over again is a pure strategy. R can guarantee a payoff of at least –1.

18 Computing a lower bound on Rs payoff Suppose we permit R to choose a random strategy. Suppose R will flip a coin, and choose row 1 if Heads, and choose row 3 if tails. The column player makes the choice after hearing the strategy, but before seeing the flip of the coin.

19 What is players C best response? If C knows Rs random strategy, then C can determine the expected payoff for each column chosen Prob. Expected Payoff What would Cs best response be? So, with a random strategy R can get at least -.5

20 Suppose that R randomizes between row 1 and row 2. Prob. Expected Payoff What would Cs best response be? So, with a random strategy R can get at least 0.

21 What is Rs best random strategy? Prob. C will choose the column that is the minimum of A, B, and C. Expected Payoff A: -2 x1 + 2 x2 + x3 B: x1 - x2 C: 2 x1 - 2 x3

22 Finding the min of 2 numbers as an optimization problem. Let z = min (x, y) Then z is the optimum solution value to the following LP: maximize z subject to z x z y

23 Rs best strategy, as an LP Expected Payoff 2-person 0-sum game Maximize z (the payoff to x) A: z -2 x1 + 2 x2 + x3 C: z 2 x1 - 2 x3 B: z x1 - x2 x1 + x2 + x3 = 1 x1, x2, x3 0

24 The Row Players LP, in general Expected Payoff Maximize z (the payoff to x) Pj: z a1j x1 + a2j x2 +… + anjxn for all j x1 + x2 + … + xn = 1 xj 0 for all j

25 Here is the optimal random strategy for R. This strategy is a lower bound on what R can obtain if he or she takes into account what C is doing. Expected Payoff The optimal payoff to R is 1/9. So, with a random strategy R guarantees obtaining at least 1/9. Prob.

26 We can obtain a lower bound for C (and an upper bound for R) in the same manner. Exp. payoff If C chooses a random strategy, it will give an upper bound on what R can obtain. If C announced this random strategy, R would select 1 or 2. So, with this random strategy C guarantees that R obtains at most 1/3. Prob.

27 The best random strategy for C is to minimize the max expected payoff. The column player can set up a linear program to determine the best random strategy Note: C can use a random strategy to ensure that R receives at most 1/9 on average. Exp. payoff So, with this random strategy R gets only 1/9. Prob.

28 The best random strategy for C is to minimize the max expected payoff. So, the optimal average payoff to the game is 1/9, assuming that both players play optimally. Exp. payoff Prob. For 2-person 0-sum games, the maximum payoff that R can guarantee by choosing a random strategy is the minimum payoff to R that C can guarantee by choosing a random strategy.

29 2-person 0-sum games in general. Let x denote a random strategy for R, with value z(x) and let y denote a random strategy for C with value v(y). z(x) v(y) for all x, y The optimum x* can be obtained by solving an LP. So can the optimum y*. z(x*) = v(y*) The two linear programs are dual to each other.

30 More on 2-person 0-sum games In principle, R can do as well with a random fixed strategy as by carefully varying a strategy over time Duality for game theory was discovered by Von Neumann and Morgenstern (predates LP duality). The idea of randomizing strategy permeates strategic gaming.