A study of Correlated Equilibrium Polytope By: Jason Sorensen.

The set of correlated equilibria is a polytope
A polytope has two equivalent definitions:
1.) The convex hull of a finite set of points in R^n
2.) The bounded intersection of a finite set of closed half-spaces
The correlated equilibrium constraints are: for each player i and each pair of actions a_i, a_i',
sum over a_-i of x(a_i, a_-i) * [ u_i(a_i, a_-i) - u_i(a_i', a_-i) ] >= 0,
together with x >= 0 and the entries of x summing to 1
Every Nash equilibrium is a correlated equilibrium -> the polytope is non-empty

Linear programming
Can find an optimal correlated equilibrium by linear programming
Using the constraints above, the hyperplane maximizing player i's utility is sum over a of x(a) * u_i(a)
Easily solved by any LP solver for reasonably sized games
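As a sketch of this LP (using scipy.optimize.linprog; the function name max_u1_correlated_eq and the Prisoner's Dilemma payoffs are my own illustration, not from the slides), the snippet builds the incentive constraints of a two-player game and maximizes player 1's expected payoff over the correlated equilibrium polytope:

```python
import numpy as np
from scipy.optimize import linprog

def max_u1_correlated_eq(A, B):
    """Maximize player 1's expected payoff over the CE polytope.

    A, B: m x n payoff matrices for players 1 and 2.
    Variable: the joint distribution x over action profiles, flattened row-major.
    """
    m, n = A.shape
    rows = []
    # Player 1, recommended row i, deviation k:
    #   sum_j x[i,j] * (A[i,j] - A[k,j]) >= 0   ->  negate for the <= 0 form.
    for i in range(m):
        for k in range(m):
            if k != i:
                r = np.zeros((m, n))
                r[i, :] = A[k, :] - A[i, :]
                rows.append(r.ravel())
    # Player 2, recommended column j, deviation l.
    for j in range(n):
        for l in range(n):
            if l != j:
                r = np.zeros((m, n))
                r[:, j] = B[:, l] - B[:, j]
                rows.append(r.ravel())
    res = linprog(-A.ravel(),                     # maximize player 1's payoff
                  A_ub=np.array(rows), b_ub=np.zeros(len(rows)),
                  A_eq=np.ones((1, m * n)), b_eq=[1.0], bounds=(0, 1))
    return res.x.reshape(m, n), -res.fun

# Prisoner's Dilemma (hypothetical payoffs): defection strictly dominates,
# so the incentive constraints force all mass onto (Defect, Defect).
A = np.array([[3., 0.], [5., 1.]])
B = A.T
x, val = max_u1_correlated_eq(A, B)
```

Because the CE polytope of this game is a single point, the maximized value is just player 1's defection payoff of 1.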

The Shapley game
12 (non-trivial) LP constraints
6 of them reduce to x1 = x2 = x5 = x6 = x7 = x9
Reduces the original 8-dimensional polytope to 3 dimensions
Unique Nash equilibrium at xi = 1/9
This corresponds to the LP minimum for all utility hyperplanes

      1      2      3
1   1,0    0,1    0,0
2   0,0    1,0    0,1
3   0,1    0,0    1,0
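A quick numerical check of this claim (a sketch; the matrices are read off the slide's game table): at the uniform joint distribution with every entry 1/9, each correlated-equilibrium incentive constraint of the Shapley game holds with equality, which is consistent with this point being the LP minimum for every utility hyperplane.

```python
import numpy as np

# Shapley game from the table: A = player 1's payoffs, B = player 2's.
A = np.eye(3)
B = np.array([[0., 1., 0.],
              [0., 0., 1.],
              [1., 0., 0.]])
x = np.full((3, 3), 1 / 9)  # the unique Nash equilibrium as a joint distribution

# Largest gain any player can obtain by deviating from a recommendation.
worst = max(
    max(sum(x[a, j] * (A[d, j] - A[a, j]) for j in range(3))
        for a in range(3) for d in range(3)),
    max(sum(x[i, a] * (B[i, d] - B[i, a]) for i in range(3))
        for a in range(3) for d in range(3)),
)
print(worst)  # 0.0 -- every incentive constraint is tight at the uniform point
```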

The Shapley polytope

Chicken or Dare?
Game matrix is:

      D      C
D   0,0    7,2
C   2,7    6,6

3 Nash equilibria: at x3 = 1, at x2 = 1, and one mixed
Each Nash equilibrium is an LP minimum for a utility hyperplane
Can reduce the polytope dimension to 3 by using the equality constraint (the probabilities sum to 1)
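For concreteness, here is a hedged sketch (scipy's linprog; the outcome order DD, DC, CD, CC and the welfare objective are my choices) that maximizes the players' total payoff over the chicken polytope:

```python
import numpy as np
from scipy.optimize import linprog

# Chicken from the slide; joint outcomes ordered DD, DC, CD, CC
# (player 1's action listed first, D = Dare, C = Chicken).
u1 = np.array([0., 7., 2., 6.])
u2 = np.array([0., 2., 7., 6.])

# Incentive constraints, written in the A_ub @ x <= 0 form:
A_ub = np.array([
    [2., -1.,  0., 0.],   # p1 told D: 7*x_DC >= 2*x_DD + 6*x_DC
    [0.,  0., -2., 1.],   # p1 told C: 2*x_CD + 6*x_CC >= 7*x_CC
    [2.,  0., -1., 0.],   # p2 told D: 7*x_CD >= 2*x_DD + 6*x_CD
    [0., -2.,  0., 1.],   # p2 told C: 2*x_DC + 6*x_CC >= 7*x_CC
])
res = linprog(-(u1 + u2), A_ub=A_ub, b_ub=np.zeros(4),
              A_eq=np.ones((1, 4)), b_eq=[1.0], bounds=(0, 1))
x, welfare = res.x, -res.fun
```

Under these payoffs the optimum puts weight 1/4, 1/4, 1/2 on DC, CD, CC, giving each player 5.25 — above the 14/3 ≈ 4.67 each player receives at the mixed Nash equilibrium.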

The Chicken Polytope

Theorem time!
All Nash equilibria lie on the boundary of the correlated equilibrium polytope
Proof on the board!
… But this may not be too useful if the polytope is not full-dimensional (boundary != relative boundary)
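A sketch of the usual argument (my paraphrase, for a bimatrix game (A, B) with Nash equilibrium (p, q)):

```latex
% At a Nash equilibrium the joint distribution is the product x = p \otimes q,
% so the CE constraint for player 1 (recommendation i, deviation k) factors:
\sum_{j} x_{ij}\,(A_{ij} - A_{kj})
  \;=\; p_i \,\bigl(u_1(i, q) - u_1(k, q)\bigr) \;\ge\; 0.
% If p_i = 0, the constraint reads 0 = 0 and is tight. If instead p is
% completely mixed, indifference over the support gives
% u_1(i, q) = u_1(k, q) for all i, k in the support, again an equality.
% Either way some defining inequality is active at x, so every Nash
% equilibrium lies on the boundary of the polytope (only the relative
% boundary when the polytope is not full-dimensional).
```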

Are CE always better than NE?
In the two cases we studied, all utility hyperplanes were minimized at an NE
Is this generally the case? NO!
In fact, there is no way to find NE by linear systems in general

Correlated equilibrium gets bested
Game of “Poker” (?!)
Unique Nash equilibrium, with irrational coordinates
The NE does not occur at a vertex (all vertices are rational)
Value of the NE for player one: 0.890
Value of the worst-case CE for player one: 0.5833
Value of the best-case CE for player one:
The CE polytope is fully 7-dimensional (not graphable)

1:
        L          R
T   (3,0,2)    (0,2,0)
B   (0,1,0)    (1,0,0)

2:
        L          R
T              (0,1,0)
B   (0,3,0)    (2,0,3)

Putting it all together
We can guarantee convergence to CE by natural (no-regret) learning processes
But CE may not always be better than Nash equilibrium
Which CE do these “natural learning processes” converge to?
How long does convergence take?
Will investigate further in the next 2 weeks
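As a sketch of such a process (conditional regret matching in the style of Hart and Mas-Colell, run on the chicken game from the earlier slide; the horizon T, normalization constant mu, and seed are my choices), the empirical joint play approaches the CE polytope:

```python
import numpy as np

rng = np.random.default_rng(0)

# Chicken; action 0 = Dare, 1 = Chicken.
# U[i][a1, a2] is player i's payoff when p1 plays a1 and p2 plays a2.
U = [np.array([[0., 7.], [2., 6.]]),
     np.array([[0., 2.], [7., 6.]])]

n, T, mu = 2, 60000, 20.0                 # actions, horizon, normalization
R = [np.zeros((n, n)), np.zeros((n, n))]  # cumulative conditional regrets
last = [0, 0]
counts = np.zeros((n, n))

for t in range(1, T + 1):
    acts = []
    for i in range(2):
        a = last[i]
        p = np.maximum(R[i][a], 0.0) / (t * mu)  # switch prob. ~ positive regret
        p[a] = 0.0
        p[a] = 1.0 - p.sum()                     # stay with the remainder
        acts.append(rng.choice(n, p=p))
    a1, a2 = acts
    counts[a1, a2] += 1
    for alt in range(n):  # "what if every past play of my action were alt?"
        R[0][a1, alt] += U[0][alt, a2] - U[0][a1, a2]
        R[1][a2, alt] += U[1][a1, alt] - U[1][a1, a2]
    last = [a1, a2]

x = counts / T
# Largest remaining violation of a CE incentive constraint.
viol = max(
    max(sum(x[a, j] * (U[0][d, j] - U[0][a, j]) for j in range(n))
        for a in range(n) for d in range(n)),
    max(sum(x[i, a] * (U[1][i, d] - U[1][i, a]) for i in range(n))
        for a in range(n) for d in range(n)),
)
```

As T grows, viol shrinks toward 0, i.e., the empirical distribution of joint play converges to the set of correlated equilibria.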

A Conundrum
Using learning processes to find optimal responses may result in being “bullied” by the opponent (it’s always the smart ones who get picked on)
In chicken or dare, if the opponent is rational and knows you are learning, there is no reason for them not to always play dare
How do we decide whether to learn or bully for optimal payoffs?
Each “playing strategy” (learning or not), played against another in a game, results in a certain payoff for each player after convergence
Model this situation as a new game, where each “move” is a learning strategy, and learn the optimal strategy
Have we really learned anything?

Open Problems
Compare learning strategies to figure out the optimal method
Figure out properties of general game polytopes (e.g., the number of faces)
In which situations is the polytope full-dimensional?
In which situations are the NE the LP-minimizing vertices for all utility hyperplanes?