Algorithms for solving two-player normal form games

Slides:



Advertisements
Similar presentations
Preprocessing Techniques for Computing Nash Equilibria Vincent Conitzer Duke University Based on: Conitzer and Sandholm. A Generalized Strategy Eliminability.
Advertisements

Complexity Results about Nash Equilibria Vincent Conitzer, Tuomas Sandholm International Joint Conferences on Artificial Intelligence 2003 (IJCAI’03) Presented.
A study of Correlated Equilibrium Polytope By: Jason Sorensen.
Linear programming: lp_solve, max flow, dual CSC 282 Fall 2013.
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Totally Unimodular Matrices
Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.
C&O 355 Mathematical Programming Fall 2010 Lecture 12 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A.
COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.
Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.
6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.
Introduction to Game theory Presented by: George Fortetsanakis.
Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Two-Player Zero-Sum Games
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Nash Equilibria In Graphical Games On Trees Edith Elkind Leslie Ann Goldberg Paul Goldberg.
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Course: Applications of Information Theory to Computer Science CSG195, Fall 2008 CCIS Department, Northeastern University Dimitrios Kanoulas.
Towards a Constructive Theory of Networked Interactions Constantinos Daskalakis CSAIL, MIT Based on joint work with Christos H. Papadimitriou.
Ranking Games that have Competitiveness-based Strategies Leslie Goldberg, Paul Goldberg, Piotr Krysta and Carmine Ventre University of Liverpool.
Network Theory and Dynamic Systems Game Theory: Mixed Strategies
Algorithms for solving two- player normal form games Tuomas Sandholm Carnegie Mellon University Computer Science Department.
General-sum games You could still play a minimax strategy in general- sum games –I.e., pretend that the opponent is only trying to hurt you But this is.
Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.
Vincent Conitzer CPS Normal-form games Vincent Conitzer
by Vincent Conitzer of Duke
Eponine Lupo.  Game Theory is a mathematical theory that deals with models of conflict and cooperation.  It is a precise and logical description of.
Christos alatzidis constantina galbogini.  The Complexity of Computing a Nash Equilibrium  Constantinos Daskalakis  Paul W. Goldberg  Christos H.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Game Theory and Applications following H. Varian Chapters 28 & 29.
INSTITUTO DE SISTEMAS E ROBÓTICA Minimax Value Iteration Applied to Robotic Soccer Gonçalo Neto Institute for Systems and Robotics Instituto Superior Técnico.
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
Stackelberg Scheduling Strategies By Tim Roughgarden Presented by Alex Kogan.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
Game Theory.
Computing equilibria in extensive form games Andrew Gilpin Advanced AI – April 7, 2005.
An Intro to Game Theory Avrim Blum 12/07/04.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Nash Equilibria In Graphical Games On Trees Revisited Edith Elkind Leslie Ann Goldberg Paul Goldberg (University of Warwick) (To appear in ACM EC’06)
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
Game theory & Linear Programming Steve Gu Mar 28, 2008.
Dominance Since Player I is maximizing her security level, she prefers “large” payoffs. If one row is smaller (element- wise) than another,
The Science of Networks 6.1 Today’s topics Game Theory Normal-form games Dominating strategies Nash equilibria Acknowledgements Vincent Conitzer, Michael.
Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Shall we play a game? Game Theory and Computer Science Game Theory /06/05 - Zero-sum games - General-sum games.
1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
GAME THEORY Day 5. Minimax and Maximin Step 1. Write down the minimum entry in each row. Which one is the largest? Maximin Step 2. Write down the maximum.
Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, Stefan Kratsch, and Magnus Wahlstrom..
Finding equilibria in large sequential games of imperfect information CMU Theory Lunch – November 9, 2005 (joint work with Tuomas Sandholm)
2.5 The Fundamental Theorem of Game Theory For any 2-person zero-sum game there exists a pair (x*,y*) in S  T such that min {x*V. j : j=1,...,n} =
Nash Equilibrium: P or NP?
Tools for Decision Analysis: Analysis of Risky Decisions
Information, Control and Games
Enumerating All Nash Equilibria for Two-person Extensive Games
Lecture 20 Linear Program Duality
Normal Form (Matrix) Games
Presentation transcript:

Algorithms for solving two-player normal form games

Recall: Nash equilibrium Let A and B be |M| x |N| matrices. Mixed strategies: Probability distributions over M and N If player 1 plays x, and player 2 plays y, the payoffs are xTAy and xTBy Given y, player 1’s best response maximizes xTAy Given x, player 2’s best response maximizes xTBy (x,y) is a Nash equilibrium if x and y are best responses to each other

Finding Nash equilibria Zero-sum games Solvable in poly-time using linear programming General-sum games Unknown whether solvable in poly-time “Together with factoring, the complexity of finding a Nash equilibrium is in my opinion the most important concrete open question on the boundary of P today.” -- Papadimitriou Several algorithms with exponential worst-case running time Lemke-Howson [1964] – linear complementarity problem Porter-Nudelman-Shoham [AAAI-04] – support enumeration Sandholm-Gilpin-Conitzer [2005] - MIP Nash = mixed integer programming approach

Zero-sum games Among all best responses, there is always at least one pure strategy Thus, player 1’s optimization problem is: This is equivalent to: By LP duality, player 2’s optimal strategy is given by the dual variables

General-sum games: Lemke-Howson algorithm = pivoting algorithm similar to simplex algorithm We say each mixed strategy is “labeled” with the player’s unplayed pure strategies and the pure best responses of the other player A Nash equilibrium is a completely labeled pair (i.e., the union of their labels is the set of pure strategies)

Lemke-Howson Illustration Example of label definitions

Lemke-Howson Illustration Equilibrium 1

Lemke-Howson Illustration Equilibrium 2

Lemke-Howson Illustration Equilibrium 3

Lemke-Howson Illustration Run of the algorithm

Lemke-Howson Illustration

Lemke-Howson Illustration

Lemke-Howson Illustration

Lemke-Howson Illustration