Nash Equilibrium: P or NP?

Slides:

Advertisements

Similar presentations

A study of Correlated Equilibrium Polytope By: Jason Sorensen.

Advertisements

Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.

This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.

Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.

Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.

COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.

Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.

6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.

Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.

MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

Seminar In Game Theory Algorithms, TAU, Agenda  Introduction  Computational Complexity  Incentive Compatible Mechanism  LP Relaxation & Walrasian.

Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.

by Vincent Conitzer of Duke

Complexity Results about Nash Equilibria

An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.

Duality Lecture 10: Feb 9. Min-Max theorems In bipartite graph, Maximum matching = Minimum Vertex Cover In every graph, Maximum Flow = Minimum Cut Both.

1 Computing Nash Equilibrium Presenter: Yishay Mansour.

An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.

Finite Mathematics & Its Applications, 10/e by Goldstein/Schneider/SiegelCopyright © 2010 Pearson Education, Inc. 1 of 68 Chapter 9 The Theory of Games.

 Linear Programming and Smoothed Complexity Richard Kelley.

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer

Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Presenter: Jen Hua Chi Adviser: Yeong Sung Lin Network Games with Many Attackers and Defenders.

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.

Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.

Regret Minimizing Equilibria of Games with Strict Type Uncertainty Stony Brook Conference on Game Theory Nathanaël Hyafil and Craig Boutilier Department.

Algorithmic Game Theory and Internet Computing Vijay V. Vazirani Georgia Tech Primal-Dual Algorithms for Rational Convex Programs II: Dealing with Infeasibility.

1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.

Algorithms for solving two-player normal form games

Approximation Algorithms Department of Mathematics and Computer Science Drexel University.

1  Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Constraint Satisfaction Problems and Games

A useful reduction (SAT -> game)

Linear Programming Many problems take the form of maximizing or minimizing an objective, given limited resources and competing constraints. specify the.

The Duality Theorem Primal P: Maximize

Game Theory Just last week:

Tools for Decision Analysis: Analysis of Risky Decisions

Market Equilibrium Ruta Mehta.

Non-additive Security Games

A useful reduction (SAT -> game)

Computing equilibria in extensive form games

Algorithmic Game Theory and Internet Computing

Communication Complexity as a Lower Bound for Learning in Games

Vincent Conitzer CPS Repeated games Vincent Conitzer

Historical Note von Neumann’s original proof (1928) used Brouwer’s fixed point theorem. Together with Danzig in 1947 they realized the above connection.

Linear Programming.

Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.

Multiagent Systems Extensive Form Games © Manfred Huber 2018.

Vincent Conitzer Normal-form games Vincent Conitzer

Network Formation Games

Chapter 11 Limitations of Algorithm Power

Computing Nash Equilibrium

Multiagent Systems Repeated Games © Manfred Huber 2018.

Enumerating All Nash Equilibria for Two-person Extensive Games

Vincent Conitzer Repeated games Vincent Conitzer

Network Formation Games

15th Scandinavian Workshop on Algorithm Theory

Collaboration in Repeated Games

2-Nash and Leontief Economy

Normal Form (Matrix) Games

A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.

Vincent Conitzer CPS Repeated games Vincent Conitzer

Presentation transcript:

Nash Equilibrium: P or NP? S Kameshwaran Oct 25, 2002

Complexity prelims P NP Set of problems that have algorithms which have polynomial running time (worst case) in the input length NP Given a solution, checking its correctness is possible in polynomial time But finding a solution is not assured in polynomial time

Complexity prelims Polynomial reductions NP-Complete: Two problems X and Y X p Y: If there is a polynomial algorithm that takes an instance X` of X and creates an instance Y` of Y, such that they are equivalent in terms of the solutions NP-Complete: Most difficult problems in NP If any of these problems can be solved in polynomial time then all problems in NP can be solved in polynomial time

Complexity prelims Cook’s Theorem: If Y p X, where Y  NP-Complete SAT is NP-Complete SAT  NP For all X  NP, X p SAT If Y p X, where Y  NP-Complete X is NP-Hard X  NP, then X  NP-Complete

Nash Equilibrium A strategy combination is in NE if each player chooses a strategy that is the best response to the strategies chosen by others Issues: Existence of NE for a game Uniqueness of NE Pareto-optimality

Nash Equilibrium: Dining in Bangalore (TGIF, TGIF) is good for both the players (Pizza Hut, Pizza Hut) is not a admissible NE Man Woman Pizza Hut TGIF 10, 10 0, 0 20, 20

Nash Equilibrium: Battle of the Sexes Both NE are Pareto-optimal and both are admissible Man Woman Prize Fight Ballet 2, 1 0, 0 1, 2

Nash Equilibrium N players Action set of player i: Si={si1, …, sim(i)} Utility to player i: ui: S  R Strategy of player i: pi={pi1, …, pim(i)} pij >= 0, jpij = 1 Utility of using strategy p: ui(p) = s ui(s) j pj(s)

ui(pi, p*-i) <= ui(p*i, p*-i) Nash Equilibrium N players Action set of player i: Si={si1, …, sim(i)} S = i Si Utility to player i: ui: S  R Strategy of player i: pi={pi1, …, pim(i)} pij >= 0, jpij = 1 Utility of using strategy p: ui(p) = s ui(s) j pj(s) Nash Equilibrium: p* is NE if for all i, and for all pi ui(pi, p*-i) <= ui(p*i, p*-i)

Some more notations xij(p) = ui(sij,p-i) zij(p) = xij(p) – ui(p) Other players use mixed strategy profile p-i and i use pure strategy sij zij(p) = xij(p) – ui(p) p* is NE iff z(p*) <= 0 gij(p) = max[zij(p), 0] p* is NE iff gij(p*) = 0

NE as a fixed point of a function Define y: P  P p* is NE iff it is a fixed point of y y is a continuous function of a compact set P to itself, so a fixed point exists (Brouwer’s FPT) Nash’s proof of existence of NE for N-person games

NE as a fixed point of a correspondence Best response correspondence: BR: P  P Point to set function BR(p) = arg maxq[i ui(qi, p-i)] BR(p) = Set of best response strategies to p p* is a NE iff it is a fixed point of best response correspondence, p*BR(p*) BR is non-empty, closed and convex valued, hence by Kakutani’s FPT, fixed point exists

NE as a solution to complementarity problem Find a pair of vectors x, y which are complementary (orthogonal) to each other Used to prove complementary slackness and optimality in mathematical programming p* is NE iff p* and z(p*) are orthogonal Two-person games: Linear complementary problems

NE as a minimum of a function on a polytope Define v: P  R v(p) = i j [gij(p)]2 v is a continuous, differentiable function on polytope P p* is NE iff it is global minimum of v, v(p*) = 0 Other formulations: Stationary point, semi-algebraic set

Two person zero sum games 2 Players For every strategy x of player 1 and strategy y of player 2, payoff to 1+ payoff to 2=0 What player one loses is gained by the other Player 1 wants to minimize his losses Determine the optimal mixed strategy that minimizes his loss Player 2 wants to maximize his profits Determine the optimal mixed strategy that maximizes his profit

Two person zero sum games Minimax theorem: Expected minimum of the maximum loss = Expected maximum of the minimum profit The above can be converted into two linear programs which are dual to each other LP  P Simplex algorithm can solve it efficiently though it is not bounded in polynomial time

Two person general sum games Two person non-zero sum games can be formulated as linear complementary problems (LCP) LCP: Given a matrix M and a vector q Find w and z, such that w – Mz = q w >= 0, z >= 0, and wizi = 0 for all i Solution technique: Lemke-Howson Algorithm

Two person general sum games LCP Can be characterized as LPs (Mangasarian, 1978) The solution to LCP can be obtained by optimizing a suitable LP Finding the suitable objective function is not easy (depends on matrix M) Possible Approach: Characterize M in terms of two person games (M consists of payoff matrices of both players) Infer the complexity of games using M

Two person general sum games Lemke-Howson Algorithm Finds a sample equilibrium Incomplete for finding all equilibria Computational complexity: still unknown Exponential lower bound is shown (Murty, 1978) Can be exponential even on a zero-sum game Approximation theory cannot be used as there is no objective function is used and one cannot know how close is the solution to the optimal

Two person general sum games Reduction of a SAT to symmetric two-person game (Conitzer, Sandholm, 2002) Following are NP-hard (even for symmetric case) Deciding whether more than one NE exists Deciding whether a given strategy is played in NE Deciding whether a given strategy is never played in NE Deciding if a Pareto-optimal NE exists

N person games NE as a min of a function v(p) = i j [gij(p)]2 Constraints: pij >= 0, jpij = 1 Global minima correspond to NE Local minima may exist Algorithms: Continuous differentiable function, so any global optimization technique can be used Penalty function methods Convergence of these algorithms are very slow

N person games Alternate formulation: Non-linear complementarity problem Sequence of approximations by LCP Similar to Newton’s method Scarf’s algorithms Worst case complexity is exponential in N

Bayesian-Nash Equilibrium Games of incomplete information: A player may not the utility of other players Most real world applications induce such games: Auctions, markets, bargaining, etc Modifications: Ti: Set of types a player can belong to Eg: In bargaining, the seller assumes that buyer belongs to type [10,25], which means that buyer may value the good anywhere between 10 and 25 Let q be the joint probability distribution on T = i Ti q and T are known to all players Each player knows his own type ti  Ti

Bayesian Nash Equilibrium N players Action set of player i: Ai={ai1, …, aim(i)} A = i Ai Utility to player i: ui: A  T  R (utility depends on the type of the player) Strategy of player i, i : Ti  Ai If either Ti or Ai is infinite then the strategy space is infinite Strategy is now a function Utility of using strategy  : ui( , ti) = Expected utility over all the possible types for other players given the type of i is ti

Bayesian Nash Equilibrium Complexity for 2 player case Reduction from Set cover (Conitzer, Sandholm, 2002) Deciding whether a BNE exists is NP-complete If randomization is allowed, then BNE always exists for finite games Infinite case: Strategy space is infinite Finding the utility of a player amounts to optimizing over a function space Function spaces are generally infinite dimensional Assuring the existence of a solution is not possible

Bayesian Nash Equilibrium Games arising out of market design Infinite Games Given the action combination from players, utility to each player is calculated by some optimization problem some resource allocation algorithm which moves the goods from sellers to buyers to maximize some criteria Finding the expected utility: What is the complexity of finding the utility of a player who uses a strategy  ? What is the complexity of deciding the existence of BNE in infinite games?

References Complementarity and Fixed Point Problems, Ed: M. L. Balinski and R. W. Cottle, North-Holland Publishing Company, 1978 Computational complexity of complementary pivot methods, K. G. Murty Characterization of LCPs as LPs, O. L. Mangasarian Complexity results about NE, V. Conitzer and T. Sandholm, 2002 Computation of equilibria in finite games, R. D. McKelvey and A. McLennan, 1996

Combinatorial Markets Next Week.. Combinatorial Markets Prof. Y. Narahari