Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.

Slides:

Advertisements

Similar presentations

Quantum Lower Bounds The Polynomial and Adversary Methods Scott Aaronson September 14, 2001 Prelim Exam Talk.

Advertisements

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman TexPoint fonts used in EMF. Read the TexPoint manual.

1 A Graph-Theoretic Network Security Game M. Mavronicolas , V. Papadopoulou , A. Philippou  and P. Spirakis § University of Cyprus, Cyprus  University.

A Lemke-Type Algorithm for Market Equilibrium under Separable, Piecewise-Linear Concave Utilities Ruta Mehta Indian Institute of Technology – Bombay Joint.

This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.

Totally Unimodular Matrices

Mixed Strategies CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte.

COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.

Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.

6.896: Topics in Algorithmic Game Theory Lecture 12 Constantinos Daskalakis.

An Approximate Truthful Mechanism for Combinatorial Auctions An Internet Mathematics paper by Aaron Archer, Christos Papadimitriou, Kunal Talwar and Éva.

6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.

Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.

Nash Equilibria In Graphical Games On Trees Edith Elkind Leslie Ann Goldberg Paul Goldberg.

Equilibrium Concepts in Two Player Games Kevin Byrnes Department of Applied Mathematics & Statistics.

by Vincent Conitzer of Duke

Christos alatzidis constantina galbogini.  The Complexity of Computing a Nash Equilibrium  Constantinos Daskalakis  Paul W. Goldberg  Christos H.

Jie Gao Joint work with Amitabh Basu*, Joseph Mitchell, Girishkumar Stony Brook Distributed Localization using Noisy Distance and Angle Information.

1 Computing Nash Equilibrium Presenter: Yishay Mansour.

Load Balancing, Multicast routing, Price of Anarchy and Strong Equilibrium Computational game theory Spring 2008 Michal Feldman.

Potential games, Congestion games Computational game theory Spring 2010 Adapting slides by Michal Feldman TexPoint fonts used in EMF. Read the TexPoint.

Pure Nash Equilibria: Complete Characterization of Hard and Easy Graphical Games Albert Xin Jiang U. of British Columbia MohammadAli Safari Sharif U. of.

Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.

Congestion Games (Review and more definitions), Potential Games, Network Congestion games, Total Search Problems, PPAD, PLS completeness, easy congestion.

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

The Computational Complexity of Finding a Nash Equilibrium Edith Elkind, U. of Warwick.

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

Graphical Models for Game Theory by Michael Kearns, Michael L. Littman, Satinder Singh Presented by: Gedon Rosner.

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.

An Intro to Game Theory Avrim Blum 12/07/04.

Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer

Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Approximating Minimum Bounded Degree Spanning Tree (MBDST) Mohit Singh and Lap Chi Lau “Approximating Minimum Bounded DegreeApproximating Minimum Bounded.

An Algorithm for the Coalitional Manipulation Problem under Maximin Michael Zuckerman, Omer Lev and Jeffrey S. Rosenschein (Simulations by Amitai Levy)

The Power of the Defender M. Gelastou  M. Mavronicolas  V. Papadopoulou  A. Philippou  P. Spirakis §  University of Cyprus, Cyprus § University of.

Small clique detection and approximate Nash equilibria Danny Vilenchik UCLA Joint work with Lorenz Minder.

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.

Batch Scheduling of Conflicting Jobs Hadas Shachnai The Technion Based on joint papers with L. Epstein, M. M. Halldórsson and A. Levin.

Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.

1 The Price of Defense M. Mavronicolas , V. Papadopoulou , L. Michael ¥, A. Philippou , P. Spirakis § University of Cyprus, Cyprus  University of Patras.

Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.

Algorithms for solving two-player normal form games

1. 2 Some details on the Simplex Method approach 2x2 games 2xn and mx2 games Recall: First try pure strategies. If there are no saddle points use mixed.

1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.

Vasilis Syrgkanis Cornell University

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 8.

Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, Stefan Kratsch, and Magnus Wahlstrom..

Common Intersection of Half-Planes in R 2 2 PROBLEM (Common Intersection of half- planes in R 2 ) Given n half-planes H 1, H 2,..., H n in R 2 compute.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Lower Bounds on Extended Formulations Noah Fleming University of Toronto Supervised by Toniann Pitassi.

Comp/Math 553: Algorithmic Game Theory

Nash Equilibrium: P or NP?

Market Equilibrium Ruta Mehta.

Non-additive Security Games

Computability and Complexity

Structured Models for Multi-Agent Interactions

Network Formation Games

Computing Nash Equilibrium

AI and Games 唐平中清华大学 2/22/2019 Pingzhong Tang.

Enumerating All Nash Equilibria for Two-person Extensive Games

Lecture 20 Linear Program Duality

Network Formation Games

Collaboration in Repeated Games

2-Nash and Leontief Economy

Normal Form (Matrix) Games

Presentation transcript:

Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert X. Jiang

A Game: Rock-Paper-Scissor

Rock-Paper-Scissor: A Play Winner $1$1

Rock-Paper-Scissor: A Play Winner $1$1

Rock-Paper-Scissor: A Play Winner $1$1

0,0-1,11,-1 0,0-1,1 1,-10,0 Rock-Paper-Scissor Payoffs

RPC R01 P10 C 10 Bimatrix Game Steady State: No player gains by unilateral deviation RPC R01 P 01 C1 0 S 1 = { R, P, C } S 2 = { R, P, C } AB

RPC R01 P10 C 10 Bimatrix Game No Steady State RPC R01 P 01 C1 0 S 1 = { R, P, C } S 2 = { R, P, C } AB

R 1/3 P 1/3 C 1/3 R01 P10 C 10 Mixed Play Steady State RPC R 1/301 P 1/301 C 1/310 S 1 = { R, P, C } A B ∆ 1 ={r 1, p 1, c 1 ≥0; r 1 +p 1 +c 1 =1} S 1 = { R, P, C } ∆ 2 ={r 2, p 2, c 2 ≥0; r 2 +p 2 +c 2 =1}

John Nash (1951)  Finite Game: Finitely many players, each with finitely many strategies.  Nash: Every finite game has a steady state in mixed strategy. Hence forth called Nash equilibrium (NE)  Proved using Kakutani fixed point theorem: Highly non-constructive.

Nash Equilibrium Computation  Papadimitriou (JCSS’94) : PPAD-class  Problems where existence is guaranteed like fixed point, Sperner’s Lemma, Nash equilibrium.  Chen and Deng (FOCS’06) : It is PPAD-hard.  CDT (FOCS’06) : Even approximation is PPAD- hard.

Rank and Computation  Kannan and Theobald (SODA’07) :  Define rank of (A,B) as rank(A+B).  FPTAS for fixed rank games.  Polynomial time algorithms for exact Nash.  Dantzig (1963) : Zero-sum (rank-0) is equiv. to LP.  AGMS (STOC’11) : Rank-1 games.

Bilinear Games Bimatrix Game with polyhedral strategy sets.  Two players: 1 and 2  Polyhedral strategy sets:  X={x | Ex = e; x ≥ 0}, Y={y | Fy=f; y ≥ 0}  Payoff matrices: A, B  Bilinear Payoff: (x, y) fetches x T Ay to player 1, and x T By to player 2. Motivation: Koller et al. (STOC’94) for two-player extensive form game with perfect recall.

Nash Equilibrium in Bilinear  NE: No player gains by unilateral deviation.  Existence: Corollary of Glicksberg’s result.  Symmetric Game: B=A T and Y=X.  (x, y) is a symmetric profile if y=x.  Existence of symmetric NE: An adaptation of Nash’s proof for symmetric bimatrix games.

Bilinear Contains:  Bimatrix, Polymatrix, Bayesian, etc.  Bimatrix: X = ∆ 1, Y = ∆ 2  Polymatrix:  N players. Each pair plays a bimatrix game.  Player i: S i finite strategy set, ∆ i Mixed strategy set.  Goal of i: Choose x i from ∆ i to maximize total payoff. A ij i j

Polymatrix to Bilinear  M= |S 1 |+ … + |S n |. X = {(x 1,…,x n ) | x i in ∆ i }, Y=X.  A, B=A T Symmetric NE of (A,B) maps to a NE of the polymatrix game 0 0 A ij 0 0 i j A =

Best Response (Koller et al.)  Fix a strategy y of player 2.  Player 1 solves max: x T (Ay) min: e T p Ex = e p T E ≥ (Ay) T x ≥ 0 At optimal: p s.t. A i y ≤ p T E i & x i > 0 => A i y = p T E i  Given x X, for player 2 we get At optimal: q s.t. B j x ≤ q T F j & y j > 0 => q T F j = B j x

Best Response Polytopes (BRPs)  (x,y) is a NE iff p: Ay ≤ E T p; x i > 0 => A i y = p T E i q: x T B ≤ q T F; y j > 0 => q T F j = B j x x T (Ay - E T p) ≤ 0 and (x T B - q T F)y ≤ 0 x T (A+B)y – e T p – f T y ≤ 0

Nash Equilibrium in BRPs NE iff x T (Ay - E T p)=0 and (x T B - q T F)y=0 x T (A+B)y – e T p – f T y=0 Assumption: P and Q are non-degnerate. (u, v) of P x Q gives a NE => (u, v) is a vertex.

QP Formulation max: x T (A+B)y – e T p – f T y s.t.(y, p) P (x, q) Q  Optimal value 0.  Only vertex solutions.

Our Results  Rank-1 games: rank(A+B)=1  Extend Adsul et al. algorithm for exact NE.  Fixed rank games: rank(A+B)=k  Extend FPTAS of Kannan et al.  Rank of A or B is constant  Enumerate all NE in polynomial time.

Rank-1 Case  Zero-sum ~ rank(A+B)=0: LP formulation (Charnes’53)  rank(A+B)=1 then A+B = a. b T  The QP formulation: max: (x T a)(b T y) – e T p – f T y s.t.(y, p) P (x, q) Q

Rank-1 Case  Replace (x T a) by z. Recall B = -A + a. b T x T (A+B)y – e T p – f T y=0 z(b T y) – e T p – f T y=0  N = Points of P x Q’ with z(b T y) – e T p – f T y=0  Forms paths and cycles, since z gives one degree of freedom. NE of (A,B): Points in intersection of N and z – x T a =0.

Parameterized LP LP(z) = max: z(b T y) – e T p – f T y s.t.(y, p) P (x, z, q) Q’  Given any c, Optimal value of LP(c) is 0.  OPT(c) lies on N, and  Let N (c)={Points of N with z=c}, then OPT(c)= N (c).  N is a single path on which z is monotonic.

Rank-1: The Algorithm  NE: Intersection of N and H: z – x T a =0. . c 1 =a min, c 2 =a max H N H–H– H+H+ NE N (c 1 ) N (c 2 )

Rank-1: Binary Search Algorithm  NE of (A,B): Points in intersection of N and H.  c=c 1 +c 2 /2. H NE N (c 1 ) N (c 2 ) N N (c) H+H+ H–H–

Rank-1: Binary Search Algorithm  NE of (A,B): Points in intersection of N and H.  c=c 1 +c 2 /2. If N (c) in H –,then c 1 =c else c 2 =c. H NE N (c 2 ) N N (c 1 ) H+H+ H–H–

Analysis  Terminates because,  z is monotonic on N.  Increase in z on each edge is lower bounded by 1/d where d is polynomial sized in the input.  Time complexity:  Solve LP(c) to get N (c) in each pivot.  log(d) * log(a max – a min ) pivots.

Conclusions  Bilinear games:  Bimatrix with polytopal strategy sets.  Fairly general. Contains polymatrix, bayesian, etc.  Polynomial time algorithm for rank based subclasses.  Open problems:  Designing a Lemke-Howson type algorithm.  Degree, index, stability concepts.  Computation of approximate equilibrium.

Thank You