6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 13.

Slides:



Advertisements
Similar presentations
Approximate Nash Equilibria in interesting games Constantinos Daskalakis, U.C. Berkeley.
Advertisements

On allocations that maximize fairness Uriel Feige Microsoft Research and Weizmann Institute.
1 A Graph-Theoretic Network Security Game M. Mavronicolas , V. Papadopoulou , A. Philippou  and P. Spirakis § University of Cyprus, Cyprus  University.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Price Of Anarchy: Routing
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.
Totally Unimodular Matrices
Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.
6.896: Topics in Algorithmic Game Theory Lecture 8 Constantinos Daskalakis.
COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.
Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.
6.896: Topics in Algorithmic Game Theory Lecture 12 Constantinos Daskalakis.
6.896: Topics in Algorithmic Game Theory Lecture 11 Constantinos Daskalakis.
Two-Player Zero-Sum Games
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Nash Equilibria In Graphical Games On Trees Edith Elkind Leslie Ann Goldberg Paul Goldberg.
Ranking Games that have Competitiveness-based Strategies Leslie Goldberg, Paul Goldberg, Piotr Krysta and Carmine Ventre University of Liverpool.
Towards a Constructive Theory of Networked Interactions Constantinos Daskalakis CSAIL, MIT Based on joint work with Christos H. Papadimitriou.
Ranking Games that have Competitiveness-based Strategies Leslie Goldberg, Paul Goldberg, Piotr Krysta and Carmine Ventre University of Liverpool.
Seminar In Game Theory Algorithms, TAU, Agenda  Introduction  Computational Complexity  Incentive Compatible Mechanism  LP Relaxation & Walrasian.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Jensen’s Inequality (Special Case) EM Theorem.
Christos alatzidis constantina galbogini.  The Complexity of Computing a Nash Equilibrium  Constantinos Daskalakis  Paul W. Goldberg  Christos H.
6.896: Topics in Algorithmic Game Theory Lecture 15 Constantinos Daskalakis.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
Overlapping Coalition Formation: Charting the Tractability Frontier Y. Zick, G. Chalkiadakis and E. Elkind (submitted to AAMAS 2012)
Totally Unimodular Matrices Lecture 11: Feb 23 Simplex Algorithm Elliposid Algorithm.
CPSC 689: Discrete Algorithms for Mobile and Wireless Systems Spring 2009 Prof. Jennifer Welch.
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
AWESOME: A General Multiagent Learning Algorithm that Converges in Self- Play and Learns a Best Response Against Stationary Opponents Vincent Conitzer.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
CS151 Complexity Theory Lecture 6 April 15, 2004.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
Graphical Games Kjartan A. Jónsson. Nash equilibrium Nash equilibrium Nash equilibrium N players playing a dominant strategy is a Nash equilibrium N players.
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”
1 Introduction to Approximation Algorithms Lecture 15: Mar 5.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”
Piyush Kumar (Lecture 2: PageRank) Welcome to COT5405.
The Multiplicative Weights Update Method Based on Arora, Hazan & Kale (2005) Mashor Housh Oded Cats Advanced simulation methods Prof. Rubinstein.
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Small clique detection and approximate Nash equilibria Danny Vilenchik UCLA Joint work with Lorenz Minder.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 11.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 21.
6.896: Topics in Algorithmic Game Theory Lecture 10 Constantinos Daskalakis.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 10.
Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Shall we play a game? Game Theory and Computer Science Game Theory /06/05 - Zero-sum games - General-sum games.
Algorithms for solving two-player normal form games
ECE 8443 – Pattern Recognition Objectives: Jensen’s Inequality (Special Case) EM Theorem Proof EM Example – Missing Data Intro to Hidden Markov Models.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
1 Probability and Statistical Inference (9th Edition) Chapter 5 (Part 2/2) Distributions of Functions of Random Variables November 25, 2015.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.
Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 8.
Parameterized Two-Player Nash Equilibrium Danny Hermelin, Chien-Chung Huang, Stefan Kratsch, and Magnus Wahlstrom..
TU/e Algorithms (2IL15) – Lecture 12 1 Linear Programming.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 9.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 12.
Theory of Computational Complexity Probability and Computing Ryosuke Sasanuma Iwama and Ito lab M1.
Theory of Computational Complexity M1 Takao Inoshita Iwama & Ito Lab Graduate School of Informatics, Kyoto University.
Comp/Math 553: Algorithmic Game Theory
Computing a Nash Equilibrium of a Congestion Game: PLS-completeness
Berlin Chen Department of Computer Science & Information Engineering
Normal Form (Matrix) Games
Presentation transcript:

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 13

Special Classes of Games

Special Classes of 2-player Games small probability games (  equilibrium with non-trivial support on a linear number of strategies) zero-sum two-player games ( ) poly-time solvable (lecture 2) low-rank two-player games ( ) PTAS[Kannan, Theobald ’09] sparse two-player games ( constant number of non-zero entries in each row, column) [Daskalakis, Papadimitriou ’09]PTAS [Daskalakis, Papadimitriou ’09] PTAS win-lose games (all payoff entries in {0,1}) exact is PPAD-complete [follows from Abbott, Kane, Valiant ’05] also no FPTAS [Chen, Teng, Valiant ’07] note : exact is PPAD-complete [Chen,Deng,Teng 06] note : exact is PPAD-complete no PTAS is known.. rank-1 two-player games ( R + C has rank 1 ) poly-time solvable [Adsul-Garg-Mehta-Sohoni ’10]

Special Classes of Graphical Games line / cylic graphical games (many players, 2 strategies per player) exact algorithm[Elkind, Goldberg, Goldberg ’06] the only class of graphs where exact equilibria can be computed limitations on the graph structure: trees (many players, constant #strategies) FTPAS[Kearns, Littman, Singh ’01] bounds on the cyclicity of the graph: e.g. if d, s are bounded, and t = O(log n), the above algorithm is a PTAS, since the input size is. Theorem [Daskalakis, Papadimitriou ’08] An -Nash equilibrium of a graphical game with n players, maximum degree d, treewidth t, and at most s strategies per player can be computed in time polynomial in n and. 2

Idea of these algorithms dynamic programming + discretization + TV bound assume that the players only use mixed strategies in probabilities that are multiples of a fixed fraction find the best discretized collection of mixed strategies What is the loss in approximation due to the discretization?

[ Total Variation Distance Def: The total variation (TV) distance between two random variables X and Y is the L1 distance of their PDFs. ]

The TV Bound In a game, the mixed strategy of each player is a random variable independent of the random variables of the other players. The effect of the discretization is to replace the random variable X i corresponding to player i ’s mixed strategy with another variable Y i whose probability for every pure strategy is an integer multiple of the discretization parameter. How much does the payoff of a player change if we replace X = (X 1, X 2, …, X n ) by Y = (Y 1, Y 2, …, Y n ) ? using independence

The TV Bound How much does the payoff of a player change if we replace X = (X 1, X 2, …, X n ) by Y = (Y 1, Y 2, …, Y n ) ? If I’m allowed to use discretization, I can make sure that degree #strategies strategy set of player i choose for approximation of. 2

Idea of these algorithms dynamic programming + discretization + TV bound because of TV bound, the best discretized collection of mixed strategies is guaranteed to be an. -Nash equilibrium assume that the players only use mixed strategies in probabilities that are multiples of. 2 runtime: 2

multiplayer zero-sum games (review from lecture 3)

Multiplayer Zero-Sum, wha? Take an arbitrary two-player game, between Alice and Bob. Add a third player, Eve, who does not affect Alice or Bob’s payoffs, but receives payoff The game is zero-sum, but solving it is PPAD-complete. intractability even for 3 player, if three-way interactions are allowed. What if only pairwise interactions are allowed?

Polymatrix Games - players are nodes of a graph G - player’s payoff is the sum of payoffs from all adjacent edges … … - edges are 2-player games N.B. finding a Nash equilibrium is PPAD- complete for general games on the edges [D, Gold, Pap ’06] What if the total sum of players’ payoffs is always 0?

Polymatrix Games Theorem [Daskalakis-Papadimitriou ’09, Cai-Daskalakis’11] - a Nash equilibrium can be found efficiently with linear-programming; - if every node uses a no-regret learning algorithm (such as the MWU method from Lecture 4), the players’ behavior converges to a Nash equilibrium. - the Nash equilibria comprise a convex set; strong indication that Nash eq. makes sense in this setting. i.e. payoffs approach equilibrium payoffs, and empirical strategies approach Nash equilibrium If the global game is zero-sum: essentially the broadest class of zero- sum games we could hope to solve

Anonymous Games

anonymous games Every player is (potentially) different, but only cares about how many players (of each type) play each of the available strategies. e.g. symmetry in auctions, congestion games, social phenomena, etc. ‘‘The women of Cairo: Equilibria in Large Anonymous Games.’’ Blonski, Games and Economic Behavior, “Partially-Specified Large Games.” Ehud Kalai, WINE, ‘‘Congestion Games with Player- Specific Payoff Functions.’’ Milchtaich, Games and Economic Behavior, all players share the same set of strategies: S = {1,…, s} - payoff functions: u p = u p (σ ; n 1, n 2,…,n s ) number of the other players choosing each strategy in S choice of p Description Size: O(min {s n s, n s n })

PTAS There is a PTAS for anonymous games with a constant #strategies. Theorem [Daskalakis, Papadimitriou ’07, ’08]: Remarks: - exact computation is not known to be PPAD-complete for multi-player anonymous games with a constant number of strategies; - on the flip side, if n is small and s is large (few players, many strategies) then trivially PPAD-complete, since general 2-player games can be reduced to this.

sketch of algorithm for 2 strategies p2p2 p1p1 discretize [0,1] n into multiples of δ, and restrict search to the discrete space pick best point in discrete space   since 2 strategies per player, Nash equilibrium lies in [0,1] n

sketch for 2 strategies (cont.) Basic Question: what grid size  is required for  - approximation? if function of  only  PTAS if function also of n  nothing p2p2 p1p1   First trouble: size of search space 1  n but will deal with this later

Theorem [Daskalakis, Papadimitriou ’07]: Given - n ind. Bernoulli’s X i with expectations p i, i =1,…, n there exists another set of Bernoulli’s Y i with expectations q i such that - a constant  independent of n q i ’s are integer multiples of  in fact: N.B. argument from last lecture gives sketch for 2 strategies (cont.)

The TV Bound How much does player p’s payoff from pure strategy σ change if we replace X = (X 1, X 2, …, X n ) with Y = (Y 1, Y 2, …, Y n ) ? Given previous theorem, can guarantee that there exists a discretized point making the above difference at most by selecting.

Completing the algorithm dynamic programming + discretization + TV bound complete this step (Exercise) assume that the players only use mixed strategies in probabilities that are multiples of. enough to guarantee a discretized - Nash equilibrium Resulting running time for 2 strategies.

Theorem [Daskalakis, Papadimitriou ’07]: Given - n ind. Bernoulli’s X i with expectations p i, i =1,…, n The first probabilistic approximation theorem there exists another set of Bernoulli’s Y i with expectations q i such that - a constant  independent of n q i ’s are integer multiples of  in fact: argument from last time gives

proof of approximation result Law of Rare Events + CLT - rounding p i ’s to the closest multiple of  gives total variation n  - probabilistic rounding up or down quickly runs into problems - what works: Poisson Approximations Berry-Esséen (Stein’s Method)

proof of approximation result Intuition: If p i ’s were small  would be close to a Poisson with mean  define the q i ’s so that

Poisson approximation is only good for small values of p i ’s. (LRE) proof of approximation result For intermediate values of p i ’s, Normals are better. (CLT) Berry-Esséen

Anonymous Games Summary 2-strategies per player: [DP ’07] constant #strategies per player: bad function of s [DP ’08]

is there a faster PTAS? Theorem [Daskalakis ’08]: There is an oblivious PTAS with running time - or, at most mix, and they choose mixed strategies which are integer multiples of Theorem [D’08]: In every anonymous game there exists an ε-approximate Nash equilibrium in which the underlying structural result… - either all players who mix play the same mixed strategy

Lemma: - The sum of m ≥ k 3 indicators X i with expectations in [1/k,1-1/k] is O(1/k)-close in total variation distance to a Binomial distribution with the same mean and variance the corresponding symmetry… … i.e. close to a sum of indicators with the same expectation [tightness of parameters by Berry-Esséen]

proof of structural result round some of the X i ’s falling here to 0 and some of them to ε so that the total mean is preserved to within ε - if more than 1/ε 3 X i ’s are left here, appeal to previous slide (Binomial appx) similarly 0 ε 1-ε1 0 ε ε 1 - o.w. use Dask. Pap. ’07 (exists rounding into multiples of ε 2 )

Anonymous Games Summary 2-strategies per player: [DP ’07] [D ’08] constant #strategies per player: bad function of s [DP ’08]

Is there an even faster PTAS? Theorem [Daskalakis, Papadimitriou ’08]: There is a non-oblivious PTAS with running time the underlying probabilistic result [DP ’08]: If two sums of indicators have equal moments up to moment k then their total variation distance is O(2 -k ).

Anonymous Games Summary 2-strategies per player: [DP ’07] [D ’08] [DP ’09] constant #strategies per player: bad function of s is there an FPTAS?