Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.

Slides:



Advertisements
Similar presentations
6.896: Topics in Algorithmic Game Theory Lecture 21 Yang Cai.
Advertisements

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman TexPoint fonts used in EMF. Read the TexPoint manual.
Price of Stability Li Jian Fudan University May, 8 th,2007 Introduction to.
Price Of Anarchy: Routing
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
How Bad is Selfish Routing? By Tim Roughgarden Eva Tardos Presented by Alex Kogan.
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
Coalition Formation and Price of Anarchy in Cournot Oligopolies Joint work with: Nicole Immorlica (Northwestern University) Georgios Piliouras (Georgia.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
ECO290E: Game Theory Lecture 4 Applications in Industrial Organization.
Combinatorial Algorithms
An Introduction to Game Theory Part I: Strategic Games
1 Algorithmic Game Theoretic Perspectives in Networking Dr. Liane Lewin-Eytan.
Mechanism Design without Money Lecture 4 1. Price of Anarchy simplest example G is given Route 1 unit from A to B, through AB,AXB OPT– route ½ on AB and.
Strategic Network Formation and Group Formation Elliot Anshelevich Rensselaer Polytechnic Institute (RPI)
1 Best-Reply Mechanisms Noam Nisan, Michael Schapira and Aviv Zohar.
Computational Game Theory
Local Connection Game. Introduction Introduced in [FLMPS,PODC’03] A LCG is a game that models the creation of networks two competing issues: players want.
Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.
The Price Of Stability for Network Design with Fair Cost Allocation Elliot Anshelevich, Anirban Dasgupta, Jon Kleinberg, Eva Tardos, Tom Wexler, Tim Roughgarden.
Selfish Caching in Distributed Systems: A Game-Theoretic Analysis By Byung-Gon Chun et al. UC Berkeley PODC’04.
Local Connection Game. Introduction Introduced in [FLMPS,PODC’03] A LCG is a game that models the creation of networks two competing issues: players want.
Bayesian Combinatorial Auctions Giorgos Christodoulou, Annamaria Kovacs, Michael Schapira האוניברסיטה העברית בירושלים The Hebrew University of Jerusalem.
Near Optimal Network Design With Selfish Agents Eliot Anshelevich Anirban Dasupta Eva Tardos Tom Wexler Presented by: Andrey Stolyarenko School of CS,
Load Balancing, Multicast routing, Price of Anarchy and Strong Equilibrium Computational game theory Spring 2008 Michal Feldman.
Near-Optimal Network Design with Selfish Agents By Elliot Anshelevich, Anirban Dasgupta, Eva Tardos, Tom Wexler STOC’03 Presented by Mustafa Suleyman CIFTCI.
Potential games, Congestion games Computational game theory Spring 2010 Adapting slides by Michal Feldman TexPoint fonts used in EMF. Read the TexPoint.
Algorithmic Issues in Non- cooperative (i.e., strategic) Distributed Systems.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Congestion Games (Review and more definitions), Potential Games, Network Congestion games, Total Search Problems, PPAD, PLS completeness, easy congestion.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Second case study: Network Creation Games (a.k.a. Local Connection Games)
Inefficiency of Equilibria: POA, POS, SPOA Based on Slides by Amir Epstein and by Svetlana Olonetsky Modified/Corrupted by Michal Feldman and Amos Fiat.
Near-Optimal Network Design With Selfish Agents Elliot Anshelevich, Anirban Dasgupta, Éva Tardos, Tom Wexler STOC’03, June 9–11, 2003, San Diego, California,
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Price of Anarchy Bounds Price of Anarchy Convergence Based on Slides by Amir Epstein and by Svetlana Olonetsky Modified/Corrupted by Michal Feldman and.
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©
Instructor: YE, Deshi Local Search Instructor: YE, Deshi
Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.
Approximating Minimum Bounded Degree Spanning Tree (MBDST) Mohit Singh and Lap Chi Lau “Approximating Minimum Bounded DegreeApproximating Minimum Bounded.
Inoculation Strategies for Victims of Viruses and the Sum-of-Squares Partition Problem Kevin Chang Joint work with James Aspnes and Aleksandr Yampolskiy.
On a Network Creation Game PoA Seminar Presenting: Oren Gilon Based on an article by Fabrikant et al 1.
Connections between Learning Theory, Game Theory, and Optimization Maria Florina (Nina) Balcan Lecture 14, October 7 th 2010.
Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.
Price of Anarchy Georgios Piliouras. Games (i.e. Multi-Body Interactions) Interacting entities Pursuing their own goals Lack of centralized control Prediction?
1 Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Stanford University.
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.
Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.
Vasilis Syrgkanis Cornell University
1 Chapter 12 Local Search Slides by Kevin Wayne Pearson-Addison Wesley. All rights reserved.
Computational Game Theory: Network Creation Game Arbitrary Payments Credit to Slides To Eva Tardos Modified/Corrupted/Added to by Michal Feldman and Amos.
CSCI 256 Data Structures and Algorithm Analysis Lecture 2 Some slides by Kevin Wayne copyright 2005, Pearson Addison Wesley all rights reserved, and some.
Market Design and Analysis Lecture 2 Lecturer: Ning Chen ( 陈宁 )
The Price of Routing Unsplittable Flow Yossi Azar Joint work with B. Awerbuch and A. Epstein.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Approximation Algorithms based on linear programming.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Congestion games Computational game theory Fall 2010
CSCI 3210: Computational Game Theory
Presented By Aaron Roth
Network Formation Games
Selfish Load Balancing
Network Formation Games
Presentation transcript:

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman

Inefficiency of equilibria Outcome of rational behavior might be inefficient How to measure inefficiency? – E.g., prisoner’s dilemma Define an objective function – Social welfare (= sum of players’ payoffs): utilitarian – Maximize min i u i (egalitarian) – … 0,53,3 1,15,0

Inefficiency of equilibria To measure inefficiency we need to specify: – Objective function – Definition of approximately optimal – Definition of an equilibrium – If multiple equilibria exist, which one do we consider?

Common measures Price of anarchy (poa)=cost of worst NE / cost of OPT Price of stability (pos)=cost of best NE / cost of OPT – Note: poa, pos ≥ 1 (by definition) Approximation ratio: Measures price of limited computational resources Competitive ratio: Measures price of not knowing future Price of anarchy: Measures price of lack of coordination

Price of anarchy Example: in prisoner’s dilemma, poa = pos = 3 – But can be as large as desired Wish to find games in which pos or poa are bounded – NE “approximates” OPT – Might explains Internet efficiency. Suppose we define poa and pos w.r.t. NE in pure strategies – we first need to prove existence of pure NE 0,53,3 1,15,0 Prisoner’s dilemma

Max-cut game Given undirected graph G = (V,E) players are nodes v in V An edge (u,v) means u “hates” v (and vice versa) Strategy of node i: s i  {Black,White} Utility of node i: # neighbors of different color Lemma: for every graph G, corresponding game has a pure NE

Proof 1 Claim: OPT of max-cut defines a NE Proof: – Define strategies of players by cut (i.e., one side is Black, other side is White) – Suppose a player i wishes to switch strategies: i’s benefit from switching = improvement in value of the cut – Contradicting optimality of cut u i =1 u i =2

Proof 2 Algorithm greedy-find-cut (GFC): – Start with arbitrary partition of nodes into two sets – If exists node with more neighbors in other side, move it to other side (repeat until no such node exists) Claim 1: GFC provides 2-approx. to max-cut, and runs in polynomial time Proof: – Poly time: GFC terminates within at most |E| steps (since every step improves the value of the solution in at least 1, and |E| is a trivial upper bound to solution) – 2-approx.: Each node ends up with more neighbors in other side than in own side, so at least |E|/2 edges are in cut (since #edges in cut > #edges not in cut)

Proof 2 (cont’d) Claim 2: cut obtained by GFC defines a NE Proof: obvious, as each player stops only if his strategy is the best response to the other players’ strategies Conclusion: max-cut game admits a NE in pure strategies

Potential games Definition: a game is a potential game if there exists a function  :S1×…×Sn  R s.t.  i,s i,s - i,s i ’, c i (s i,s -i ) > c i (s i ’,s -i ) IFF  (s i,s -i ) >  (s i ’,s -i ) Note: G is an exact potential game if c i (s i,s -i ) - c i (s i ’,s -i ) =  (s i,s -i ) -  (s i ’,s -i ) Example: max-cut is an exact potential game, where  is the cut size – Unfortunately,  is not always so natural

Potential games Lemma: a game is a potential game IFF local improvements always terminate proof: – Define a directed graph with a node for each possible pure strategy profile – Directed edge (u,v) means v (which differs from u only in the strategy of a single player, i) is a (strictly) better action for i, given the strategies of the other players – A potential function exists IFF graph does not contains cycles If cycle exists, no potential function; e.g., (a,b,c,a) means f(a)<f(b)<f(c)<f(a) If no cycles exist, can easily define a potential function WHY?

Examples direction of local improvement 1,-1-1,1 1,-1 0,53,3 1,15,0 0,02,2 3,30,0 Matching pennies Prisoner’s dilemma Coordination game C D DC col row Which are potential games? Exact potential games? Are the potential functions unique? 0,02,1 1,20,0 Battle of the sexes

Properties of potential games Admit a Nash equilibrium Best-response dynamics converge to NE Price of stability is bounded

Existence of a pure NE Theorem: every potential game admits a pure NE Proof: we show that the profile minimizing  is a NE – Let s be profile minimizing  – Suppose by contradiction it is not a NE, so i can improve by deviating to a new profile s’ –  (s’) -  (s) = c i (s’) – c i (s) < 0 – Thus  (s’) <  (s), contradicting s minimizes  More generally, the set of pure-strategy Nash equilibria is exactly the set of local minima of the potential function –Local minimum = no player can improve the potential function by herself

Best-response dynamics converge to a NE Best-response dynamics: –Start with any strategy profile –If a player is not best-responding, switch that player’s strategy to a better response (must decrease potential) –Terminate when no player can improve (thus a NE) –Alas, no guarantee on the convergence rate

16 Multicast Routing Multicast routing: Given a directed graph G = (V, E) with edge costs c e  0, a source node s, and k agents located at terminal nodes t 1, …, t k. Agent j must construct a path P j from node s to its terminal t j. Fair share: If x agents use edge e, they each pay c e / x. Slides on cost sharing based on slides by Kevin Wayne Pearson-Addison Wesley. All rights reserved.

17 Multicast Routing outer 2 middle 4 1 pays /2 + 1 middle4 1 outer middle outer 8 2 pays 8 5/ s t1t1 v t2t

18 Nash Equilibrium Example: – Two agents start with outer paths. – Agent 1 has no incentive to switch paths (since ). – Once this happens, agent 1 prefers middle path (since 4 > 5/2 + 1). – Both agents using middle path is a Nash equilibrium. s t1t1 v t2t

Recall price of anarchy and stability Price of anarchy (poa)=cost of worst NE / cost of OPT Price of stability (pos)=cost of best NE / cost of OPT

Socially Optimum Social optimum: Minimizes total costs of all agents. Observation: In general, there can be many Nash equilibria. Even when it is unique, it does not necessarily equal the social optimum. s t1t1 v t2t Social optimum = 7 Unique Nash equilibrium = 8 s t k 1 +  Social optimum = 1 +  Nash equilibrium A = 1 +  Nash equilibrium B = k k agents pos=1, poa=kpos=poa=8/7

Price of anarchy Claim: poa ≤ k Proof: – Let s be the worst NE – Suppose by contradiction c(s) > k OPT – Then, there exists a player i s.t. c i (s) > OPT – But i can deviate to OPT (by paying OPT alone), contradicting s is a NE Note: bound is tight (lower bound in prev. slide)

22 Price of Stability What is price of stability in multicast routing? Lower bound of log k: s t2t2 t3t3 tktk t1t /2 1/31/k  1 + 1/2 + … + 1/k Social optimum: Everyone Takes bottom paths. Unique Nash equilibrium: Everyone takes top paths. Price of stability: H(k) / (1 +  ). upper bound will follow..

23 Finding a potential function Attempt 1: Let  (s) =  j=1 k cost(t i ) be the potential function. A problem: The potential might increase when some agent improve. Example: When all 3 agents use the right path, each pays 4/3 and the potential (total cost) is 4. After one agent moves to the left path the potential increases to 5. s t agents

24 Finding a potential function Attempt 2: Consider a set of paths P 1, …, P k. – Let x e denote the number of paths that use edge e. – Let  (P 1, …, P k ) =  e  E c e · H(x e ) be a potential function. – Consider agent j switching from path P j to path P j '. – Change in agent j’s cost: H(0) = 0,

25 Potential function –  increases by –  decreases by – Thus, net change in  is identical to net change in player j’s cost

26 Bounding the Price of Stability Claim: Let C(P 1, …, P k ) denote the total cost of selecting paths P 1, …, P k. For any set of paths P 1, …, P k, we have Proof: Let x e denote the number of paths containing edge e. – Let E + denote set of edges that belong to at least one of the paths.

27 Bounding the Price of Stability Theorem: There is a Nash equilibrium for which the total cost to all agents exceeds that of the social optimum by at most a factor of H(k) (i.e., price of stability ≤ H(k)). Proof: – Let (P 1 *, …, P k * ) denote set of socially optimal paths. – Run best-response dyn algorithm starting from P *. – Since  is monotone decreasing  (P 1, …, P k )   (P 1 *, …, P k * ). previous claim applied to P previous claim applied to P*

Local search and PLS (polynomial local search) Local optimization problem: find a local optimum (i.e., no improvement in neighborhood Local optimization problem is in PLS if exists an oracle that for every instance and solution s decides if s is a local optimum; if not returns a better solution s’ in neighborhood of s Finding NE in potential games is in PLS – Define neighborhood of a profile s to be profiles obtained by deviation of a single player – s is local optimum for c(s) =  (s) iff s is a NE

Congestion games [Rosenthal 73] There is a set of resources R Agent i’s set of actions (pure strategies) A i is a subset of 2 R, representing which subsets of resources would meet her needs –Note: different agents may need different resources There exist cost functions c r : {1, 2, 3, …} →  such that agent i’s cost for a = (a i, a -i ) is Σ r  a i c r (n r (a)) –n r (a) is the number of agents that chose r as one of their resources in the profile a

Example: multicast routing Resources = edges Each resource r has a cost c r Player 1’s action set: {{A}, {C,D}} Player 2’s action set: {{B}, {C,E}} For all resources r, c r (n r (a)) = c r / n r (a) s t1t1 v t2t2 E A 4 C D B

Every congestion game is an exact potential game Use potential  (a) = Σ r Σ 1 ≤ i ≤ nr(a) c r (i) –One interpretation: the sum of the costs that the agents would have received if each agent were unaffected by all later agents Why is this a correct potential function? Suppose an agent changes action: stop using some resources (R-), start using others (R+) increase in the agent’s cost equals Σ r  R+ c r (n r (a) + 1) - Σ r  R- c r (n r (a)) This is exactly the change in the potential function above –Conclusion: congestion games are potential games