Guiding dynamics in potential games Avrim Blum Carnegie Mellon University Joint work with Maria-Florina Balcan and Yishay Mansour [Cornell CSECON 2009]

Slides:

Advertisements

Similar presentations

Best Response Dynamics in Multicast Cost Sharing

Advertisements

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman TexPoint fonts used in EMF. Read the TexPoint manual.

THE PRICE OF STOCHASTIC ANARCHY Christine ChungUniversity of Pittsburgh Katrina LigettCarnegie Mellon University Kirk PruhsUniversity of Pittsburgh Aaron.

Price of Stability Li Jian Fudan University May, 8 th,2007 Introduction to.

Price Of Anarchy: Routing

Fast Convergence of Selfish Re-Routing Eyal Even-Dar, Tel-Aviv University Yishay Mansour, Tel-Aviv University.

Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.

Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.

Online learning, minimizing regret, and combining expert advice

1 Smooth Games and Intrinsic Robustness Christodoulou and Koutsoupias, Roughgarden Slides stolen/modified from Tim Roughgarden TexPoint fonts used in EMF.

Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.

Decentralized Dynamics for Finite Opinion Games Diodato Ferraioli, LAMSADE Paul Goldberg, University of Liverpool Carmine Ventre, Teesside University.

Balázs Sziklai Selfish Routing in Non-cooperative Networks.

1 Algorithmic Game Theoretic Perspectives in Networking Dr. Liane Lewin-Eytan.

Mechanism Design without Money Lecture 4 1. Price of Anarchy simplest example G is given Route 1 unit from A to B, through AB,AXB OPT– route ½ on AB and.

Strategic Network Formation and Group Formation Elliot Anshelevich Rensselaer Polytechnic Institute (RPI)

Maria-Florina Balcan Approximation Algorithms and Online Mechanisms for Item Pricing Maria-Florina Balcan & Avrim Blum CMU, CSD.

Computational Game Theory

On the Topologies Formed by Selfish Peers Thomas Moscibroda Stefan Schmid Roger Wattenhofer IPTPS 2006 Santa Barbara, California, USA.

The price of anarchy of finite congestion games Kapelushnik Lior Based on the articles: “ The price of anarchy of finite congestion games ” by Christodoulou.

Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.

The Price Of Stability for Network Design with Fair Cost Allocation Elliot Anshelevich, Anirban Dasgupta, Jon Kleinberg, Eva Tardos, Tom Wexler, Tim Roughgarden.

On the Price of Stability for Designing Undirected Networks with Fair Cost Allocations Svetlana Olonetsky Joint work with Amos Fiat, Haim Kaplan, Meital.

Near Optimal Network Design With Selfish Agents Eliot Anshelevich Anirban Dasupta Eva Tardos Tom Wexler Presented by: Andrey Stolyarenko School of CS,

On the Price of Stability for Designing Undirected Networks with Fair Cost Allocations M.Sc. Thesis Defense Svetlana Olonetsky.

Potential games, Congestion games Computational game theory Spring 2010 Adapting slides by Michal Feldman TexPoint fonts used in EMF. Read the TexPoint.

Convergence Time to Nash Equilibria in Load Balancing Eyal Even-Dar, Tel-Aviv University Alex Kesselman, Tel-Aviv University Yishay Mansour, Tel-Aviv University.

The Price of Uncertainty Maria-Florina Balcan Georgia Tech Avrim Blum Carnegie Mellon Yishay Mansour Tel-Aviv/Google ACM-EC 2009.

Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:

Price of Anarchy Bounds Price of Anarchy Convergence Based on Slides by Amir Epstein and by Svetlana Olonetsky Modified/Corrupted by Michal Feldman and.

Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.

Constant Price of Anarchy in Network Creation Games via Public Service Advertising Presented by Sepehr Assadi Based on a paper by Erik D. Demaine and Morteza.

1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©

The Structure of Networks with emphasis on information and social networks T-214-SINE Summer 2011 Chapter 8 Ýmir Vigfússon.

1 Network Creation Game A. Fabrikant, A. Luthra, E. Maneva, C. H. Papadimitriou, and S. Shenker, PODC 2003 (Part of the Slides are taken from Alex Fabrikant’s.

CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

NOBEL WP Szept Stockholm Game Theory in Inter-domain Routing LÓJA Krisztina - SZIGETI János - CINKLER Tibor BME TMIT Budapest,

Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.

Inoculation Strategies for Victims of Viruses and the Sum-of-Squares Partition Problem Kevin Chang Joint work with James Aspnes and Aleksandr Yampolskiy.

Dynamic Games & The Extensive Form

Télécom 2A – Algo Complexity (1) Time Complexity and the divide and conquer strategy Or : how to measure algorithm run-time And : design efficient algorithms.

Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.

Connections between Learning Theory, Game Theory, and Optimization Maria Florina (Nina) Balcan Lecture 14, October 7 th 2010.

Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.

Mechanism Design without Money Lecture 3 1. A game You need to get from A to B Travelling on AX or YB takes 20 minutes Travelling on AY or XB takes n.

Price of Anarchy Georgios Piliouras. Games (i.e. Multi-Body Interactions) Interacting entities Pursuing their own goals Lack of centralized control Prediction?

1 Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Stanford University.

Information Theory for Mobile Ad-Hoc Networks (ITMANET): The FLoWS Project Competitive Scheduling in Wireless Networks with Correlated Channel State Ozan.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.

Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.

Improved Equilibria via Public Service Advertising Maria-Florina Balcan TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:

Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.

Vasilis Syrgkanis Cornell University

Computational Game Theory: Network Creation Game Arbitrary Payments Credit to Slides To Eva Tardos Modified/Corrupted/Added to by Michal Feldman and Amos.

Correlation Clustering Nikhil Bansal Joint Work with Avrim Blum and Shuchi Chawla.

1 Bottleneck Routing Games on Grids Costas Busch Rajgopal Kannan Alfred Samman Department of Computer Science Louisiana State University.

The Price of Routing Unsplittable Flow Yossi Azar Joint work with B. Awerbuch and A. Epstein.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.

Satisfaction Games in Graphical Multi-resource Allocation

Congestion games Computational game theory Fall 2010

On a Network Creation Game

Presented By Aaron Roth

Network Formation Games

Circumventing the Price of Anarchy

Network Formation Games

The Price of Uncertainty:

Presentation transcript:

Guiding dynamics in potential games Avrim Blum Carnegie Mellon University Joint work with Maria-Florina Balcan and Yishay Mansour [Cornell CSECON 2009] [This talk based on results in “Improved Equilibria via Public Service Advertising”, SODA’09 and “The Price of Uncertainty”, ACM-EC’09]

Good equilibria, Bad equilibria Many games have both good and bad equilibria. In some places, everyone throws their trash on the street. In some, everyone puts their trash in the trash can. In some places, everyone drives their own car. In some, everybody uses and pays for good public transit.

Good equilibria, Bad equilibria Many games have both good and bad equilibria. A nice formal example is fair cost-sharing. n players in weighted directed graph G. Player i wants to get from s i to t i, and they share cost of edges they use with others. s t 1n Good equilibrium: all use edge of cost 1. (cost 1/n per player) Bad equilibrium: all use edge of cost n. (cost 1 per player)

Good equilibria, Bad equilibria Many games have both good and bad equilibria. A nice formal example is fair cost-sharing. n players in weighted directed graph G. Player i wants to get from s i to t i, and they share cost of edges they use with others. … s1s1 snsn t 000 k ¿ n cars Shared transit

High-level questions 1. Can a helpful authority encourage behavior to move from bad to good? –Model as having some limited powers of persuasion … s1s1 snsn t 000 k ¿ n cars Shared transit

High-level questions 1. Can a helpful authority encourage behavior to move from bad to good? –Model as having some limited powers of persuasion 2. In reverse direction, if we get people into a good equilibrium (and players are selfish, reasonably myopic, etc) then like to think behavior will stay there.

High-level questions 1. Can a helpful authority encourage behavior to move from bad to good? –Model as having some limited powers of persuasion 2. If game has small fluctuations in costs, or a few Byzantine players, (when) could behavior spiral out of control?

Direction 1: guiding from bad to good “Public service advertising model”: 0. n players begin in some arbitrary configuration. 1.Authority launches public-service advertising campaign, proposing joint action s *. … s1s1 snsn t 000 k k

Direction 1: guiding from bad to good “Public service advertising model”: 0. n players begin in some arbitrary configuration. 1.Authority launches public-service advertising campaign, proposing joint action s *. Each player i pays attention and follows with probability . Call these the receptive players … s1s1 snsn t 000 k 1.Authority launches public-service advertising campaign, proposing joint action s *.

Direction 1: guiding from bad to good “Public service advertising model”: 1.Authority launches public-service advertising campaign, proposing joint action s *. Each player i pays attention and follows with probability . Call these the receptive players 2.Remaining (non-receptive) players fall to some arbitrary equilibrium for themselves, given play of receptive players. 3.Campaign wears off. Entire set of players follows best- response dynamics from then on. 0. n players begin in some arbitrary configuration. … s1s1 snsn t 000 k

Direction 1: guiding from bad to good “Public service advertising model”: 1.Authority launches public-service advertising campaign, proposing joint action s *. Each player i pays attention and follows with probability . Call these the receptive players 2.Remaining (non-receptive) players fall to some arbitrary equilibrium for themselves, given play of receptive players. 3.Campaign wears off. Entire set of players follows best- response dynamics from then on. 0. n players begin in some arbitrary configuration. Note #1: if  =1, can just propose best Nash equilibrium. Key issue: what if  < 1?

Direction 1: guiding from bad to good “Public service advertising model”: 1.Authority launches public-service advertising campaign, proposing joint action s *. Each player i pays attention and follows with probability . Call these the receptive players 2.Remaining (non-receptive) players fall to some arbitrary equilibrium for themselves, given play of receptive players. 3.Campaign wears off. Entire set of players follows best- response dynamics from then on. 0. n players begin in some arbitrary configuration. Note #2: Can replace 2 with poly(n) steps of best-response for non-receptive players.

Direction 1: guiding from bad to good Price of stability: how much worse than optimal can the best Nash equilibrium be? Price of anarchy: how much worse than optimal can the worst Nash equilibrium be? E.g., fair cost sharing: may exist NE as bad as n times worse than OPT, but always exists one at most O(log n) times worse than OPT.

Main Results If only a constant fraction  of the players follow the advice, then we can still get within O(log(n)/  ) of OPT. Extend to cost-sharing + linear delays. For any  < 1, an  fraction is not sufficient. Ratio to OPT can still be unbounded. (PoS = log(n), PoA = n) (PoS = 1, PoA = 1 ) (PoS = 1, PoA =  (n 2 )) Threshold behavior: for  > ½, can get ratio O(1), but for  < ½, ratio stays  (n 2 ). (assume degrees  (log n)).

Fair Cost Sharing If only an  probability of players following the advice, then we get within O(log(n)/  ) of OPT. (PoS = log(n), PoA = n) - In any NE for non-receptive players, any such player i can’t improve by switching to his path P i OPT in OPT. - Advertiser proposes OPT (any apx also works) #receptives on edge e - Calculate total cost of these guaranteed options. Rearrange sum... … s1s1 snsn t 000 k

Fair Cost Sharing If only an  probability of players following the advice, then we get within O(log(n)/  ) of OPT. (PoS = log(n), PoA = n) - Calculate total cost of these guaranteed options. Rearrange sum... - Finally, use: X ~ Bi(n,p) - Take expectation, add back in cost of receptives: get O(OPT/  ). (End of phase 2)

Fair Cost Sharing If only an  probability of players following the advice, then we get within O(log(n)/  ) of OPT. (PoS = log(n), PoA = n) - Finally, use: X ~ Bi(n,p) - Take expectation, add back in cost of receptives: get O(OPT/  ). (End of phase 2) - Finally, in last phase, std potential argument shows behavior cannot get worse by more than an additional log(n) factor. (End of phase 3)

Cost Sharing, Extension + linear delays: - Problem: can’t argue as if remaining NR players didn’t exist since they add to delays - Define shadow game: pure linear latency fns. Offset defined by equilib at end of phase 2. # users on e at end of phase 2 - Behavior at end of phase 2 is equilib for this game too. - Show - This has good PoA.

Party affiliation games Given graph G, each edge labeled + or -. Vertices have two actions: RED or BLUE. Pay 1 for each + edge with endpoint of different color, and each – edge with endpoint of same color. Special cases: All + edges is consensus game. All – edges is cut-game. +1 to keep ratios finite

Party affiliation games (PoS = 1, PoA =  (n 2 )) - Threshold behavior: for  > ½, can get ratio O(1), but for  < ½, ratio stays  (n 2 ). (assume degrees  (log n)). - Consensus game, two cliques, with relatively sparse between them. Players “locked” into place. Lower bound: Degree (1/2 -  )n/8 across cut

Party affiliation games (PoS = 1, PoA =  (n 2 )) - Threshold behavior: for  > ½, can get ratio O(1), but for  < ½, ratio stays  (n 2 ). (assume degrees  (log n)). - Split nodes into those incurring low-cost vs those incurring high-cost under OPT. - Show that low-cost will switch to behavior in OPT. For high-cost, don’t care. - Cost only improves in final best-response process. Upper bound:

High-level questions 1. Can a helpful authority encourage behavior to move from bad to good? –Model as having some limited powers of persuasion 2. If game has small fluctuations in costs, or a few byzantine players, could behavior spiral out of control?

Direction #2 A few ways this could happen: –Small changes cause good equilibria to disappear, only bad ones left. (economy?) –Bad behavior by a few players causes pain for all (nukes) –Neither of above, but instead through more subtle interaction with dynamics… If game has small fluctuations in costs, or a few Byzantine players, could behavior spiral out of control?

Model Players follow best (or better) response dynamics. Costs of resources can fluctuate between moves: c i t 2 [c i /(1+  ), c i (1+  )] (alternatively, one or more Byzantine players who move between time steps) Play begins in a low-cost state. How bad can things get? Price-of-Uncertainty (  ) of game = maximum ratio of eventual social cost to initial cost.

Model Players follow best (or better) response dynamics. Costs of resources can fluctuate between moves: c i t 2 [c i /(1+  ), c i (1+  )] Price-of-Uncertainty (  ) of game = maximum ratio of eventual social cost to initial cost.

Model Players follow best (or better) response dynamics. Costs of resources can fluctuate between moves: c i t 2 [c i /(1+  ), c i (1+  )] Price-of-Uncertainty (  ) of game = maximum ratio of eventual social cost to initial cost. One way to look at this: Define graph: one node for each state. Edge u ! v if perturbation can cause BR to move from u to v. What do the reachable sets look like?

Set-cover games Special case of fair cost-sharing n players, m resources, with costs c 1,…,c m. Each player has some allowable resources Each player chooses some allowable resource. Players split cost with all others choosing same one. c1c1 c2c2 c3c3 cmcm

Main results Set-cover games: If  = O(1/nm) then PoU = O(log n). However, for any constant  > 0, PoU =  (n). Also, a single Byzantine player can take state from a PNE of cost O(OPT) to one of cost  (n ¢ OPT).

Main results General fair-cost-sharing games: If many players for each (s i,t i ) pair (n i =  (m)), then PoU = O(1) even for constant  >0. Open for general number of players. Matroid congestion games: Matroid congestion games: (strategy sets are bases of matroid. E.g., set-cover where choose k resources) If  = O(1/nm) then PoU = O(log n) for fair cost- sharing. In general, if  = O(1/nm) then PoU = O(GAP). In both cases, require best-response. Better- response not enough. (unlike set-cover) Also results for other classes of games too.

Set-Cover games (upper bound) For upper bound, think of players in sets as a stack of chips. View i th position in stack j as having cost c j /i. Load chips with value equal to initial cost. When player moves from j to k, move top chip. Cost of position goes up by at most (1+  ) 2. cjcj ckck At most mn different positions. So, following the path of any chip and removing loops, cost of final set is at most (1+  ) 2nm times its value. So, if  = O(1/nm) then PoU = O(log n).

Matroid games In matroid games, can think of each player as controlling a set of chips. Nice property of best response in matroids: –Can always order the move so that each individual chip is doing better-response. Apply previous argument. Fails for better-response. –Here, can get player to do kind of binary counting, bad even for exponentially-small .

Open questions and directions Looking at: how can we help players find their way to a good state? Getting to good states: nice line of work on how players might be able to do it all by themselves. [Blume, Young, Shamma, Marden, Beggs…] Noisy best-response / noisy adaptive play. Distribution in limit favors good states, like simulated annealing. But, time could be exponential (subway). And how dangerous could small fluctuations be in knocking them out?

Open questions and directions Getting to good states: nice line of work on how players might be able to do it all by themselves. [Blume, Young, Shamma, Marden, Beggs…] Noisy best-response / noisy adaptive play. Distribution in limit favors good states, like simulated annealing. But, time could be exponential (subway). To reach good states quickly, need to give players more information about game they are playing. More general, self-interested models for this?