1 Robust Price of Anarchy Bounds via Smoothness Arguments Tim Roughgarden Stanford University.

Slides:



Advertisements
Similar presentations
Advanced Piloting Cruise Plot.
Advertisements

Feichter_DPG-SYKL03_Bild-01. Feichter_DPG-SYKL03_Bild-02.
Cognitive Radio Communications and Networks: Principles and Practice By A. M. Wyglinski, M. Nekovee, Y. T. Hou (Elsevier, December 2009) 1 Chapter 12 Cross-Layer.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Appendix 01.
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 38.
Chapter 1 Image Slides Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Addition Facts
Minimizing Efficiency Loss in Mechanism and Protocol Design Tim Roughgarden (Stanford) includes joint work with: Shuchi Chawla (Wisconsin), Ho-Lin Chen.
Correctness of Gossip-Based Membership under Message Loss Maxim GurevichIdit Keidar Technion.
Selfish Flows over Time Umang Bhaskar, Lisa Fleischer Dartmouth College Elliot Anshelevich Rensselaer Polytechnic Institute.
1 Outline relationship among topics secrets LP with upper bounds by Simplex method basic feasible solution (BFS) by Simplex method for bounded variables.
Break Time Remaining 10:00.
Real-Time Competitive Environments: Truthful Mechanisms for Allocating a Single Processor to Sporadic Tasks Anwar Mohammadi, Nathan Fisher, and Daniel.
ABC Technology Project
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman TexPoint fonts used in EMF. Read the TexPoint manual.
Conditional Equilibrium Outcomes via Ascending Price Processes Joint work with Hu Fu and Robert Kleinberg (Computer Science, Cornell University) Ron Lavi.
1 Approximation in Algorithmic Game Theory Robust Approximation Bounds for Equilibria and Auctions Tim Roughgarden Stanford University.
1 Atomic Routing Games on Maximum Congestion Costas Busch Department of Computer Science Louisiana State University Collaborators: Rajgopal Kannan, LSU.
Shadow Prices vs. Vickrey Prices in Multipath Routing Parthasarathy Ramanujam, Zongpeng Li and Lisa Higham University of Calgary Presented by Ajay Gopinathan.
The Weighted Proportional Resource Allocation Milan Vojnović Microsoft Research Joint work with Thành Nguyen Microsoft Research Asia, Beijing, April, 2011.
THE PRICE OF STOCHASTIC ANARCHY Christine ChungUniversity of Pittsburgh Katrina LigettCarnegie Mellon University Kirk PruhsUniversity of Pittsburgh Aaron.
15. Oktober Oktober Oktober 2012.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Slide 1 of 31 Noam Nisan Approximation Mechanisms: computation, representation, and incentives Noam Nisan Hebrew University, Jerusalem Based on joint works.
Routing and Congestion Problems in General Networks Presented by Jun Zou CAS 744.
Variants of Nash Equilibrium CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte 1.
25 seconds left…...
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Clock will move after 1 minute
Murach’s OS/390 and z/OS JCLChapter 16, Slide 1 © 2002, Mike Murach & Associates, Inc.
Peter Key, Laurent Massoulie, Don Towsley Infocom 07 presented by Park HoSung 1 Path selection and multipath congestion control.
Auction Theory Class 5 – single-parameter implementation and risk aversion 1.
1 A Graph-Theoretic Network Security Game M. Mavronicolas , V. Papadopoulou , A. Philippou  and P. Spirakis § University of Cyprus, Cyprus  University.
Pure and Bayes-Nash Price of Anarchy for GSP Renato Paes Leme Éva Tardos CornellCornell & MSR.
Simple and Near-Optimal Auctions Tim Roughgarden (Stanford)
Potential games are necessary to ensure pure Nash equilibria in cost sharing games.
Price Of Anarchy: Routing
How Bad is Selfish Routing? By Tim Roughgarden Eva Tardos Presented by Alex Kogan.
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
Price of Total Anarchy June 2008 Slides by Israel Shalom Based on “Regret Minimization and the Price of Total Anarchy” By Avrim Blum, MohammadTaghi Hajiaghayi,
1 Smooth Games and Intrinsic Robustness Christodoulou and Koutsoupias, Roughgarden Slides stolen/modified from Tim Roughgarden TexPoint fonts used in EMF.
The price of anarchy of finite congestion games Kapelushnik Lior Based on the articles: “ The price of anarchy of finite congestion games ” by Christodoulou.
Bottleneck Routing Games in Communication Networks Ron Banner and Ariel Orda Department of Electrical Engineering Technion- Israel Institute of Technology.
Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.
1 On the price of anarchy and stability of correlated equilibria of linear congestion games By George Christodoulou Elias Koutsoupias Presented by Efrat.
Bayesian Combinatorial Auctions Giorgos Christodoulou, Annamaria Kovacs, Michael Schapira האוניברסיטה העברית בירושלים The Hebrew University of Jerusalem.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Noam Nisan Non-price Equilibria in Markets of Discrete goods Avinatan Hassidim, Haim Kaplan, Yishay Mansour, Noam Nisan.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Price of Anarchy Bounds Price of Anarchy Convergence Based on Slides by Amir Epstein and by Svetlana Olonetsky Modified/Corrupted by Michal Feldman and.
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©
Introduction 1 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A.
The Effectiveness of Stackelberg strategies and Tolls for Network Congestion Games Chaitanya Swamy University of Waterloo.
Price of Anarchy Georgios Piliouras. Games (i.e. Multi-Body Interactions) Interacting entities Pursuing their own goals Lack of centralized control Prediction?
1 Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Stanford University.
Network Congestion Games
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.
Vasilis Syrgkanis Cornell University
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Price of Anarchy and Stability Liad Blumrosen ©
The Price of Routing Unsplittable Flow Yossi Azar Joint work with B. Awerbuch and A. Epstein.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Congestion games Computational game theory Fall 2010
Presented By Aaron Roth
The Price of Routing Unsplittable Flow
Network Formation Games
Presentation transcript:

1 Robust Price of Anarchy Bounds via Smoothness Arguments Tim Roughgarden Stanford University

2 Price of Anarchy Definition: price of anarchy (POA) of a game (w.r.t. some objective function, eq concept): optimal obj fn value equilibrium objective fn value the closer to 1 the better

3 Price of Anarchy Definition: price of anarchy (POA) of a game (w.r.t. some objective function, eq concept): optimal obj fn value equilibrium objective fn value the closer to 1 the better st 2x 12 5x 5 cost = = 24 cost = = 28 st 2x 12 5x 5 0 0

4 The Need for Robustness Meaning of a POA bound: if the game is at an equilibrium, then outcome is near-optimal.

5 The Need for Robustness Meaning of a POA bound: if the game is at an equilibrium, then outcome is near-optimal. Problem: what if cant reach equilibrium? (pure) equilibrium might not exist might be hard to compute, even centrally –[Fabrikant/Papadimitriou/Talwar], [Daskalakis/ Goldbeg/Papadimitriou], [Chen/Deng/Teng], etc. might be hard to learn in distributed way Worry: are our POA bounds meaningless?

6 Fix #1: Provable Convergence One Good Research Agenda: prove that natural learning dynamics converge (quickly) to equilibrium. [Hart/Mas-Collell], [Even-Dar/Kesselman/Mansour], [Fisher/ Racke/Voecking], [Chien/Sinclair], [Awerbuch et al], [Even- Dar/Mansour/Nadav], [Kleinberg/Piliouras/Tardos], etc. Problem: this best-case scenario has limited reach non-existence/complexity issues lower bounds on convergence time of natural dynamics (e.g., [Skopalik/Voecking STOC 08])

7 Fix #2: Robust POA Bounds High-Level Goal: worst-case bounds that apply even to non-equilibrium outcomes! best-response dynamics, pre-convergence –[Mirrokni/Vetta 04], [Goemans/Mirrokni/Vetta 05], [Awerbuch/Azar/Epstein/Mirrokni/Skopalik 08] correlated equilibria –[Christodoulou/Koutsoupias 05] coarse correlated equilibria aka price of total anarchy aka no-regret players –[Blum/Even-Dar/Ligett 06], [Blum/Hajiaghayi/Ligett/Roth 08]

8 Abstract Setup n players, each picks a strategy s i player i incurs a cost C i (s) Important Assumption: objective function is cost(s) := i C i (s) Key Definition: A game is (λ,μ)-smooth if, for every pair s,s * outcomes (λ > 0; μ < 1): i C i (s * i,s -i ) λcost(s * ) + μcost(s) [(*)]

9 Smooth => POA Bound Next: canonical way to upper bound POA (via a smoothness argument). notation: s = a Nash eq; s * = optimal Assuming (λ,μ)-smooth: cost(s) = i C i (s) [defn of cost] i C i (s * i,s -i ) [s a Nash eq] λcost(s * ) + μcost(s) [(*)] Then: POA (of pure Nash eq) λ/(1-μ).

10 Why Is Smoothness Stronger? Key point: to derive POA bound, only needed i C i (s * i,s -i ) λcost(s * ) + μcost(s) [(*)] to hold in special case where s = a Nash eq and s * = optimal. Smoothness: requires (*) for every pair s,s * outcomes. –even if s is not a pure Nash equilibrium

11 Some Smoothness Bounds atomic (unweighted) selfish routing [Awerbuch/Azar/Epstein 05], [Christodoulou/Koutsoupias 05], [Aland/Dumrauf/Gairing/Monien/Schoppmann 06], [Roughgarden 09] nonatomic selfish routing [Roughgarden/Tardos 00],[Perakis 04] [Correa/Schulz/Stier Moses 05] weighted congestion games [Aland/Dumrauf/Gairing/Monien/Schoppmann 06], [Bhawalkar/Gairing/Roughgarden 10] submodular maximization games [Vetta 02], [Marden/Roughgarden 10] coordination mechanisms [Cole/Gkatzelis/Mirrokni 10]

Application: Out-of-Equilibria Definition: a sequence s 1,s 2,...,s T of outcomes is no-regret if: for each player i, each fixed action q i : –average cost player i incurs over sequence no worse than playing action q i every time –simple hedging strategies can be used by players to enforce this (for suff large T) Theorem: [Roughgarden STOC 09] in a (λ,μ)- smooth game, average cost of every no- regret sequence at most [λ/(1-μ)] x cost of optimal outcome. 12

13 Smooth => POTA Bound notation: s 1,s 2,...,s T = no regret; s * = optimal Assuming (λ,μ)-smooth: t cost(s t ) = t i C i (s t ) [defn of cost]

14 Smooth => POTA Bound notation: s 1,s 2,...,s T = no regret; s * = optimal Assuming (λ,μ)-smooth: t cost(s t ) = t i C i (s t ) [defn of cost] = t i [C i (s * i,s t -i ) + i,t ] [ i,t := C i (s t )- C i (s * i,s t -i )]

15 Smooth => POTA Bound notation: s 1,s 2,...,s T = no regret; s * = optimal Assuming (λ,μ)-smooth: t cost(s t ) = t i C i (s t ) [defn of cost] = t i [C i (s * i,s t -i ) + i,t ] [ i,t := C i (s t )- C i (s * i,s t -i )] t [λcost(s * ) + μcost(s t )] + i t i,t [(*)]

16 Smooth => POTA Bound notation: s 1,s 2,...,s T = no regret; s * = optimal Assuming (λ,μ)-smooth: t cost(s t ) = t i C i (s t ) [defn of cost] = t i [C i (s * i,s t -i ) + i,t ] [ i,t := C i (s t )- C i (s * i,s t -i )] t [λcost(s * ) + μcost(s t )] + i t i,t [(*)] No regret: t i,t 0 for each i. To finish proof: divide through by T.

17 Why Important? pure Nash mixed Nash correlated eq no regret bound on no-regret sequences implies bound on inefficiency of mixed and correlated equilibria bound applies even to sequences that dont converge in any sense no regret much weaker than reaching equilibrium e.g., if every player uses multiplicative weights then get o(1) regret in poly-time

Tight Game Classes Theorem: [Roughgarden STOC 09] for every set C, unweighted congestion games with cost functions restricted to C are tight: maximum [pure POA] = minimum [λ/(1-μ)] congestion games w/cost functions in C (λ,μ): all such games are (λ,μ)-smooth 18

Tight Game Classes Theorem: [Roughgarden STOC 09] for every set C, unweighted congestion games with cost functions restricted to C are tight: maximum [pure POA] = minimum [λ/(1-μ)] weighted congestion games [Bhawalkar/ Gairing/Roughgarden ESA 10] and submodular maximization games [Marden/Roughgarden CDC 10] are also tight in this sense congestion games w/cost functions in C (λ,μ): all such games are (λ,μ)-smooth 19

20 Further Applications pure Nash mixed Nash correlated eq no regret best- response dynamics approximate Nash Theorem: in a (λ,μ)-smooth game, everything in these sets costs (essentially) λ/(1-μ) x OPT.

21 More Applications? Theorem: [Nadav/Roughgarden 10] Consider a (λ,μ)-smooth game for optimal choices of λ,μ. Then precisely the aggregate coarse correlated equilibria have cost λ/(1-μ) x OPT. essentially, bound holds if and only if the average (rather than per-player) regret is non-positive Proof: convex duality.

When Is a POA Bound a Smoothness Proof? Need to show: for every pair s,s * outcomes: i C i (s * i,s -i ) λcost(s * ) + μcost(s) Generally sufficient: prove POA bound for pure Nash equilibria such that: invoke best-response condition once/player hypothetical deviation by i independent of s -i Non-example: network creation games [Fabrikant et al], [Albers et al], [Demaine et al], etc. 22

Application: POA of Non- Truthful Mechanisms Mechanism Design: design protocol with desirable outcome despite selfish participants example: Vickrey (second-price) auction focus thus far on "truthful" mechanisms Non-truthful mechanisms: motivated by simplicity (e.g., sponsored search auctions) low communication complexity low computational complexity 23

Application: POA of Non- Truthful Mechanisms Fact: plausible outcomes of non-truthful mechanisms = Bayes-Nash equilibria POA results for simple non-truthful auctions in [Christodoulou/Kovacs/Schapira ICALP 08] and [Borodin/Lucier SODA 10] first bound POA of Nash equilibria, then "by magic" same bound holds for Bayes-Nash Fact: these are automatic consequences of smoothness. 24

POA of Non-Truthful Mechanisms (continued) [Paes Leme/Tardos FOCS 10] POA upper bounds for welfare of Generalized Sponsored Search auction different bounds for pure (1.618), mixed (4), and Bayes-Nash eq (8) (without smoothness) Open Questions: compute best smoothness bound prove a lower bound separating the best- possible pure vs. Bayes-Nash POA 25

POA of Non-Truthful Mechanisms (continued) [Bhawalkar/Roughgarden 10] POA upper bounds for welfare of subadditive combinatorial auctions with item bidding –generalizes [Christodoulou/Kovacs/Schapira 08] pure Nash: non-smooth upper bound of 2 Bayes-Nash: lower bound of 2.01, upper bound of 2 ln m [m = # goods] Open Questions: is Bayes-Nash POA = O(1)? 26

Local Smoothness and Splittable Congestion Games Local smoothness: require smoothness condition only for outcomes that are close –assume strategy sets = convex subset of R n can only decrease optimal value of λ/(1-μ) [Harks 08]: local smoothness gives improved POA bounds for splittable congestion games [Roughgarden/Schoppmann 10]: matching lower bounds => first tight bounds in this model [RS10]: local smoothness bounds extend to correlated eq but not to no-regret outcomes! 27

Splittable Congestion Games: Open Questions Open Question #1: determine optimal POA bounds for no-regret sequences. Open Question #2: is (the distribution of) every no-regret sequence a convex combination of pure Nash equilibria? Open Question #3: prove something non-trivial about the POA of symmetric splittable congestion games. 28