Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Two-person non zero-sum.

Slides:



Advertisements
Similar presentations
Price Of Anarchy: Routing
Advertisements

Game Theory S-1.
15 THEORY OF GAMES CHAPTER.
APPENDIX An Alternative View of the Payoff Matrix n Assume total maximum profits of all oligopolists is constant at 200 units. n Alternative policies.
Two-Player Zero-Sum Games
Operations Research Assistant Professor Dr. Sana’a Wafa Al-Sayegh 2 nd Semester ITGD4207 University of Palestine.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
An Introduction to... Evolutionary Game Theory
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Introduction to Network Mathematics (3) - Simple Games and applications Yuedong Xu 16/05/2012.
© 2015 McGraw-Hill Education. All rights reserved. Chapter 15 Game Theory.
Shengyu Zhang The Chinese University of Hong Kong.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Chapter 15- Game Theory: The Mathematics of Competition
An Introduction to Game Theory Part I: Strategic Games
Game Theory Part 5: Nash’s Theorem.
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
5/16/20151 Game Theory Game theory was developed by John Von Neumann and Oscar Morgenstern in Economists! One of the fundamental principles of.
1 Algorithmic Game Theoretic Perspectives in Networking Dr. Liane Lewin-Eytan.
Mechanism Design without Money Lecture 4 1. Price of Anarchy simplest example G is given Route 1 unit from A to B, through AB,AXB OPT– route ½ on AB and.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Review Midterm3/19 3/12.
Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Algorithmic Issues in Non- cooperative (i.e., strategic) Distributed Systems.
Game Applications Chapter 29. Nash Equilibrium In any Nash equilibrium (NE) each player chooses a “best” response to the choices made by all of the other.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Introduction to Game Theory and its Applications in Computer Networks
Game Theory Objectives:
How Bad is Selfish Routing A survey on existing models for selfish routing Professor John Lui, David Yau and Dah-Ming Qiu presented by Joe W.J. Jiang
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Game Theory.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©
Game Theory.
Game Theory.
Chapter 12 & Module E Decision Theory & Game Theory.
Introduction 1 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A.
Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
THE “CLASSIC” 2 x 2 SIMULTANEOUS CHOICE GAMES Topic #4.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Lecture 2: two-person non.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Beyond Routing Games: Network (Formation) Games. Network Games (NG) NG model the various ways in which selfish users (i.e., players) strategically interact.
Mechanism Design without Money Lecture 3 1. A game You need to get from A to B Travelling on AX or YB takes 20 minutes Travelling on AY or XB takes n.
1 Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Stanford University.
Network Congestion Games
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.
1 CRP 834: Decision Analysis Week Three Notes. 2 Review Decision-Flow Diagram –A road map of all possible strategies and outcomes –At a decision fork,
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish users (i.e., players) strategically interact in using.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
1. 2 You should know by now… u The security level of a strategy for a player is the minimum payoff regardless of what strategy his opponent uses. u A.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Game Theory [geym theer-ee] : a mathematical theory that deals with the general features of competitive situations in a formal abstract way.
Introduction to Game Theory
Introduction to Game Theory
CSCI 3210: Computational Game Theory
UNIT II: The Basic Theory
Network Formation Games
Presentation transcript:

Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Two-person non zero-sum games Giovanni Neglia INRIA – EPI Maestro 28 January 2013 Slides are based on a previous course with D. Figueiredo (UFRJ) and H. Zhang (Suffolk University)

Outline r Two-person zero-sum games m Matrix games Pure strategy equilibria (dominance and saddle points), ch 2 Mixed strategy equilibria, ch 3 m Game trees, ch 7 r Two-person non-zero-sum games m Nash equilibria… …And its limits (equivalence, interchangeability, Prisoner’s dilemma), ch. 11 and 12 m Strategic games, ch. 14 m Subgame Perfect Nash Equilibria (not in the book) m Repeated Games, partially in ch. 12 m Evolutionary games, ch. 15 r N-persons games

Two-person Non-zero Sum Games r Players are not strictly opposed m payoff sum is non-zero AB A3, 42, 0 B5, 1-1, 2 Player 1 Player 2 r Situations where interest is not directly opposed m players could cooperate m communication may play an important role for the moment assume no communication is possible

What do we keep from zero-sum games? r Dominance r Movement diagram m pay attention to which payoffs have to be considered to decide movements r Enough to determine pure strategies equilibria m but still there are some differences (see after) AB A5, 42, 0 B3, 1-1, 2 Player 1 Player 2

What can we keep from zero-sum games? r As in zero-sum games, pure strategies equilibria do not always exist… r …but we can find mixed strategies equilibria AB A5, 0-1, 4 B3, 22, 1 Player 1 Player 2

Mixed strategies equilibria r Same idea of equilibrium m each player plays a mixed strategy (equalizing strategy), that equalizes the opponent payoffs m how to calculate it? AB A5, 0-1, 4 B3, 22, 1 Rose Colin

Mixed strategies equilibria r Same idea of equilibrium m each player plays a mixed strategy, that equalizes the opponent payoffs m how to calculate it? AB A-0-4 B-2 Rose Colin Rose considers Colin’s game 4 1 1/5 4/5

Mixed strategies equilibria r Same idea of equilibrium m each player plays a mixed strategy, that equalizes the opponent payoffs m how to calculate it? AB A5 B32 Rose Colin Colin considers Rose’s game 3/5 2/5

Mixed strategies equilibria r Same idea of equilibrium m each player plays a mixed strategy, that equalizes the opponent payoffs m how to calculate it? AB A5, 0-1, 4 B3, 22, 1 Rose Colin Rose playing (1/5,4/5) Colin playing (3/5,2/5) is an equilibrium Rose gains 13/5 Colin gains 8/5

Good news: Nash’s theorem [1950] r Every two-person games has at least one equilibrium either in pure strategies or in mixed strategies m Proved using fixed point theorem m generalized to N person game r This equilibrium concept called Nash equilibrium in his honor m A vector of strategies (a profile) is a Nash Equilibrium (NE) if no player can unilaterally change its strategy and increase its payoff

A useful property r Given a finite game, a profile is a mixed NE of the game if and only if for every player i, every pure strategy used by i with non-null probability is a best response to other players mixed strategies in the profile m see Osborne and Rubinstein, A course in game theory, Lemma 33.2

Bad news: what do we lose? r equivalence r interchangeability r identity of equalizing strategies with prudential strategies r main cause m at equilibrium every player is considering the opponent’s payoffs ignoring its payoffs. r New problematic aspect m group rationality versus individual rationality (cooperation versus competition) m absent in zero-sum games  we lose the idea of the solution

Game of Chicken 2 2 r Game of Chicken (aka. Hawk-Dove Game) m driver who swerves looses swervestay swerve0, 0-1, 5 stay5, -1-10, -10 Driver 1 Driver 2 Drivers want to do opposite of one another Two equilibria: not equivalent not interchangeable! playing an equilibrium strategy does not lead to equilibrium

The Prisoner’s Dilemma r One of the most studied and used games m proposed in 1950 r Two suspects arrested for joint crime m each suspect when interrogated separately, has option to confess NCC 2, 210, 1 C1, 105, 5 Suspect 1 Suspect 2 payoff is years in jail (smaller is better) single NE better outcome

Pareto Optimal NCC 2, 210, 1 C1, 105, 5 Suspect 1 Suspect 2 r Def: outcome o* is Pareto Optimal if no other outcome would give to all the players a payoff not smaller and a payoff higher to at least one of them r Pareto Principle: to be acceptable as a solution of a game, an outcome should be Pareto Optimal o the NE of the Prisoner’s dilemma is not! r Conflict between group rationality (Pareto principle) and individual rationality (dominance principle) Pareto Optimal

Payoff polygon r All the points in the convex hull of the pure strategy payoffs correspond to payoffs obtainable by mixed strategies r The north-east boundary contains the Pareto optimal points AB A5, 0-1, 4 B3, 22, 1 Rose Colin A,A B,A A,B B,B NE Rose’s payoff Colin’s payoff

Another possible approach to equilibria  NE  equalizing strategies r What about prudential strategies?

Prudential strategies r Each player tries to minimize its maximum loss (then it plays in its own game) AB A5, 0-1, 4 B3, 22, 1 Rose Colin

Prudential strategies r Rose assumes that Colin would like to minimize her gain r Rose plays in Rose’s game r Saddle point in BB r B is Rose’s prudential strategy and guarantees to Rose at least 2 (Rose’s security level) AB A5 B32 Rose Colin

Prudential strategies r Colin assumes that Rose would like to minimize his gain (maximize his loss) r Colin plays in Colin’s game r mixed strategy equilibrium, r (3/5,2/5) is Colin’s prudential strategy and guarantees Colin a gain not smaller than 8/5 AB A0-4 B-2 Rose Colin

Prudential strategies r Prudential strategies m Rose plays B, Colin plays A w. prob. 3/5, B w. 2/5 m Rose gains 13/5 (>2), Colin gains 8/5 r Is it stable? m No, if Colin thinks that Rose plays B, he would be better off by playing A (Colin’s counter-prudential strategy) AB A5, 0-1, 4 B3, 22, 1 Rose Colin

Prudential strategies r are not the solution neither: m do not lead to equilibria m do not solve the group rationality versus individual rationality conflict r dual basic problem: m look at your payoff, ignoring the payoffs of the opponents

Exercises r Find NE and Pareto optimal outcomes: NCC 2, 210, 1 C1, 105, 5 AB A2, 33, 2 B1, 00, 1 swervestay swerve0, 0-1, 5 stay5, -1-10, -10 AB A2, 41, 0 B3, 10, 4

Performance Evaluation Routing as a Potential game Giovanni Neglia INRIA – EPI Maestro 28 January 2013

Routing games r Possible in the Internet? ? Traffic (cars#) Delay

Overlay networks Internet Overlay Underlay

Routing games r Users can ignore ISP choices route allowed by the overlay underlay route An Overlay for routing: Resilient Overlay Routing

Traffic demand r unit traffic demands between pair of nodes a b c d e f 1,3 f 2,3

Delay costs r Social cost: C S = Σ  ε E f  *c  (f  ) r User cost: m C 1,3 (f)= Σ  ε R 1,3 c  (f  ) a b c d e R 1,3 = {a,b}, R 2,3 ={b} f a =f 1,3, f b = f 1,3 + f 2,3, f c =f d =0 f 1,3 f 2,3 c  (f  ),  ε E={a,b,c,d,e}, Non-negative, non decreasing functions

Pigou’s example r Two possible roads between 1 and 2 m a) a longer highway (almost constant transit time) ‏ m b) shorter but traffic sensitive city road r 2 Selfish users (choose the road in order to minimize their delay) 12 transit_time a =2 hour transit_time b =x hours ab a-2, -2-2, -1 b-1, -2-2, -2 Rose Colin

Pigou’s example r Two possible roads between 1 and 2 m a) a longer highway (almost constant transit time) ‏ m b) shorter but traffic sensitive city road r 2 Selfish users (choose the road in order to minimize their delay) m There is 1 (pure-strategy) NE where they all choose the city road... m even if the optimal allocation is not worse for the single user!  What if transit_time a =2+ ε?  In what follows we only consider pure strategy NE 12 transit_time a =2 hour transit_time b =x hours fbfb Social cost

What is the cost of user selfishness for the community? r Loss of Efficiency (LoE) m given a NE with social cost C S (f NE ) m and the traffic allocation with minimum social cost C S (f Opt ) m LoE = C S (f NE ) / C S (f Opt )

Pigou’s example r The LoE of (b,b) is 4/3 r The LoE of (b,a) and (a,b) is 1 12 transit_time a =2 hour transit_time b =x hours ab a-2, -2-2, -1 b-1, -2-2, -2 Rose Colin

Braess's paradox  User cost: 3 +ε r Social cost: C NE = 6 +2ε (=C Opt ) 12 c(x)=x 3 4 c(x)=2+ε

Braess's paradox 12 transit_time a =3+ε hours c(x)=x 3 4 c(x)=2+ε c(x)=0

Braess's paradox r User cost: 4  Social cost: C NE = 8 > 6 +ε (C Opt ) r LoE = 8/(6+ ε ) -> 4/3 12 transit_time a =3+ε hours c(x)=x 3 4 c(x)=2+ε c(x)=0 ε ->0

Routing games 1. Is there always a (pure strategy) NE? 2. Can we always find a NE with a “small” Loss of Efficiency (LoE)?

Always an equilibrium? r Best Response dynamics m Start from a given routing and let each player play its Best Response strategy m What if after some time there is no change?

BR dynamics 1. Users costs: (3 +ε, 3 +ε) 2. Blue plays BR, costs: (3, 4 +ε ) 3. Pink plays BR, costs: (4, 4) 4. Nothing changes….  Social cost: C NE = 6 +2ε (=C Opt ) 12 c(x)=x 3 4 c(x)=2+ε

Always an equilibrium? r Best Response dynamics m Start from a given routing and let each player play its Best Response strategy m What if after some time there is no change? m Are we sure to stop?

Games with no saddle-point r There are games with no saddle-point! r An example? RPS min R P S max RPS min R 01 P 10 S 10 max 111 maximin <> minimax maximin minimax

Always an equilibrium? r Best Response dynamics m Start from a given routing and let each player play its Best Response strategy m What if after some time there is no change? m Are we sure to stop? In some cases we can define a potential function that keeps decreasing at each BR until a minimum is reached. Is the social cost a good candidate?

Potential for routing games r Potential : P =Σ  ε E P  (f  )=Σ  ε E Σ t=1,…f  c  (t) a b c d e P 1,3 = {a,b}, P 2,3 ={b} f a =f 1,3, f b = f 1,3 + f 2,3, f c =f d =0 f 1,3 f 2,3 c  (f  ),  ε E={a,b,c,d,e}, Non-negative, non decreasing functions

Potential decreases at every BR 1. User costs: (3 +ε, 3 +ε), P=6+2ε 2. Blue plays BR, costs: (3, 4 +ε ), P= 6+ε 3. Pink plays BR, costs: (4, 4), P=6 4. Nothing changes…. 12 c(x)=x 3 4 c(x)=2+ε

Potential decreases at every BR 12 c(x)=x 3 4 c(x)=2+ε From route R to route R’ r f’  =f  +1 if  in R’-R, f’  =f  -1 if  in R-R’ r P  -P’  =-c(f  +1) if  in R’-R, r P  -P’  =c(f  ) if  in R-R’  P-P’=Σ  ε R c(f  )-Σ  ε R’ c(f’  )= =user difference cost between R and R’>0

BR dynamics converges to an equilibrium r The potential decreases at every step r There is a finite number of possible potential values r After a finite number of steps a potential local minimum is reached r The final routes identify a (pure strategy) NE

Always an equilibrium with small Loss of Efficiency?  Consider only affine cost functions, i.e. c α (x)= a α +b α x r We will use the potential to derive a bound on the social cost of a NE m P(f) <= C S (f) <= 2 P(f) fαfα C α (x ) x P α (f α ) C α (f α )

Always an equilibrium with small Loss of Efficiency? r Consider only affine cost functions i.e. c α (x)= a α +b α x r We will use the potential to derive a bound on the social cost of a NE m P(f) <= C S (f) <= 2 P(f) r P(f) = Σ  ε E P  =Σ  ε E Σ t=1,…f  c(t) <= <=Σ  ε E Σ t=1,…f  c(f  )=Σ  ε E f  c(f  )=C S (f) r P(f) = Σ  ε E P  =Σ  ε E Σ t=1,…f  (a α +b α t)= =Σ  ε E f α a α +b α f α (f α +1)/2 >= Σ  ε E f α (a α +b α f α )/2 =C S (f)/2

Always an equilibrium with small Loss of Efficiency? r Consider only affine cost functions i.e. c α (x)= a α +b α x r P(f) <= C S (f) <= 2 P(f) r Let’s imagine to start from routing f Opt with the optimal social cost C S (f Opt ), r Applying the BR dynamics we arrive to a NE with routing f NE and social cost C S (f NE ) r C S (f NE ) <= 2 P(f NE ) <= 2 P(f Opt ) <= 2 C S (f Opt ) r The LoE of this equilibrium is at most 2

Same technique, different result r Consider a network with a routing at the equilibrium r Add some links r Let the system converge to a new equilibrium r The social cost of the new equilibrium can be at most 4/3 of the previous equilibrium social cost (as in the Braess Paradox)

Loss of Efficiency, Price of Anarchy, Price of Stability r Loss of Efficiency (LoE) m given a NE with social cost C S (f NE ) m LoE = C S (f NE ) / C S (f Opt ) r Price of Anarchy (PoA) [Koutsoupias99] m Different settings G (a family of graph, of cost functions,...) m X g =set of NEs for the setting g in G  PoA = sup g ε G sup NE ε Xg {C S (f NE ) / C S (f Opt )} => “worst” loss of efficiency in G r Price of Stability (PoS) [Anshelevish04]  PoS = sup g ε G inf NE ε Xg {C S (f NE ) / C S (f Opt )} => guaranteed loss of efficiency in G

Stronger results for affine cost functions r We have proven that for unit-traffic routing games the PoS is at most 2 r For unit-traffic routing games and single- source pairs the PoS is 4/3 r For non-atomic routing games the PoA is 4/3 m non-atomic = infinite players each with infinitesimal traffic r For other cost functions they can be much larger (even unbounded)

Potential games r A class of games for which there is a function P(s 1,s 2,…s N ) such that m For each i U i (s 1,s 2,…x i,…s N )>U i (s 1,s 2,…y i,…s N ) if and only if P(s 1,s 2,…x i,…s N )>P(s 1,s 2,…y i,…s N ) r Properties of potential games: Existence of a pure-strategy NE and convergence to it of best-response dynamics r The routing games we considered are particular potential games