A Short Tutorial on Cooperative Games

Slides:

Advertisements

Similar presentations

Problems and Their Classes

Advertisements

Reaching Agreements II. 2 What utility does a deal give an agent? Given encounter  T 1,T 2  in task domain  T,{1,2},c  We define the utility of a.

6.896: Topics in Algorithmic Game Theory Lecture 20 Yang Cai.

Auction Theory Class 5 – single-parameter implementation and risk aversion 1.

Incremental Linear Programming Linear programming involves finding a solution to the constraints, one that maximizes the given linear function of variables.

NON - zero sum games.

Price Of Anarchy: Routing

Manipulation, Control, and Beyond: Computational Issues in Weighted Voting Games Edith Elkind (U. of Southampton) Based on joint work with: Y.Bachrach,

Agent Failures in Totally Balanced Games and Convex Games Authors: Yoram Bachrach 1 Ian Kash 1 Nisarg Shah 2 (speaker) 1 Microsoft Research Cambridge.

An Approximate Truthful Mechanism for Combinatorial Auctions An Internet Mathematics paper by Aaron Archer, Christos Papadimitriou, Kunal Talwar and Éva.

Introduction to Game theory Presented by: George Fortetsanakis.

Algorithmic aspects of the core in cooperative games over graphs Vangelis Markakis Athens University of Economics and Business Dept. of Informatics Joint.

MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.

Techniques for Dealing with Hard Problems Backtrack: –Systematically enumerates all potential solutions by continually trying to extend a partial solution.

EC941 - Game Theory Lecture 7 Prof. Francesco Squintani

Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.

Chapter 14 Game Theory to accompany Operations Research: Applications and Algorithms 4th edition by Wayne L. Winston Copyright (c) 2004 Brooks/Cole, a.

Cooperative/coalitional game theory Vincent Conitzer

EC941 - Game Theory Prof. Francesco Squintani Lecture 8 1.

Cooperative Weakest Link Games Yoram Bachrach, Omer Lev, Shachar Lovett, Jeffrey S. Rosenschein & Morteza Zadimoghaddam CoopMAS 2013 St. Paul, Minnesota.

Bundling Equilibrium in Combinatorial Auctions Written by: Presented by: Ron Holzman Rica Gonen Noa Kfir-Dahav Dov Monderer Moshe Tennenholtz.

Yoram Bachrach Jeffrey S. Rosenschein November 2007.

Overlapping Coalition Formation: Charting the Tractability Frontier Y. Zick, G. Chalkiadakis and E. Elkind (submitted to AAMAS 2012)

Prisoners Dilemma rules 1.Binding agreements are not possible. Note in Prisoners dilemma, if binding agreements were possible, there would be no dilemma.

Taxation and Stability in Cooperative Games Yair Zick Maria Polukarov Nick R. Jennings AAMAS 2013.

Cooperative/coalitional game theory A composite of slides taken from Vincent Conitzer and Giovanni Neglia (Modified by Vicki Allan) 1.

The Allocation of Value For Jointly Provided Services By P. Linhart, R. Radner, K. G. Ramkrishnan, R. Steinberg Telecommunication Systems, Vol. 4, 1995.

Path Disruption Games (Cooperative Game Theory meets Network Security) Yoram Bachrach, Ely Porat Microsoft Research Cambridge.

Computing Shapley Values, Manipulating Value Distribution Schemes, and Checking Core Membership in Multi-Issue Domains Vincent Conitzer and Tuomas Sandholm.

Job Market Signaling (Spence model)

Coalition Structures in Weighted Voting Games Georgios Chalkiadakis Edith Elkind Nicholas R. Jennings.

Arbitrators in Overlapping Coalition Formation Games

Cooperative/coalitional game theory A composite of slides taken from Vincent Conitzer and Giovanni Neglia (Modified by Vicki Allan) 1.

Near-Optimal Network Design with Selfish Agents By Elliot Anshelevich, Anirban Dasgupta, Eva Tardos, Tom Wexler STOC’03 Presented by Mustafa Suleyman CIFTCI.

On the Aggregation of Preferences in Engineering Design Beth Allen University of Minnesota NSF DMI

Cooperative/coalitional game theory A composite of slides taken from Vincent Conitzer and Giovanni Neglia (Modified by Vicki Allan) 1.

Computational aspects of stability in weighted voting games Edith Elkind (NTU, Singapore) Based on joint work with Leslie Ann Goldberg, Paul W. Goldberg,

Collusion and the use of false names Vincent Conitzer

The Core MIT , Fall Lecture Outline  Coalitional Games and the Core The non-transferable utility ( “ NTU ” ) formulation The transferable.

Efficiency Loss in a Network Resource Allocation Game Paper by: Ramesh Johari, John N. Tsitsiklis [ Informs] Presented by: Gayatree Ganu.

Coalition Games: A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor SIUC A C B.

Computational problems associated with a solution concept Why we need the compact representation of coalitional games Compactly-represented coalitional.

Presenter: Jen Hua Chi Adviser: Yeong Sung Lin Network Games with Many Attackers and Defenders.

Game Theory is Evolving MIT , Fall Our Topics in the Course  Classical Topics Choice under uncertainty Cooperative games  Values  2-player.

Bounding the Cost of Stability in Games with Restricted Interaction Reshef Meir, Yair Zick, Edith Elkind and Jeffrey S. Rosenschein COMSOC 2012 (to appear)

Channel Capacity.

Complexity of Determining Nonemptiness of the Core Vincent Conitzer, Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Manipulating the Quota in Weighted Voting Games (M. Zuckerman, P. Faliszewski, Y. Bachrach, and E. Elkind) ‏ Presented by: Sen Li Software Technologies.

Basic Principles (continuation) 1. A Quantitative Measure of Information As we already have realized, when a statistical experiment has n eqiuprobable.

Algorithmic Game Theory and Internet Computing Vijay V. Vazirani Georgia Tech Primal-Dual Algorithms for Rational Convex Programs II: Dealing with Infeasibility.

Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.

1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.

Negotiating Socially Optimal Allocations of Resources U. Endriss, N. Maudet, F. Sadri, and F. Toni Presented by: Marcus Shea.

11 The Cost of Stability in Network Flow Games Ezra Resnick Yoram Bachrach Jeffrey S. Rosenschein.

D. AriflerCMPE 548 Fall CMPE 548 Routing and Congestion Control.

Nash Bargaining Solution and Alternating Offer Games MIT Game Theory.

Approximation Algorithms based on linear programming.

The Shapley Value The concept of the core is useful as a measure of stability. As a solution concept, it presents a set of imputations without distinguishing.

Cooperative Games Based on the book and slides by Chalkiadakis, Elkind and Wooldridge.

Optimization and Stability in Games with Restricted Interactions Reshef Meir, Yair Zick and Jeffrey S. Rosenschein CoopMAS 2012.

Coalitional Games on “Sparse” Social Networks Edith Elkind University of Oxford.

Dynamic Weighted Voting Games Edith Elkind Dmitrii Pasechnik Yair Zick AAMAS 2013.

Computing Shapley values, manipulating value division schemes, and checking core membership in multi-issue domains Vincent Conitzer, Tuomas Sandholm Computer.

Cooperative Games Based on the book and slides by Chalkiadakis, Elkind and Wooldridge.

Approximation algorithms for combinatorial allocation problems

Complexity of Determining Nonemptiness of the Core

Algorithmic Game Theory and Internet Computing

CPS Cooperative/coalitional game theory

Economics and Computation Week 6: Assignment games

Presentation transcript:

A Short Tutorial on Cooperative Games SCE 2012 Sudarshan Guruacharya Nanyang Technological University

Tutorial Overview Introduction Definitions Some solution concepts Computational issues

Assumptions in Non-Cooperative Games Cooperation does not occur in non-cooperative games, because players cannot make binding agreements But what if binding agreements are possible? This is exactly the class of scenarios studied by cooperative game theory

Cooperative Games Cooperative games model scenarios, where agents can benefit by cooperating binding agreements are possible In cooperative games, actions are taken by groups of agents Transferable utility games: payoffs are given to the group and then divided among its members Non-transferable utility games: group actions result in payoffs to individual group members

Coalitional Action Agents form coalitions (teams) Each coalition chooses its action Non-transferable utility (NTU) games: the choice of coalitional actions (by all coalitions) determines each player’s payoffs Transferable utility (TU) games: the choice of coalitional actions (by all coalitions) determines the payoff of each coalition the members of the coalition then need to divide this joint payoff

Characteristic Function Games vs. Partition Function Games In general TU games, the payoff obtained by a coalition depends on the actions chosen by other coalitions these games are also known as partition function games (PFG) Characteristic function games (CFG): the payoff of each coalition only depends on the action of that coalition in such games, each coalition can be identified with the profit it obtains by choosing its best action We will focus on characteristic function games, and use term “TU games” to refer to such games

How Is a Cooperative Game Played? Even though agents work together they are still selfish The partition into coalitions and payoff distribution should be such that no player (or group of players) has an incentive to deviate (idea of stability) We may also want to ensure that the outcome is fair: the payoff of each agent is proportional to his contribution We will now see how to formalize these ideas

Transferable Utility Games Formalized A transferable utility game is a pair (N, v), where: N ={1, ..., n} is the set of players v: 2N → R is the characteristic function for each subset of players C, v(C) is the amount that the members of C can earn by working together usually it is assumed that v is normalized: v(Ø) = 0 non-negative: v(C) ≥ 0 for any C ⊆ N monotone: v(C) ≤ v(D) for any C, D such that C ⊆ D A coalition is any subset of N; N itself is called the grand coalition

Buying Ice-Cream Game n children, each has some amount of money the i-th child has bi dollars three types of ice-cream tubs are for sale: Type 1 costs $7, contains 500g Type 2 costs $9, contains 750g Type 3 costs $11, contains 1kg children have utility for ice-cream, and do not care about money The payoff of each group: the maximum quantity of ice-cream the members of the group can buy by pooling their money The ice-cream can be shared arbitrarily within the group

Ice-Cream Game: Characteristic Function Charlie: $6, Marcie: $4, Pattie: $3 w = 500 w = 750 w = 1000 p = $7 p = $9 p = $11 v(Ø) = v({C}) = v({M}) = v({P}) = 0 v({C, M}) = 750, v({C, P}) = 750, v({M, P}) = 500 v({C, M, P}) = 1000 Charlie, Marcie, and Pattie

Transferable Utility Games: Outcome An outcome of a TU game G =(N, v) is a pair (CS, x), where: CS =(C1, ..., Ck) is a coalition structure, i.e., partition of N into coalitions:  i Ci = N, Ci  Cj = Ø for i ≠ j x = (x1, ..., xn) is a payoff vector, which distributes the value of each coalition in CS: xi ≥ 0 for all i N SiC xi = v(C) for each C is CS 1 2 3 4 5

Transferable Utility Games: Outcome Example: suppose v({1, 2, 3}) = 9, v({4, 5}) = 4 then (({1, 2, 3}, {4, 5}), (3, 3, 3, 3, 1)) is an outcome (({1, 2, 3}, {4, 5}), (2, 3, 2, 3, 3)) is NOT an outcome: transfers between coalitions are not allowed An outcome (CS, x) is called an imputation if it satisfies individual rationality: xi ≥ v({i}) for all i  N Notation: we will denote SiC xi by x(C)

Superadditive Games Definition: a game G = (N, v) is called superadditive if v(C U D) ≥ v(C) + v(D) for any two disjoint coalitions C and D Example: v(C) = |C|2: v(C U D) = (|C|+|D|)2 ≥ |C|2+|D|2 = v(C) + v(D) In superadditive games, two coalitions can always merge without losing money; hence, we can assume that players form the grand coalition Convention: in superadditive games, we identify outcomes with payoff vectors for the grand coalition i.e., an outcome is a vector x = (x1, ..., xn) with SiN xi = v(N)

What Is a Good Outcome? C: $4, M: $3, P: $3 v(Ø) = v({C}) = v({M}) = v({P}) = 0, v({C, M, P}) = 750 v({C, M}) = 500, v({C, P}) = 500, v({M, P}) = 0 This is a superadditive game outcomes are payoff vectors How should the players share the ice-cream? if they share as (200, 200, 350), Charlie and Marcie can get more ice-cream by buying a 500g tub on their own, and splitting it equally the outcome (200, 200, 350) is not stable!

Transferable Utility Games: Stability Definition: the core of a game is the set of all stable outcomes, i.e., outcomes that no coalition wants to deviate from core(G) = {(CS, x) | SiC xi ≥ v(C) for any C ⊆ N} each coalition earns at least as much as it can make on its own Note that G is not assumed to be superadditive Example suppose v({1, 2, 3}) = 9, v({4, 5}) = 4, v({2, 4}) = 7 then (({1, 2, 3}, {4, 5}), (3, 3, 3, 3, 1)) is NOT in the core 1 2 3 4 5

Ice-cream game: Core C: $4, M: $3, P: $3 v(Ø) = v({C}) = v({M}) = v({P}) = 0, v({C, M, P}) = 750 v({C, M}) = 500, v({C, P}) = 500, v({M, P}) = 0 (200, 200, 350) is not in the core: v({C, M}) > xC + xM (250, 250, 250) is in the core: no subgroup of players can deviate so that each member of the subgroup gets more (750, 0, 0) is also in the core: Marcie and Pattie cannot get more on their own!

Games with Empty Core The core is a very attractive solution concept However, some games have empty cores G = (N, v) N = {1, 2, 3}, v(C) = 1 if |C| > 1 and v(C) = 0 otherwise consider an outcome (CS, x) if CS = ({1}, {2}, {3}), the singleton coalitions can deviate if CS = ({1, 2}, {3}), either 1 or 2 gets less than 1, so can deviate with 3 same argument for CS = ({1, 3}, {2}) or CS = ({2, 3}, {1}) suppose CS = {1, 2, 3}: xi > 0 for some i, so x(N\{i}) < 1, yet v(N\{i}) = 1

Core and Superadditivity Suppose the game is not superadditive, but the outcomes are defined as payoff vectors for the grand coalition Then the core may be empty, even if according to the standard definition it is not G = (N, v) N = {1, 2, 3, 4}, v(C) = 1 if |C| > 1 and v(C) = 0 otherwise not superadditive: v({1, 2}) + v({3, 4}) = 2 > v({1, 2, 3, 4}) no payoff vector for the grand coalition is in the core: either {1, 2} or {3, 4} get less than 1, so can deviate (({1, 2}, {3, 4}), (½, ½, ½, ½)) is in the core

e-Core If the core is empty, we may want to find approximately stable outcomes Need to relax the notion of the core: core: (CS, x): x(C) ≥ v(C) for all C  N e-core: (CS, x): x(C) ≥ v(C) - e for all C  N Is usually defined for superadditive games only Example: G = (N, v), N = {1, 2, 3}, v(C) = 1 if |C| > 1, v(C) = 0 otherwise 1/3-core is non-empty: (1/3, 1/3, 1/3) 1/3-core e-core is empty for any e < 1/3: xi ≥ 1/3 for some i = 1, 2, 3, so x(N\{i}) ≤ 2/3, v(N\{i}) = 1

Least Core If an outcome (CS, x) is in e-core, the deficit v(C) - x(C) of any coalition is at most e We are interested in outcomes that minimize the worst-case deficit Let e*(G) = inf { e | e-core of G is not empty } it can be shown that e*(G)-core is not empty Definition: e*(G)-core is called the least core of G e*(G) is called the value of the least core G = (N, v), N = {1, 2, 3}, v(C) = 1 if |C| > 1, v(C) = 0 otherwise 1/3-core is non-empty, but e-core is empty for any e < 1/3, so least core = 1/3-core

Stability vs. Fairness Outcomes in the core may be unfair G = (N, v) N = {1, 2}, v(Ø) = 0, v({1}) = v({2}) = 5, v({1, 2}) = 20 (15, 5) is in the core: player 2 cannot benefit by deviating However, this is unfair since 1 and 2 are symmetric How do we divide payoffs in a fair way?

Marginal Contribution A fair payment scheme would reward each agent according to his contribution First attempt: given a game G = (N, v), set xi = v({1, ..., i-1, i}) - v({1, ..., i-1}) payoff to each player = his marginal contribution to the coalition of his predecessors We have x1 + ... + xn = v(N) x is a payoff vector However, payoff to each player depends on the order G = (N, v) N = {1, 2}, v(Ø) = 0, v({1}) = v({2}) = 5, v({1, 2}) = 20 x1 = v(1) - v(Ø) = 5, x2 = v({1, 2}) - v({1}) = 15

Average Marginal Contribution Idea: to remove the dependence on ordering, can average over all possible orderings G = (N, v) N = {1, 2}, v(Ø) = 0, v({1}) = v({2}) = 5, v({1, 2}) = 20 1, 2: x1 = v(1) - v(Ø) = 5, x2 = v({1, 2}) - v({1}) = 15 2, 1: y2 = v(2) - v(Ø) = 5, y1 = v({1, 2}) - v({2}) = 15 z1 = (x1 + y1)/2 = 10, z2 = (x2 + y2)/2 = 10 the resulting outcome is fair! Can we generalize this idea?

Shapley Value Reminder: a permutation of {1,..., n} is a one-to-one mapping from {1,..., n} to itself let P(N) denote the set of all permutations of N Let Sp(i) denote the set of predecessors of i in pP(N) For C⊆N, let di(C) = v(C U {i}) - v(C) Definition: the Shapley value of player i in a game G = (N, v) with |N| = n is fi(G) = 1/n! S p: p  P(N) di(Sp(i)) In the previous slide we have f1 = f2 = 10 i ... Sp(i)

Shapley Value: Probabilistic Interpretation fi is i’s average marginal contribution to the coalition of its predecessors, over all permutations Suppose that we choose a permutation of players uniformly at random, among all possible permutations of N then fi is the expected marginal contribution of player i to the coalition of his predecessors

Definition: a player i is a dummy in a game G = (N, v) if v(C) = v(C U {i}) for any C ⊆ N Definition: given a game G = (N, v), two players i and j are said to be symmetric if v(C U {i}) = v(C U {j}) for any C ⊆ N\{i, j}

Axiomatic Characterization Properties of Shapley value: Efficiency: f1 + ... + fn = v(N) Dummy: if i is a dummy, fi = 0 Symmetry: if i and j are symmetric, fi = fj Additivity: fi(G1+G2) = fi(G1) + fi(G2) Theorem: Shapley value is the only payoff distribution scheme that has properties (1) - (4)

Computational Issues in Coalitional Games We have defined some solution concepts - but can we compute them efficiently? Problem: the naive representation of a coalitional game is exponential in the number of players n need to list values of all coalitions We are usually interested in algorithms whose running time is polynomial in n So what can we do?

Checking Non-emptiness of the Core: Superadditive Games An outcome in the core of a superadditive game satisfies the following constraints: xi ≥ 0 for all i N SiN xi = v(N) SiC xi ≥ v(C) for any C ⊆ N A linear feasibility program, with one constraint for each coalition: 2n+n+1 constraints

Superadditive Games: Computing the Least Core LFP for the core min e xi ≥ 0 for all i N SiN xi = v(N) SiC xi ≥ v(C) - e for any C ⊆ N A minimization program, rather than a feasibility program LP for the least core

Computing Shapley Value fi(G) = 1/n! S p: p  P(N) di(Sp(i)) fi is the expected marginal contribution of player i to the coalition of his predecessors Quick and dirty way: Use Monte-Carlo method to compute fi Convergence guaranteed by Law of Large Numbers

Two Person Nash Bargaining Solution Simplest cooperative game (u1, u2) ε U is the set of attainable payoff pairs d ε U denotes the fixed threat point The players will receive the payoff d = (d1, d2) if they fail to achieve an agreement Assumption: Set U is compact and convex

Characterization of Nash solution The element of U that maximizes the product of gains from agreement (u1 - d1)(u2 - d2) is the unique Nash solution We need to solve the following optimization problem: max (u1 - d1)(u2 - d2)

Axiomatic Characterization Properties of Nash solution (1) Individually Rational: f(U,d) > d (2) Invariance to affine transform: f(U’,d’) = a•f(U,d) + b (3) Symmetry: f1(U,d) = f2 (U,d) (4) Independence of irrelevant alternatives Nash solution uniquely satisfies all these properties

Proportional Fairness Usually we have the utility function defined over a set of feasible resources X Sufficient for the utility to be quasi-concave By taking the logarithm of the objective function, we can convert the Nash solution into a more convenient form max log (u1 (x) - d1) + log (u2 (x) - d2) s.t. x ε X Can be solved using convex optimization

Thank You!!