Concise representations of games Vincent Conitzer

Slides:



Advertisements
Similar presentations
The role of compatibility in the diffusion of technologies in social networks Mohammad Mahdian Yahoo! Research Joint work with N. Immorlica, J. Kleinberg,
Advertisements

Mathematical Preliminaries
Approximate Nash Equilibria in interesting games Constantinos Daskalakis, U.C. Berkeley.
Recognising Languages We will tackle the problem of defining languages by considering how we could recognise them. Problem: Is there a method of recognising.
Computational Complexity
On the Local and Global Price of Anarchy of Graphical Games Oren Ben-Zwi Ronen Oren Ben-ZwiAmir Ronen Amir Ronen.
Preprocessing Techniques for Computing Nash Equilibria Vincent Conitzer Duke University Based on: Conitzer and Sandholm. A Generalized Strategy Eliminability.
Bayesian network for gene regulatory network construction
Dimitri DeFigueiredo Earl Barr S. (Felix) Wu Adobe Systems Inc. UC Davis UC Davis International Conference on Privacy, Security, Risk and Trust
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman TexPoint fonts used in EMF. Read the TexPoint manual.
CPS Extensive-form games Vincent Conitzer
Graphs, representation, isomorphism, connectivity
CPSC 422, Lecture 11Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Jan, 29, 2014.
Variants of Nash Equilibrium CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte 1.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Algorithmic mechanism design Vincent Conitzer
Epp, section 10.? CS 202 Aaron Bloomfield
all-pairs shortest paths in undirected graphs
Complexity Results about Nash Equilibria Vincent Conitzer, Tuomas Sandholm International Joint Conferences on Artificial Intelligence 2003 (IJCAI’03) Presented.
1 A Graph-Theoretic Network Security Game M. Mavronicolas , V. Papadopoulou , A. Philippou  and P. Spirakis § University of Cyprus, Cyprus  University.
EC941 - Game Theory Prof. Francesco Squintani Lecture 4 1.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Price Of Anarchy: Routing
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.
Social Networks 101 P ROF. J ASON H ARTLINE AND P ROF. N ICOLE I MMORLICA.
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Mechanism Design without Money Lecture 4 1. Price of Anarchy simplest example G is given Route 1 unit from A to B, through AB,AXB OPT– route ½ on AB and.
by Vincent Conitzer of Duke
Temporal Action-Graph Games: A New Representation for Dynamic Games Albert Xin Jiang University of British Columbia Kevin Leyton-Brown University of British.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Potential games, Congestion games Computational game theory Spring 2010 Adapting slides by Michal Feldman TexPoint fonts used in EMF. Read the TexPoint.
Graphical Models for Game Theory Undirected graph G capturing local interactions Each player represented by a vertex N_i(G) = neighbors.
Graphical Games Kjartan A. Jónsson. Nash equilibrium Nash equilibrium Nash equilibrium N players playing a dominant strategy is a Nash equilibrium N players.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©
CPS Learning in games Vincent Conitzer
CPS 270: Artificial Intelligence Bayesian networks Instructor: Vincent Conitzer.
Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”
History-Dependent Graphical Multiagent Models Quang Duong Michael P. Wellman Satinder Singh Computer Science and Engineering University of Michigan, USA.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Presenter: Jen Hua Chi Adviser: Yeong Sung Lin Network Games with Many Attackers and Defenders.
Dynamic Games & The Extensive Form
The Science of Networks 6.1 Today’s topics Game Theory Normal-form games Dominating strategies Nash equilibria Acknowledgements Vincent Conitzer, Michael.
Complexity of Determining Nonemptiness of the Core Vincent Conitzer, Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.
Modeling Agents’ Reasoning in Strategic Situations Avi Pfeffer Sevan Ficici Kobi Gal.
Hedonic Clustering Games Moran Feldman Joint work with: Seffi Naor and Liane Lewin-Eytan.
CPS Computational problems, algorithms, runtime, hardness (a ridiculously brief introduction to theoretical computer science) Vincent Conitzer.
Vasilis Syrgkanis Cornell University
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
Complexity of Determining Nonemptiness of the Core
Congestion games Computational game theory Fall 2010
Structured Models for Multi-Agent Interactions
Instructors: Fei Fang (This Lecture) and Dave Touretzky
CPS 570: Artificial Intelligence Bayesian networks
Instructor: Vincent Conitzer
Lecture Game Theory.
Normal Form (Matrix) Games
Presentation transcript:

Concise representations of games Vincent Conitzer

Games with many agents How do we represent a (say, simultaneous-move) game with n agents? Even with only 2 actions (pure strategies) per player, there are 2 n possible outcomes –Impractical to list them all Real-world games often have structure that allows us to describe them concisely E.g., complete symmetry among players –How would we represent such a game? –How many numbers (utilities) do we need to specify? –For more, see e.g., Brandt et al. 2009, Jiang et al What other structure can we make use of?

Congestion games [Rosenthal 73] There is a set of resources R Agent is set of actions (pure strategies) A i is a subset of 2 R, representing which subsets of resources would meet her needs –Note: different agents may need different resources There exist cost functions c r : {1, 2, 3, …} such that agent is utility for a = (a i, a -i ) is -Σ r in a i c r (#(r, a)) –#(r, a) is the number of agents that chose r as one of their resources

Example: writing paper/playing video game Player 1 needs to write a paper Player 2 (player 1s roommate) needs to play a video game Resources: 1s Laptop (L1), 2s Laptop (L2), Video game system (V), Internet connection (I), Common room (C), Bedroom (B) –Both rooms have (shared) internet connection, video game system cannot be moved out of common room Player 1s action set: {{L1, I, C}, {L1, I, B}} Player 2s action set: {{V, C}, {L2, I, C}, {L2, I, B}} For all resources r, c r (1) = 0, c r (2) = 1 What are the pure-strategy equilibria?

Example: network routing Player 1 has source A and target E Player 2 has source B and target C Player 3 has source B and target E Resources = edges Player 1s action set: {{AD,DE}, {AB,BD,DE}, {AC,CD,DE}} Player 2s action set: {{AB,AC}, {BD,CD}, {AB,AD,CD}, {BD,AD,AC}} Player 3s action set: {{BD,DE}, {AB,AD,DE}, {AB,AC,CD,DE}} For all resources r, c r (1) = 0, c r (2) = 1, c r (3) = 2 Pure-strategy equilibria? A B C D E

Potential games [Monderer & Shapley 96] A potential game is a game with a potential function p: A (A is set of all joint actions (pure strategy profiles, outcomes)) such that for all players i, all a -i, all a i, all a i, u i (a i, a -i ) - u i (a i, a -i ) = p(a i, a -i ) - p(a i, a -i ) 1, 36, 6 3, 53, 3 Potential function: p(UL) = 0 p(UR) = 3 p(DL) = 2 p(DR) = 0 Can you think of an algorithm for verifying whether a normal-form game is a potential game (i.e., for finding a potential function)?

Potential games always have a pure- strategy equilibrium Recall that u i (a i, a -i ) - u i (a i, a -i ) = p(a i, a -i ) - p(a i, a -i ) Hence, let us simply choose argmax a {p(a)} For any alternative a i, u i (a i, a -i ) - u i (a) = p(a i, a -i ) - p(a) 0, hence it is an equilibrium! More generally, the set of pure-strategy Nash equilibria is exactly the set of local maxima of the potential function –Local maximum = no player can improve the potential function by herself Easy algorithm for finding a pure-strategy equilibrium: –Start with any strategy profile –If a player is not best-responding, switch that players strategy to a better response (must increase potential) –Terminate when no player can improve

Every congestion game is a potential game! Use potential p(a) = -Σ r Σ 1 i #(r, a) c r (i) –One interpretation: the sum of the utilities that the agents would have received if each agent were unaffected by all later agents Why is this a correct potential function? Suppose you change actions –You stop using some resources (R-), start using others (R+) Change in your utility equals Σ r in R- c r (#(r, a)) - Σ r in R+ c r (#(r, a) + 1) This is also the change in the potential function above Conversely, every potential game can be modeled as a congestion game –Proof omitted

Graphical games [Kearns et al. 01] Note: all games in this lecture have something to do with graphs, but only the following are considered graphical games Suppose players are vertices of a graph, and each agent is affected only by its neighbors actions (and own action) E.g., physical neighbors on a road: Can write u 2 (a 1, a 2, a 3, a 6 ) rather than u 2 (a 1, a 2, a 3, a 4, a 5, a 6, a 7, a 8 ) –If each agent has two actions, a table requiring 2 4 = 16 numbers rather than 2 8 = 256 numbers Graphical games can model any normal-form game (by using a fully connected graph)

Action-graph games [Bhat & Leyton-Brown 04] Now there is a vertex for every action that a player may take E.g., action = where to sell hot dogs l1l2l3l4 l5l6l7l8 Players can have distinct action sets, but they can overlap –E.g., perhaps some players cannot stand in every location –E.g., maybe player 1 lives on the right and can only make it to A 1 = {l3, l4, l7, l8}; but player 2 has a car and can make it to any location Your utility is a function of –Which vertex you choose, –How many other players choose your vertex, –For each neighbor of your vertex, how many players choose that neighbor

Any normal-form game can be expressed as an action-graph game Make action sets disjoint, Connect every action to every action by another player a 11 a 12 a 13 A1A1 a 21 a 23 A2A2 a 31 a 32 A3A3

Action-graph games can capture the structure of graphical games Omit edges between actions of players not connected in the graphical game a 11 a 12 a 13 A1A1 a 21 a 23 A2A2 a 31 a 32 A3A But, AGGs can capture more structure than that: e.g., things like: Player 1s utility does not depend on what player 2 does given that player 1 plays a 12 (context-specific independence)

Computing expected utilities in AGGs [Jiang & Leyton-Brown AAAI06] How do we compute the expected utility of playing a given action a i, given the other players mixed strategies σ -i ? –Key step in many algorithms for computing equilibria Observation: only care about neighboring actions of a i –Lump other actions together into one big irrelevant action A configuration D(a i ) specifies the number of (other) players that end up playing each action –Utility of playing a i is a function of the configuration –Need to know the probability distribution over configurations Use dynamic programming Say P k (D(a i )) is the probability of D(a i ) if only the first k other players play Say D(D(a i ), a j ) is the configuration that results from adding one player playing a j to D(a i ) Then P k (D(a i )) = Σ D(a i ), a j : D(D(a i ), a j ) = D(a i ) P k-1 (D(a i )) σ k (a j )

Bayes nets: Rain and sprinklers example raining (X) sprinklers (Y) grass wet (Z) P(Z=1 | X=0, Y=0) =.1 P(Z=1 | X=0, Y=1) =.8 P(Z=1 | X=1, Y=0) =.7 P(Z=1 | X=1, Y=1) =.9 P(X=1) =.3 P(Y=1) =.4 sprinklers is independent of raining, so no edge between them Each node has a conditional probability table (CPT)

More elaborate rain and sprinklers example rained sprinklers were on grass wet dog wet neighbor walked dog P(+r) =.2 P(+n|+r) =.3 P(+n|-r) =.4 P(+s) =.6 P(+g|+r,+s) =.9 P(+g|+r,-s) =.7 P(+g|-r,+s) =.8 P(+g|-r,-s) =.2 P(+d|+n,+g) =.9 P(+d|+n,-g) =.4 P(+d|-n,+g) =.5 P(+d|-n,-g) =.3

MAIDs [Koller & Milch 03] MAID = Multi Agent Influence Diagram Example: two students working on a project together Blue student likes theory and algorithms, green student likes code that works well in practice Rectangles = decision nodes, ovals = random nodes, diamonds = utility nodes Work hard early in the course Neat algorithm discovered Work hard late in the course Code produced works really well Grade Pain and suffering Inherent satisfaction Parents happy

MAIDs… Work hard early in the course Work hard late in the course Pain and suffering u p&s (WHE, WHL) = -30 u p&s (WHE, -WHL) = -20 u p&s (-WHE, WHL) = -20 u p&s (-WHE, -WHL) = 0 The graph by itself is not enough: We need specifications of the utility functions… … and the conditional probabilities for random nodes Work hard early in the course Neat algorithm discovered P(NAD | WHE, WHE) =.9 P(NAD | WHE, -WHE) =.8 P(NAD | -WHE, WHE) =.5 P(NAD | -WHE, -WHE) = 0