Multiagent Systems Extensive Form Games © Manfred Huber 2018.

Extensive Form Games
Normal form games cannot represent the sequential structure of the agents' decisions: neither multiple sequential decisions by one agent nor the temporal structure of multiagent decisions.
Extensive form games provide an explicit representation of the temporal structure (protocol) of decisions and of multiple sequential decisions by an agent.

Extensive Form Games
To capture the different amounts of information available in different scenarios, there are two main variants of extensive form games.
Perfect information games: each player knows the current state in the decision-making sequence and is aware of all decisions that the other agents have made.
Imperfect information games: different parts of the game look identical to the agent, and it cannot determine which of them it is in.
In extensive form games the decision-making sequence is represented as a decision tree.

Perfect Information Games
A perfect information game in extensive form is defined as a tuple (N, A, H, Z, χ, ρ, σ, u):
  N is the set of n agents
  A is the set of actions
  H is the set of non-terminal choice (decision) nodes
  Z is the set of terminal nodes
  χ: H → 2^A indicates all actions available to the agent in a node
  ρ: H → N indicates which agent makes the decision in a given node
  σ: H × A → H ∪ Z is the successor function indicating the next node in the game
  u = (u_1, …, u_n) is the vector of utility functions, one for each player

Perfect Information Games
The sharing game in extensive form: two siblings receive two presents. One sibling decides how to share them; the second sibling decides whether to accept the shares or to decline the presents.
[Game tree] Agent 1 moves first and chooses 2-0, 1-1, or 0-2; agent 2 then answers yes or no. Payoffs (agent 1, agent 2):
  2-0: yes → (2,0), no → (0,0)
  1-1: yes → (1,1), no → (0,0)
  0-2: yes → (0,2), no → (0,0)
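The tuple definition maps directly onto plain dictionaries. The following is a minimal sketch for the sharing game, not the author's implementation; the node names (`root`, `after_2-0`, ...) and the helper `play` are made up for illustration.

```python
# Hypothetical encoding of the sharing game; all node names are made up.
N = (1, 2)
A = {"2-0", "1-1", "0-2", "yes", "no"}
H = {"root", "after_2-0", "after_1-1", "after_0-2"}
SPLITS = {"2-0": (2, 0), "1-1": (1, 1), "0-2": (0, 2)}

chi = {"root": set(SPLITS)}   # chi: available actions per decision node
rho = {"root": 1}             # rho: deciding agent per decision node
sigma = {}                    # sigma: (node, action) -> successor node
u = {}                        # u: utility vector per terminal node
for split, shares in SPLITS.items():
    node = f"after_{split}"
    sigma[("root", split)] = node
    chi[node] = {"yes", "no"}
    rho[node] = 2
    for answer in ("yes", "no"):
        leaf = f"{split}_{answer}"
        sigma[(node, answer)] = leaf
        u[leaf] = shares if answer == "yes" else (0, 0)
Z = set(u)                    # terminal nodes


def play(choices):
    """Follow sigma from the root, taking choices[node] at each decision node."""
    node = "root"
    while node in H:
        node = sigma[(node, choices[node])]
    return u[node]


print(play({"root": "1-1", "after_1-1": "yes"}))  # (1, 1)
print(play({"root": "2-0", "after_2-0": "no"}))   # (0, 0)
```

Keeping χ, ρ, σ and u as separate dictionaries mirrors the definition one to one; a nested tree structure would be more compact but hides the correspondence.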

Pure Strategies in Perfect Information Games
A pure strategy for agent i in a perfect information game is a complete specification of the (deterministic) action the agent will take in each decision node associated with the agent.
Strategies have to include action choices even for nodes that cannot be reached under the strategy.

Pure Strategies in Perfect Information Games
In the sharing game, agent 1 chooses 2-0, 1-1, or 0-2, and agent 2 answers yes or no at each of its three nodes.
Pure strategies for agent 1: (2-0), (1-1), (0-2)
Pure strategies for agent 2 (one answer per decision node): (yes,yes,yes), (yes,yes,no), (yes,no,yes), (yes,no,no), (no,yes,yes), (no,yes,no), (no,no,yes), (no,no,no)
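The strategy counts follow from taking the Cartesian product of the action sets at an agent's decision nodes. A small sketch, with made-up node names and a hypothetical helper `pure_strategies`:

```python
from itertools import product

# Agent 2's decision nodes in the sharing game (node names are made up).
nodes_agent2 = ["after_2-0", "after_1-1", "after_0-2"]
answers = ["yes", "no"]


def pure_strategies(nodes, actions_per_node):
    """All complete assignments of one action to every decision node of an agent."""
    return [dict(zip(nodes, combo)) for combo in product(*actions_per_node)]


s1 = pure_strategies(["root"], [["2-0", "1-1", "0-2"]])
s2 = pure_strategies(nodes_agent2, [answers] * len(nodes_agent2))
print(len(s1), len(s2))  # 3 8
```

The number of pure strategies is the product of the action counts over all of an agent's nodes, which is why it grows exponentially with the number of decision nodes.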

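Pure Strategies in Perfect Information Games
[Game tree; the layout did not survive extraction. Recoverable labels: agent 1 chooses between A and B and between H and I; agent 2 chooses between C and D, between E and F, and has a node with the single action G; leaf payoffs (1,1), (5,2), (3,2), (1,0), (2,1).]
Pure strategies for agent 1: (A,H), (A,I), (B,H), (B,I)
Note: (A,H) and (A,I) are pure strategies even though the decision between H and I never has to be taken after A.
Pure strategies for agent 2: (C,E,G), (C,F,G), (D,E,G), (D,F,G)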

Strategies and Equilibria
Solution strategies can be defined as in normal form games:
Mixed strategies are defined by a probability distribution over pure strategies.
Best responses for agent i are strategies that yield the optimal utility given the strategies of the other agents.
A Nash equilibrium is a strategy profile in which each agent's strategy is a best response to the other agents' strategies in the profile.

Nash Equilibria in Perfect Information Extensive Form Games
Every perfect information game in extensive form has a pure strategy Nash equilibrium. Since the agents make decisions sequentially and are aware of all prior decisions, randomized decision making cannot hide the actual outcome, so each choice reduces to a deterministic action.
Every perfect information game in extensive form can be converted into normal form. The reverse is not true, since the extensive form with perfect information requires knowledge of prior, sequential decisions, which a simultaneous-move game does not provide.

Induced Normal Form
[Game tree and induced normal form payoff table, with agent 1's strategies (A,H), (A,I), (B,H), (B,I) as rows and agent 2's strategies (C,E,G), (C,F,G), (D,E,G), (D,F,G) as columns; the cell layout did not survive extraction. Recoverable entries: row (A,H): 2,1 and 1,1; row (B,H): 5,2 and 3,2; row (B,I): 1,0.]
What are the pure strategy equilibria?
((B,H),(C,E,G)), ((B,H),(D,E,G)), ((B,I),(C,E,G)), ((B,I),(D,E,G))

Induced Normal Form
[Game tree] Payoffs (agent 1, agent 2):
  1: A → 2: C → (3,8)
            D → (8,3)
     B → 2: E → (5,5)
            F → 1: G → (2,10)
                   H → (1,0)
Induced normal form (rows: agent 1, columns: agent 2):
         C,E     C,F     D,E     D,F
  A,G    3,8     3,8     8,3     8,3
  A,H    3,8     3,8     8,3     8,3
  B,G    5,5     2,10    5,5     2,10
  B,H    5,5     1,0     5,5     1,0
Pure strategy Nash equilibria: ((A,G),(C,F)), ((A,H),(C,F)), ((B,H),(C,E))
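The induced normal form and its pure strategy equilibria can be checked mechanically. The sketch below assumes the tree shape read off the slide (agent 1: A/B, agent 2: C/D after A and E/F after B, agent 1: G/H after F); the nested-tuple encoding and the function names are illustrative choices, and the lookup by action name only works because every action label is unique in this game.

```python
from itertools import product

# Decision node = (agent, {action: subtree}); leaf = payoff tuple (agent 1, agent 2).
GAME = (1, {"A": (2, {"C": (3, 8), "D": (8, 3)}),
            "B": (2, {"E": (5, 5), "F": (1, {"G": (2, 10), "H": (1, 0)})})})

S1 = list(product("AB", "GH"))   # agent 1: one action per decision node
S2 = list(product("CD", "EF"))   # agent 2: one action per decision node


def outcome(s1, s2):
    """Walk the tree, taking at each node the action selected by the strategy profile."""
    chosen = set(s1) | set(s2)
    node = GAME
    while isinstance(node[1], dict):          # still at a decision node
        _, children = node
        node = children[next(a for a in children if a in chosen)]
    return node


payoff = {(s1, s2): outcome(s1, s2) for s1 in S1 for s2 in S2}


def is_nash(s1, s2):
    """No agent can gain by a unilateral deviation."""
    u1, u2 = payoff[s1, s2]
    return (all(payoff[d, s2][0] <= u1 for d in S1)
            and all(payoff[s1, d][1] <= u2 for d in S2))


equilibria = [(s1, s2) for s1 in S1 for s2 in S2 if is_nash(s1, s2)]
for s1, s2 in equilibria:
    print(s1, s2, payoff[s1, s2])
```

The table has 16 cells although the tree has only 5 leaves, which illustrates the loss of compactness when moving to the induced normal form.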

Induced Normal Form
Using the induced normal form, all techniques from normal form games can be used.
The extensive form is more compact than the induced normal form, which has to represent more utility values (16 table cells versus 5 leaves in the example).
Some of the Nash equilibria are counterintuitive, e.g. ((B,H),(C,E)): why would agent 1 ever play H? H is a threat that deters player 2 from playing F. Is this threat credible?

Subgames and Subgame Perfect Equilibria
A subgame of a game in extensive form is defined by a subtree rooted in a node in H.
A subgame perfect equilibrium is a Nash equilibrium whose restriction to the nodes of any subgame is also a Nash equilibrium of that subgame.
Nash equilibria with non-credible threats are not subgame perfect.
Every perfect information game in extensive form has at least one subgame perfect Nash equilibrium.
In the example, only ((A,G),(C,F)) is subgame perfect.

Computing Subgame Perfect Equilibria
Backward induction can be used to compute a subgame perfect equilibrium for n-player general-sum games.
Starting with the smallest subgames, propagate the utility vector that maximizes the utility of the deciding agent up to the root of each subtree.
In the equilibrium strategy, each agent takes the best action (the one that leads to the maximum value) at each node.
In two-player zero-sum games this is the common minimax algorithm.
There are alternative algorithms that backward propagate all subgame equilibria if more than one exists.
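A recursive sketch of backward induction on the example game with equilibria ((A,G),(C,F)), ((A,H),(C,F)), ((B,H),(C,E)). The tree encoding and the tie-breaking rule (first action listed wins) are assumptions made for this illustration; the version that propagates all subgame equilibria is not shown.

```python
# Decision node = (agent, {action: subtree}); leaf = payoff tuple (agent 1, agent 2).
GAME = (1, {"A": (2, {"C": (3, 8), "D": (8, 3)}),
            "B": (2, {"E": (5, 5), "F": (1, {"G": (2, 10), "H": (1, 0)})})})


def backward_induction(node):
    """Return (payoff vector, plan); plan maps a node's action tuple to the chosen action."""
    if not isinstance(node[1], dict):          # leaf: nothing to decide
        return node, {}
    agent, children = node
    plan, best_action, best_value = {}, None, None
    for action, child in children.items():
        value, sub_plan = backward_induction(child)
        plan.update(sub_plan)
        # Strict comparison: ties go to the first action listed.
        if best_value is None or value[agent - 1] > best_value[agent - 1]:
            best_action, best_value = action, value
    plan[tuple(children)] = best_action
    return best_value, plan


value, plan = backward_induction(GAME)
print(value)  # (3, 8)
print(plan)
```

The resulting plan selects A and G for agent 1 and C and F for agent 2, which matches the subgame perfect equilibrium ((A,G),(C,F)).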

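The Centipede Problem
[Game tree] Agents 1 and 2 alternate over five decision nodes (1, 2, 1, 2, 1). At each node the deciding agent chooses C (move on to the next node) or E (end the game). Payoffs (agent 1, agent 2) for E, in node order: (1,0), (0,2), (3,1), (2,4), (4,3); if agent 1 chooses C at the last node: (3,5).
Subgame perfect equilibrium: every agent plays E at each of its nodes, i.e. ((E,E,E),(E,E)).
The outcome of this strategy profile, (1,0), is Pareto dominated by all but one of the other outcomes.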
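Because the centipede game is a single chain, backward induction reduces to one pass from the last node to the first. The sketch assumes the five-node reading of the tree (agents 1, 2, 1, 2, 1 with exit payoffs (1,0), (0,2), (3,1), (2,4), (4,3) and final payoff (3,5)); the variable names are made up.

```python
# (deciding agent, payoff if that agent ends the game with E), in node order.
EXITS = [(1, (1, 0)), (2, (0, 2)), (1, (3, 1)), (2, (2, 4)), (1, (4, 3))]
FINAL = (3, 5)   # payoff if agent 1 plays C at the last node

value, choices = FINAL, []
for agent, exit_payoff in reversed(EXITS):
    # The deciding agent compares ending now with the value of continuing.
    if exit_payoff[agent - 1] > value[agent - 1]:
        choices.append("E")
        value = exit_payoff
    else:
        choices.append("C")
choices.reverse()


def dominates(x, y):
    """x Pareto dominates y: nobody is worse off and the payoffs differ."""
    return all(a >= b for a, b in zip(x, y)) and x != y


others = [p for _, p in EXITS[1:]] + [FINAL]
dominating = [p for p in others if dominates(p, value)]
print(choices, value)   # ['E', 'E', 'E', 'E', 'E'] (1, 0)
print(dominating)       # [(3, 1), (2, 4), (4, 3), (3, 5)]
```

Four of the five other outcomes dominate (1,0); only (0,2) does not, because agent 1 would be worse off.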

Imperfect Information Games
Imperfect information games handle situations where agents do not have complete knowledge of the stage of the game or of the decisions the other agents are taking.
Imperfect information is modeled by assigning the nodes of the decision tree to information sets; different nodes in the same information set cannot be distinguished.
Unknown actions of other agents or incomplete knowledge of the stage of the game lead to indistinguishable nodes.

Imperfect Information Game
An imperfect information game in extensive form is defined as a tuple (N, A, H, Z, χ, ρ, σ, u, I):
  N, A, H, Z, χ, ρ, σ, u define a perfect information game
  I = (I_1, …, I_n) is the vector of the information sets I_i of agent i, defining the sets of indistinguishable nodes for this agent
  I_i = (I_{i,1}, …, I_{i,k_i}) is a partition of the nodes assigned to agent i, where nodes in the same element of the partition (equivalence class) are indistinguishable for agent i
Nodes in the same class must offer the same set of actions, since otherwise the agent could tell them apart.

Pure Strategies in Imperfect Information Games
A pure strategy for agent i in an imperfect information game is a complete specification of the (deterministic) action the agent will take in each information class.
Strategies have to include action choices even for information classes (and thus nodes) that cannot be reached under the strategy.

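Imperfect Information Game
Prisoners' Dilemma
[Game tree] Agent 1 chooses C or S; agent 2 then chooses C or S without observing agent 1's choice (both of agent 2's nodes are in the same information set). Payoffs (agent 1, agent 2):
  1: C → 2: C → (-5,-5), S → (-1,-10)
  1: S → 2: C → (-10,-1), S → (-3,-3)
Pure strategies for agent 1: (C), (S)
Pure strategies for agent 2: (C), (S)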
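In an imperfect information game a pure strategy assigns one action per information set rather than per node. The sketch below shows this for the Prisoners' Dilemma and checks for pure strategy Nash equilibria; the node names and the `INFO_SETS` layout are assumptions made for illustration.

```python
from itertools import product

ACTIONS = ("C", "S")
# Information sets per agent (node names are made up).
# Agent 2's two nodes share one set, so it cannot condition on agent 1's move.
INFO_SETS = {1: [("root",)], 2: [("after_C", "after_S")]}
PAYOFF = {("C", "C"): (-5, -5), ("C", "S"): (-1, -10),
          ("S", "C"): (-10, -1), ("S", "S"): (-3, -3)}


def pure_strategies(agent):
    """One action per information set (not per node)."""
    return list(product(ACTIONS, repeat=len(INFO_SETS[agent])))


S1, S2 = pure_strategies(1), pure_strategies(2)


def is_nash(a1, a2):
    """No agent can gain by a unilateral deviation."""
    u1, u2 = PAYOFF[a1, a2]
    return (all(PAYOFF[d, a2][0] <= u1 for d in ACTIONS)
            and all(PAYOFF[a1, d][1] <= u2 for d in ACTIONS))


equilibria = [(s1[0], s2[0]) for s1 in S1 for s2 in S2 if is_nash(s1[0], s2[0])]
print(len(S1), len(S2))  # 2 2
print(equilibria)        # [('C', 'C')]
```

If agent 2's two nodes were placed in separate information sets, `pure_strategies(2)` would return four strategies instead of two, which is the perfect information version of the game.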

Strategies and Equilibria
All solution strategies can be defined as in perfect information games.
As in perfect information games, every imperfect information game can be converted into a normal form game.
Every normal form game can be converted into an imperfect information game: simply put all nodes for player 2 into the same information class.
Mapping back and forth does not necessarily return the same game, but it returns a game with the same strategy space and equilibria.

Randomized Strategies
In imperfect information games there is a second way to define randomized strategies:
Mixed strategies: randomization over pure strategies
Behavioral strategies: independent randomization over the actions in each information set

Mixed and Behavioral Strategies
[Game tree] Agent 1 chooses A or B. After A, agent 2 chooses C → (3,8) or D → (8,3). After B, agent 2 chooses E → (5,5) or F, after which agent 1 chooses G → (2,10) or H → (1,0).
Mixed strategy example for agent 1: (0.6:(A,G); 0.4:(B,H))
Behavioral strategy example for agent 1: ([0.5:A; 0.5:B], [0.3:G; 0.7:H])
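The difference between the two representations shows up when the behavioral example is expanded into a distribution over pure strategies: the per-information-set choices are independent, so the probabilities multiply. A sketch using exact fractions; the function name `to_mixed` is made up.

```python
from fractions import Fraction as F
from itertools import product

# Agent 1's behavioral strategy from the example: one distribution per information set.
behavioral = [{"A": F(1, 2), "B": F(1, 2)}, {"G": F(3, 10), "H": F(7, 10)}]


def to_mixed(behavioral):
    """Probability of a pure strategy = product of the independent per-set probabilities."""
    mixed = {}
    for combo in product(*(dist.items() for dist in behavioral)):
        actions = tuple(action for action, _ in combo)
        prob = F(1)
        for _, p in combo:
            prob *= p
        mixed[actions] = prob
    return mixed


mixed = to_mixed(behavioral)
for strategy, prob in mixed.items():
    print(strategy, prob)
```

The mixed strategy example (0.6:(A,G); 0.4:(B,H)) is not of this product form: it correlates the choice at the root with the choice between G and H.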

Randomized Strategies
The expressive power of mixed and behavioral strategies is not comparable in general.
In some games there are outcomes that can be achieved using mixed strategies but not using behavioral strategies.
In some games there are outcomes that can be achieved using behavioral strategies but not using mixed strategies.

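Behavioral Strategy Example
[Game tree] Agent 1 chooses L or R. After L, agent 1 moves again without remembering its first move (both of its nodes are in the same information set): L → (1,0), R → (100,100). After R, agent 2 chooses U → (5,1) or D → (2,2).
Pure strategies: agent 1: (L), (R); agent 2: (U), (D)
Mixed strategy equilibrium: (R, D)
Behavioral strategy equilibrium: ([98/198:L; 100/198:R], D)
Since D is strictly dominant for agent 2, agent 1's behavioral equilibrium strategy is found by maximizing its expected utility against D over the probability of playing L.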
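The value 98/198 can be reproduced by writing out agent 1's expected utility against D. The sketch assumes the tree in which agent 1 plays L with the same probability p at both of its nodes (L,L → (1,0); L,R → (100,100); R then D → (2,2)); the function name is made up.

```python
from fractions import Fraction as F


def expected_utility(p):
    """Agent 1's expected utility against D when it plays L with probability p at both nodes."""
    return p * p * 1 + p * (1 - p) * 100 + (1 - p) * 2


# expected_utility(p) = -99 p^2 + 98 p + 2 is a concave parabola with its vertex at 98 / (2 * 99).
p_star = F(98, 2 * 99)
print(p_star, 1 - p_star)        # 49/99 50/99, i.e. 98/198 and 100/198
print(expected_utility(p_star))  # 2599/99
print(expected_utility(F(0)))    # 2, the utility of the pure strategy R
```

The randomized behavioral strategy earns roughly 26.25, far more than the 2 obtained by the pure strategy R, which is why the behavioral equilibrium differs from the mixed strategy equilibrium in this game.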

Perfect Recall
A player in an imperfect information game has perfect recall if it does not forget anything it knew about the moves made so far.
For any two nodes in the same information set of player i, the paths leading to them must pass through the same sequence of information classes, and the actions taken by agent i along one path must be the same as the corresponding ones along the other.

Perfect Recall
Formally: player i has perfect recall if, for any two nodes h, h' in the same information class of player i, and for any path h_0, a_0, …, h_n, a_n, h from the root to h and any path h_0, a'_0, h'_1, …, h'_m, a'_m, h' from the root to h':
  m = n
  for all 0 ≤ j ≤ n, h_j and h'_j are in the same information class for player i
  for all 0 ≤ j ≤ n, if ρ(h_j) = i then a_j = a'_j
A game of perfect recall is an imperfect information game in which every agent has perfect recall.
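The three conditions can be checked directly once the root paths of the nodes in each information class are listed. This is a sketch under an assumed encoding: each path is a list of (information class, deciding agent, action) steps, and the class names `I1` and `I2` as well as both example games' encodings are made up.

```python
def has_perfect_recall(agent, paths_by_class):
    """paths_by_class maps each information class of `agent` to the root paths of its nodes.

    A path is a list of (information class, deciding agent, action) steps.
    """
    for paths in paths_by_class.values():
        reference = paths[0]
        for other in paths[1:]:
            if len(other) != len(reference):            # condition m = n
                return False
            for (c1, owner, a1), (c2, _, a2) in zip(reference, other):
                if c1 != c2:                            # same information class at step j
                    return False
                if owner == agent and a1 != a2:         # the agent's own actions agree
                    return False
    return True


# Prisoners' Dilemma: agent 2's two nodes are reached after agent 1 plays C or S.
prisoners = {"I2": [[("I1", 1, "C")], [("I1", 1, "S")]]}
# Game in which agent 1 forgets its own first move: the root and the node after L share a class.
forgetful = {"I1": [[], [("I1", 1, "L")]]}

print(has_perfect_recall(2, prisoners))  # True
print(has_perfect_recall(1, forgetful))  # False
```

Comparing every path against the first one in its class is sufficient here because all three conditions are equalities and therefore transitive.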

Games of Perfect Recall
Theorem (Kuhn, 1953): In a game of perfect recall, any mixed strategy of a given agent can be replaced by an equivalent behavioral strategy, and any behavioral strategy can be replaced by an equivalent mixed strategy.
In games of perfect recall, Nash equilibria can therefore be found in the form of behavioral strategies.
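One direction of the theorem can be illustrated with the mixed strategy example (0.6:(A,G); 0.4:(B,H)) for agent 1: the behavioral probabilities at an information set are the mixed probabilities conditioned on reaching that set. The sketch is specific to that two-set game, in which agent 1's second set (G/H) can only be reached after B, and it assumes B has positive probability.

```python
from fractions import Fraction as F

# Example mixed strategy of agent 1 over pure strategies (root action, action at G/H).
mixed = {("A", "G"): F(6, 10), ("B", "H"): F(4, 10)}

# Root information set: marginal probability of each first action.
p_A = sum(p for (first, _), p in mixed.items() if first == "A")
p_B = 1 - p_A

# Second information set: condition on the pure strategies that can reach it (those playing B).
reaching = {s: p for s, p in mixed.items() if s[0] == "B"}
p_H = sum(p for (_, second), p in reaching.items() if second == "H") / sum(reaching.values())

behavioral = [{"A": p_A, "B": p_B}, {"G": 1 - p_H, "H": p_H}]
print(behavioral)
```

The resulting behavioral strategy plays A with probability 3/5 and, whenever the second set is reached, H with probability 1; it induces the same distribution over outcomes as the mixed strategy against any strategy of agent 2.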

Equilibria for Games of Perfect Recall
One option is to convert the game to normal form and solve the normal form game. The cost is exponential: the induced normal form can be exponentially larger than the extensive form.
In games of perfect recall the sequence form can be used to accelerate the solution by avoiding the increase in the size of the game that comes with the conversion to normal form.
Instead of strategies, the sequence form uses the action sequences of the agents on the path to a terminal node together with realization probabilities (the probabilities of reaching the terminal nodes under the strategy).
Zero-sum games can be solved in time polynomial in the size of the extensive form game.
General-sum games can be solved in time exponential in the size of the extensive form game.
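Realization probabilities can be illustrated for agent 1 in the earlier example with actions A/B at the root and G/H after B, using the behavioral strategy ([0.5:A; 0.5:B], [0.3:G; 0.7:H]). This sketch only computes the realization plan; it does not set up or solve the sequence-form program, and the information set names are made up.

```python
from fractions import Fraction as F

behavioral = {"root": {"A": F(1, 2), "B": F(1, 2)},
              "second": {"G": F(3, 10), "H": F(7, 10)}}
info_set_of = {"A": "root", "B": "root", "G": "second", "H": "second"}

# Agent 1's sequences: its own actions along a path from the root; () is the empty sequence.
sequences = [(), ("A",), ("B",), ("B", "G"), ("B", "H")]


def realization(seq):
    """Product of the behavioral probabilities of the agent's own actions in the sequence."""
    prob = F(1)
    for action in seq:
        prob *= behavioral[info_set_of[action]][action]
    return prob


plan = {seq: realization(seq) for seq in sequences}
for seq, prob in plan.items():
    print(seq, prob)

# Linear constraints of a realization plan: each sequence's probability is
# split among its extensions at the information set that follows it.
assert plan[()] == plan[("A",)] + plan[("B",)] == 1
assert plan[("B",)] == plan[("B", "G")] + plan[("B", "H")]
```

The number of sequences grows linearly with the size of the tree, whereas the number of pure strategies grows exponentially with the number of information sets; the linear constraints are what make the zero-sum case solvable by linear programming.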