CPS 173 Extensive-form games

Slides:



Advertisements
Similar presentations
CPS Extensive-form games Vincent Conitzer
Advertisements

5. Combining simultaneous and sequential moves.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Basics on Game Theory Class 2 Microeconomics. Introduction Why, What, What for Why Any human activity has some competition Human activities involve actors,
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
Short introduction to game theory 1. 2  Decision Theory = Probability theory + Utility Theory (deals with chance) (deals with outcomes)  Fundamental.
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
Extensive Games with Imperfect Information
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Chapter 11 Game Theory and the Tools of Strategic Business Analysis.
Sep. 5, 2013 Lirong Xia Introduction to Game Theory.
Basics on Game Theory For Industrial Economics (According to Shy’s Plan)
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Static Games of Complete Information: Subgame Perfection
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
EC941 - Game Theory Prof. Francesco Squintani Lecture 5 1.
CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.
Extensive-form games Vincent Conitzer
Dynamic Games & The Extensive Form
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Chapter 5 Game Theory and the Tools of Strategic Business Analysis.
Extensive Games with Imperfect Information
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.
Game theory 2 Lukáš Lehotský
Incomplete information: Perfect Bayesian equilibrium
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
Dynamic games, Stackelburg Cournot and Bertrand
Extensive Form (Dynamic) Games With Perfect Information (Theory)
Lec 23 Chapter 28 Game Theory.
Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Yuan Deng Vincent Conitzer Duke University
CPS Some practice questions
Introduction to Game Theory
Introduction to Game Theory
Lecture 6: Other Game Models and Solution Concepts
Game Theory: The Competitive Dynamics of Strategy
5. Combining simultaneous and sequential moves.
Computing equilibria in extensive form games
CPS 570: Artificial Intelligence Game Theory
CPS Some practice questions
Communication Complexity as a Lower Bound for Learning in Games
Extensive-form games and how to solve them
Vincent Conitzer CPS Repeated games Vincent Conitzer
Econ 805 Advanced Micro Theory 1
Introduction to Game Theory
Combining Sequential and Simultaneous Moves
Multiagent Systems Extensive Form Games © Manfred Huber 2018.
Game Theory Chapter 12.
Learning 6.2 Game Theory.
Chapter 29 Game Theory Key Concept: Nash equilibrium and Subgame Perfect Nash equilibrium (SPNE)
Instructor: Vincent Conitzer
Game Theory.
ECE700.07: Game Theory with Engineering Applications
CPS Extensive-form games
Multiagent Systems Repeated Games © Manfred Huber 2018.
Some practice questions
Vincent Conitzer Repeated games Vincent Conitzer
Vincent Conitzer Extensive-form games Vincent Conitzer
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Instructor: Vincent Conitzer
Vincent Conitzer CPS Learning in games Vincent Conitzer
Game theory review for agent design tutorial
A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

CPS 173 Extensive-form games Vincent Conitzer conitzer@cs.duke.edu

Extensive-form games with perfect information Players do not move simultaneously When moving, each player is aware of all the previous moves (perfect information) A (pure) strategy for player i is a mapping from player i’s nodes to actions Player 1 Player 2 Player 2 Player 1 2, 4 5, 3 3, 2 Leaves of the tree show player 1’s utility first, then player 2’s utility 1, 0 0, 1

Backward induction When we know what will happen at each of a node’s children, we can decide the best action for the player who is moving at that node Player 1 Player 2 Player 2 Player 1 2, 4 5, 3 3, 2 1, 0 0, 1

A limitation of backward induction If there are ties, then how they are broken affects what happens higher up in the tree Multiple equilibria… Player 1 .12345 .87655 Player 2 Player 2 1/2 1/2 3, 2 2, 3 4, 1 0, 1

Conversion from extensive to normal form Player 1 LR = Left if 1 moves Left, Right if 1 moves Right; etc. LL LR RL RR 3, 2 2, 3 4, 1 0, 1 L Player 2 Player 2 R Nash equilibria of this normal-form game include (R, LL), (R, RL), (L, RR) + infinitely many mixed-strategy equilibria In general, normal form can have exponentially many strategies 3, 2 2, 3 4, 1 0, 1

Converting the first game to normal form LL LR RL RR Player 1 2, 4 5, 3 3, 2 1, 0 0, 1 LL LR RL Player 2 Player 2 RR Pure-strategy Nash equilibria of this game are (LL, LR), (LR, LR), (RL, LL), (RR, LL) But the only backward induction solution is (RL, LL) Player 1 2, 4 5, 3 3, 2 1, 0 0, 1

Subgame perfect equilibrium Each node in a (perfect-information) game tree, together with the remainder of the game after that node is reached, is called a subgame A strategy profile is a subgame perfect equilibrium if it is an equilibrium for every subgame LL LR RL RR LL 2, 4 5, 3 3, 2 1, 0 0, 1 Player 1 LR RL RR Player 2 Player 2 *L *R ** *L 3, 2 1, 0 0, 1 *L 1, 0 0, 1 *R *R Player 1 2, 4 5, 3 3, 2 (RR, LL) and (LR, LR) are not subgame perfect equilibria because (*R, **) is not an equilibrium (LL, LR) is not subgame perfect because (*L, *R) is not an equilibrium *R is not a credible threat 1, 0 0, 1

Imperfect information Dotted lines indicate that a player cannot distinguish between two (or more) states A set of states that are connected by dotted lines is called an information set Reflected in the normal-form representation Player 1 L R 0, 0 -1, 1 1, -1 -5, -5 L Player 2 Player 2 R 0, 0 -1, 1 1, -1 -5, -5 Any normal-form game can be transformed into an imperfect-information extensive-form game this way

A poker-like game 0, 0 1, -1 .5, -.5 1.5, -1.5 -.5, .5 2/3 1/3 cc cf “nature” 2/3 1/3 1 gets King 1 gets Jack cc cf fc ff player 1 player 1 1/3 bb 0, 0 1, -1 .5, -.5 1.5, -1.5 -.5, .5 bet stay bet stay 2/3 bs player 2 player 2 sb call fold call fold call fold call fold ss 2 1 1 1 -2 1 -1 1

Subgame perfection and imperfect information How should we extend the notion of subgame perfection to games of imperfect information? Player 1 Player 2 Player 2 1, -1 -1, 1 -1, 1 1, -1 We cannot expect Player 2 to play Right after Player 1 plays Left, and Left after Player 1 plays Right, because of the information set Let us say that a subtree is a subgame only if there are no information sets that connect the subtree to parts outside the subtree

Subgame perfection and imperfect information… Player 1 Player 2 Player 2 Player 2 4, 1 0, 0 5, 1 1, 0 3, 2 2, 3 One of the Nash equilibria is: (R, RR) Also subgame perfect (the only subgames are the whole game, and the subgame after Player 1 moves Right) But it is not reasonable to believe that Player 2 will move Right after Player 1 moves Left/Middle (not a credible threat) There exist more sophisticated refinements of Nash equilibrium that rule out such behavior

Computing equilibria in the extensive form Can just use normal-form representation Misses issues of subgame perfection, etc. Another problem: there are exponentially many pure strategies, so normal form is exponentially larger Even given polynomial-time algorithms for normal form, time would still be exponential in the size of the extensive form There are other techniques that reason directly over the extensive form and scale much better E.g., using the sequence form of the game

Commitment Consider the following (normal-form) game: 2, 1 4, 0 1, 0 3, 1 How should this game be played? Now suppose the game is played as follows: Player 1 commits to playing one of the rows, Player 2 observes the commitment and then chooses a column What is the optimal strategy for player 1? What if 1 can commit to a mixed strategy?

Commitment as an extensive-form game For the case of committing to a pure strategy: Player 1 Up Down Player 2 Player 2 Left Right Left Right 2, 1 4, 0 1, 0 3, 1

Commitment as an extensive-form game For the case of committing to a mixed strategy: Player 1 (1,0) (=Up) (.5,.5) (0,1) (=Down) … … Player 2 Left Right Left Right Left Right 3, 1 2, 1 4, 0 1.5, .5 3.5, .5 1, 0 Infinite-size game; computationally impractical to reason with the extensive form here

Solving for the optimal mixed strategy to commit to [Conitzer & Sandholm 2006; see also: von Stengel & Zamir 2009, Letchford, Conitzer, Munagala 2009] For every column t separately, we will solve separately for the best mixed row strategy (defined by ps) that induces player 2 to play t maximize Σs ps u1(s, t) subject to for any t’, Σs ps u2(s, t) ≥ Σs ps u2(s, t’) Σs ps = 1 (May be infeasible, e.g., if t is strictly dominated) Pick the t that is best for player 1

Visualization (0,1,0) = M C R L (1,0,0) = U (0,0,1) = D L C R U 0,1 4,0 D 1,1 (0,1,0) = M C R L (1,0,0) = U (0,0,1) = D