CPS 296.1 Extensive-form games Vincent Conitzer

Slides:



Advertisements
Similar presentations
N. Basilico, N. Gatti, F. Amigoni DEI, Politecnico di Milano Leader-Follower Strategies for Robotic Patrolling in Environments with Arbitrary Topology.
Advertisements

Stolen Art 50,000 paintings stolen from museums and private collections around the world (including 287 by Picasso, 43 by Van Gogh, 26 by Renoir, more.
Preprocessing Techniques for Computing Nash Equilibria Vincent Conitzer Duke University Based on: Conitzer and Sandholm. A Generalized Strategy Eliminability.
An Introduction to Game Theory Part V: Extensive Games with Perfect Information Bernhard Nebel.
Concise representations of games Vincent Conitzer
Game Theory Lecture 4 Game Theory Lecture 4.
5. Combining simultaneous and sequential moves.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Pondering more Problems. Enriching the Alice-Bob story Go to AGo to B Go to A Alice Go to B Go to A Go to B Go shoot pool Alice.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Basics on Game Theory Class 2 Microeconomics. Introduction Why, What, What for Why Any human activity has some competition Human activities involve actors,
Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.
ECON 100 Tutorial: Week 9 office: LUMS C85.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
Economics 202: Intermediate Microeconomic Theory 1.HW #6 on website. Due Tuesday. 2.Second test covers up through today’s material, and will be “pseudo-cumulative”
Short introduction to game theory 1. 2  Decision Theory = Probability theory + Utility Theory (deals with chance) (deals with outcomes)  Fundamental.
General-sum games You could still play a minimax strategy in general- sum games –I.e., pretend that the opponent is only trying to hurt you But this is.
1 Deter Entry. 2 Here we see a model of deterring entry by an existing monopoly firm. We will also introduce the notion of a sequential, or dynamic, game.
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
Extensive Games with Imperfect Information
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Chapter 11 Game Theory and the Tools of Strategic Business Analysis.
Sep. 5, 2013 Lirong Xia Introduction to Game Theory.
Basics on Game Theory For Industrial Economics (According to Shy’s Plan)
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Static Games of Complete Information: Subgame Perfection
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
EC941 - Game Theory Prof. Francesco Squintani Lecture 5 1.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.
Extensive-form games Vincent Conitzer
Dynamic Games & The Extensive Form
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Chapter 5 Game Theory and the Tools of Strategic Business Analysis.
Extensive Games with Imperfect Information
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.
Game theory 2 Lukáš Lehotský
Strategic Behavior in Business and Econ Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies.
CPS 570: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Incomplete information: Perfect Bayesian equilibrium
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
Dynamic games, Stackelburg Cournot and Bertrand
Extensive Form (Dynamic) Games With Perfect Information (Theory)
Lec 23 Chapter 28 Game Theory.
ECO290E: Game Theory Lecture 8 Games in Extensive-Form.
Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Dynamic Game Theory and the Stackelberg Model. Dynamic Game Theory So far we have focused on static games. However, for many important economic applications.
Yuan Deng Vincent Conitzer Duke University
CPS Some practice questions
Introduction to Game Theory
Introduction to Game Theory
Lecture 6: Other Game Models and Solution Concepts
5. Combining simultaneous and sequential moves.
CPS Some practice questions
Communication Complexity as a Lower Bound for Learning in Games
Extensive-form games and how to solve them
Multiagent Systems Extensive Form Games © Manfred Huber 2018.
ECE700.07: Game Theory with Engineering Applications
CPS Extensive-form games
Multiagent Systems Repeated Games © Manfred Huber 2018.
Some practice questions
Vincent Conitzer Extensive-form games Vincent Conitzer
CPS 173 Extensive-form games
Instructor: Vincent Conitzer
A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.
Presentation transcript:

CPS Extensive-form games Vincent Conitzer

Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 1 Players do not move simultaneously When moving, each player is aware of all the previous moves (perfect information) A (pure) strategy for player i is a mapping from player i’s nodes to actions Leaves of the tree show player 1’s utility first, then player 2’s utility

Backward induction Player 1 Player 2 Player 1 2, 45, 3 3, 2 1, 00, 1 When we know what will happen at each of a node’s children, we can decide the best action for the player who is moving at that node

A limitation of backward induction Player 1 Player 2 3, 22, 34, 1 If there are ties, then how they are broken affects what happens higher up in the tree Multiple equilibria… 0, 1 1/ /

Conversion from extensive to normal form Player 1 Player 2 3, 22, 34, 1 Nash equilibria of this normal-form game include (R, LL), (R, RL), (L, RR) + infinitely many mixed-strategy equilibria In general, normal form can have exponentially many strategies 0, 1 3, 2 2, 3 4, 10, 14, 10, 1 L R LLLRRLRR LR = Left if 1 moves Left, Right if 1 moves Right; etc.

Converting the first game to normal form Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 1 2, 4 5, 3 2, 4 5, 3 3, 21, 03, 21, 0 3, 20, 13, 20, 1 LLLRRLRR LL RL RR Pure-strategy Nash equilibria of this game are (LL, LR), (LR, LR), (RL, LL), (RR, LL) But the only backward induction solution is (RL, LL) LR

Subgame perfect equilibrium 2, 4 5, 3 2, 4 5, 3 3, 21, 03, 21, 0 3, 20, 13, 20, 1 Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 1 LLLRRLRR LL RL RR Each node in a (perfect-information) game tree, together with the remainder of the game after that node is reached, is called a subgame A strategy profile is a subgame perfect equilibrium if it is an equilibrium for every subgame LR 3, 21, 0 3, 20, 1 *L*R *L *R 1, 0 0, 1 ** *L *R (RR, LL) and (LR, LR) are not subgame perfect equilibria because (*R, **) is not an equilibrium (LL, LR) is not subgame perfect because (*L, *R) is not an equilibrium *R is not a credible threat

Imperfect information Dotted lines indicate that a player cannot distinguish between two (or more) states –A set of states that are connected by dotted lines is called an information set Reflected in the normal-form representation Player 1 Player 2 0, 0-1, 11, -1-5, -5 0, 0-1, 1 1, -1-5, -5 LR L R Any normal-form game can be transformed into an imperfect-information extensive-form game this way

A poker-like game 1 gets King1 gets Jack bet stay callfoldcallfoldcallfoldcallfold “nature” player 1 player , 0 1, -1.5, , -1.50, 01, ,.5 1, -1 0, 01, -10, 01, -1 cccffcff bb sb ss bs 2/3 1/3 2/3

Subgame perfection and imperfect information Player 1 Player 2 1, -1-1, 1 1, -1 How should we extend the notion of subgame perfection to games of imperfect information? We cannot expect Player 2 to play Right after Player 1 plays Left, and Left after Player 1 plays Right, because of the information set Let us say that a subtree is a subgame only if there are no information sets that connect the subtree to parts outside the subtree

Subgame perfection and imperfect information… Player 1 Player 2 4, 10, 05, 11, 0 One of the Nash equilibria is: (R, RR) Also subgame perfect (the only subgames are the whole game, and the subgame after Player 1 moves Right) But it is not reasonable to believe that Player 2 will move Right after Player 1 moves Left/Middle (not a credible threat) There exist more sophisticated refinements of Nash equilibrium that rule out such behavior Player 2 3, 22, 3

Computing equilibria in the extensive form Can just use normal-form representation –Misses issues of subgame perfection, etc. Another problem: there are exponentially many pure strategies, so normal form is exponentially larger –Even given polynomial-time algorithms for normal form, time would still be exponential in the size of the extensive form There are other techniques that reason directly over the extensive form and scale much better –E.g., using the sequence form of the game

Commitment Consider the following (normal-form) game: 2, 14, 0 1, 03, 1 How should this game be played? Now suppose the game is played as follows: –Player 1 commits to playing one of the rows, –Player 2 observes the commitment and then chooses a column What is the optimal strategy for player 1? What if 1 can commit to a mixed strategy?

Commitment as an extensive-form game Player 1 Player 2 2, 14, 01, 03, 1 For the case of committing to a pure strategy: UpDown Left Right

Commitment as an extensive-form game Player 1 Player 2 2, 14, 01, 0 3, 1 For the case of committing to a mixed strategy: (1,0) (=Up) Left Right 1.5,.53.5,.5 LeftRight (0,1) (=Down) (.5,.5) … … Infinite-size game; computationally impractical to reason with the extensive form here

Solving for the optimal mixed strategy to commit to [Conitzer & Sandholm 2006; see also: von Stengel & Zamir 2004, Letchford, Conitzer, Munagala 2009] For every column t separately, we will solve separately for the best mixed row strategy (defined by p s ) that induces player 2 to play t maximize Σ s p s u 1 (s, t) subject to for any t’, Σ s p s u 2 (s, t) ≥ Σ s p s u 2 (s, t’) Σ s p s = 1 (May be infeasible, e.g., if t is strictly dominated) Pick the t that is best for player 1

Visualization LCR U 0,11,00,0 M 4,00,10,0 D 1,01,1 (1,0,0) = U (0,1,0) = M (0,0,1) = D L C R