Chapter 14 & 15 Repeated Games.

Slides:

Advertisements

Similar presentations

Vincent Conitzer CPS Repeated games Vincent Conitzer

Advertisements

Some Problems from Chapt 13

Infinitely Repeated Games

Crime, Punishment, and Forgiveness

Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)

Game Theory “Доверяй, Но Проверяй” - Russian Proverb (Trust, but Verify) - Ronald Reagan Mike Shor Lecture 6.

Game Theory “Доверяй, Но Проверяй” (“Trust, but Verify”) - Russian Proverb (Ronald Reagan) Topic 5 Repeated Games.

Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.

Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.

Chapter 14 Infinite Horizon 1.Markov Games 2.Markov Solutions 3.Infinite Horizon Repeated Games 4.Trigger Strategy Solutions 5.Investing in Strategic Capital.

M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture

Infinitely Repeated Games Econ 171. Finitely Repeated Game Take any game play it, then play it again, for a specified number of times. The game that is.

EC941 - Game Theory Lecture 7 Prof. Francesco Squintani

Game Theory: Inside Oligopoly

Game Theory Lecture 9.

Repeated Games and the Prisoner’s Dilemma. Prisoner’s dilemma What if the game is played “repeatedly” for several periods? DefectCooperate Defect10 yr,

Industrial Organization - Matilde Machado Tacit Collusion Tacit Collusion Matilde Machado.

Game Theory Lecture 8.

Repeated Prisoner’s Dilemma If the Prisoner’s Dilemma is repeated, cooperation can come from strategies including: “Grim Trigger” Strategy – one.

Games People Play. 8: The Prisoners’ Dilemma and repeated games In this section we shall learn How repeated play of a game opens up many new strategic.

Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.

Yale Lectures 21 and Repeated Games: Cooperation vs the End Game.

ECON6036 1st semester Format of final exam Same as the mid term

Game Theory: Key Concepts Zero Sum Games Zero Sum Games Non – Zero Sum Games Non – Zero Sum Games Strategic Form Games  Lay out strategies Strategic Form.

APEC 8205: Applied Game Theory Fall 2007

Repeated games - example This stage game is played 2 times Any SPNE where players behave differently than in a 1-time game? Player 2 LMR L1, 10, 05, 0.

TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.

Lecture 3: Oligopoly and Strategic Behavior Few Firms in the Market: Each aware of others’ actions Each firm in the industry has market power Entry is.

UNIT III: COMPETITIVE STRATEGY Monopoly Oligopoly Strategic Behavior 7/21.

Communication Networks A Second Course Jean Walrand Department of EECS University of California at Berkeley.

© 2009 Institute of Information Management National Chiao Tung University Lecture Notes II-2 Dynamic Games of Complete Information Extensive Form Representation.

1 Game Theory Sequential bargaining and Repeated Games Univ. Prof.dr. M.C.W. Janssen University of Vienna Winter semester Week 46 (November 14-15)

Punishment and Forgiveness in Repeated Games. A review of present values.

ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.

Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -

Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.

Lecture 5 Leadership and Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players.

Punishment, Detection, and Forgiveness in Repeated Games.

Final Lecture. Problem 2, Chapter 13 Exploring the problem Note that c, x yields the highest total payoff of 7 for each player. Is this a Nash equilibrium?

Lecture 6 Reputation Reputations arise in situations where there is an element of repetition, and also where coordination between players is possible.

Noncooperative Collusion The Cournot Oligopoly model was a single-period noncooperative model. In most models of imperfect competition, it is true that.

Punishment, Detection, and Forgiveness in Repeated Games.

Bargaining games Econ 414. General bargaining games A common application of repeated games is to examine situations of two or more parties bargaining.

Lec 23 Chapter 28 Game Theory.

Entry Deterrence Players Two firms, entrant and incumbent Order of play Entrant decides to enter or stay out. If entrant enters, incumbent decides to fight.

Lecture 10 Collusion and Repeated games (Chapter 14, 15)

1 Strategic Thinking Lecture 7: Repeated Strategic Situations Suggested reading: Dixit and Skeath, ch. 9 University of East Anglia School of Economics.

Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.

Yuan Deng Vincent Conitzer Duke University

ECONOMICS FOR BUSINESS (MICROECONOMICS) Lesson 8 Prof. Paolo Buccirossi Alessia Marrazzo.

Intermediate Microeconomics

Module 32 Game Theory.

Dynamic Games of Complete Information

Vincent Conitzer CPS Repeated games Vincent Conitzer

Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.

Learning 6.2 Game Theory.

Game Theory Applications in Network Design

Managerial Economics Kyle Anderson

Chapter 29 Game Theory Key Concept: Nash equilibrium and Subgame Perfect Nash equilibrium (SPNE)

Games with Imperfect Information Bayesian Games

ECE700.07: Game Theory with Engineering Applications

Multiagent Systems Repeated Games © Manfred Huber 2018.

Game Theory Fall Mike Shor Topic 5.

Vincent Conitzer Repeated games Vincent Conitzer

Chapter 14 & 15 Repeated Games.

Molly W. Dahl Georgetown University Econ 101 – Spring 2009

UNIT II: The Basic Theory

Lecture Game Theory.

Game Theory Spring Mike Shor Topic 5.

Vincent Conitzer CPS Repeated games Vincent Conitzer

Presentation transcript:

Chapter 14 & 15 Repeated Games

Repeated Prisoners Dilemma Thus far we have studied games, both static and dynamic, which are only played once. We now move to an environment where players interact repeatedly. Each player can now condition his action on previous actions by the other players. The first application will be to the prisoner’s dilemma, studied in chapter 2.

Repeated Prisoners Dilemma Recall the Prisoner’s Dilemma in strategic form (positive payoffs): Cooperate Defect Cooperate 2,2 0,3 Defect 3,0 1,1 Note in the PD game, Cooperate = Remain Silent and Defect = Confess. Clearly one pure strategy NE at (D,D).

Repeated Prisoners Dilemma Suppose now that the game is repeated indefinitely. Are there any strategies we can write down that would sustain (C,C) as the equilibrium strategy in every repetition of the game?

Repeated Prisoners Dilemma Consider the following “Grim Trigger Strategy” for each player: Play C in the first period. Play C in every period as long as all players in all previous periods have played C. If any player has deviated (to D) in any previous period, play D in all periods forward. If the strategy sustains cooperation, players obtain a payoff: 2 + 2 + 2 + … Starting in any period that a player deviates, the deviating player gets 3 + 1 + 1 + … So cooperating in all periods is optimal. The short term gains are outweighed by the long term losses.

Repeated Prisoners Dilemma Issues: Patience: is $1000 today worth the same as $1000 a year from now? We assumed above that players are infinitely patient. What if players weren’t so patient? Option A / Option B Other NE? Another NE would be both players playing D in all periods. Grim Trigger seems like a strategy with a very severe punishment? Could we sustain cooperation with something less severe? What if there is a final period to the game?

Primer on Arithmetic Series Gauss Sum Geometric Sum Limit of a geometric sum. If |a| < 1, then as n  ∞: What if the summation index starts at 1 instead?

Primer on Arithmetic Series Two other useful sums (Still assuming |a|<1). Even powers: Odd powers: Most general: a +ra + r2a + r3a + …. a/(1-r)

Repeated Prisoners Dilemma Discounting. Let di  (0,1) be the discount factor of player i with utility function u(.) If player i takes action at in period t, his discounted aggregate utility over T periods is: So if delta is close to 1, the player is very patient. If delta is close to 0, the player discounts the future a lot. We will usually assume di = d for all players. What if T =  and at = a for all t ? Then:

Repeated Prisoners Dilemma Definition: Repeated Game. Let G be a strategic game. Suppose there are N players and denote the set of actions and payoff function of each player i by Ai and ui respectively. The T-period repeated game of G for the discount factor d is the extensive game with perfect information and simultaneous moves in which: Each player chooses a sequence of actions (a1,…,aT). The players always choose from the same set of actions, Ai . Each player evaluates each action profile according to his discounted aggregate utility (or discounted average utility). Denote the game G(T,d) or in the case of an infinitely repeated game, G( ,d)

Repeated Prisoners Dilemma Nash equilibrium in the finitely repeated Prisoner’s Dilemma ? Solve period T and work backwards as usual. Unique NE is (D,D) in all periods. Cooperation is impossible to sustain. Nash equilibrium in the infinitely repeated Prisoner’s Dilemma ? Most real life situations are not finitely repeated games. Even if they may have a certain ending date, the number of interactions may be uncertain and thus modeling the PD game as an infinitely repeated game seems intuitively pleasing. As we stated, there may be a strategy in the infinitely repeated PD game that generates cooperation in all periods, for a given level of patience of the players.

Repeated Prisoners Dilemma Consider the following strategy for player i where j is the other player: Payoff along the equilibrium path: Payoff following a deviation in the first period:

Repeated Prisoners Dilemma So cooperate is optimal if So for discount factors greater than 1/2, cooperation in all periods can be sustained as a NE. Players must meet this minimum level of patience.

Repeated Prisoners Dilemma We now consider less draconian strategies than the Grim Trigger. Tit for Tat Strategy. Play C in the first period and then do whatever the other player did in the previous period in all subsequent periods. Limited Punishment Strategy. This strategy entails punishing a deviation for a certain number of periods and then reverting to the collusive outcome after the punishment no matter how players have acted during the punishment. For example, Play D in periods t=1, t=2, and t=3 if a deviation has occurred in t=0 and then play C in t=4.

Repeated Prisoners Dilemma Limited Punishment in the Prisoners Dilemma. Suppose the punishment phase is k=3 periods and both players are playing the same strategy. Consider the game starting in any period t. If no deviation occurs in periods t,t+1,t+2, and t+3 then, the payoffs to each player over these periods are: If player i deviates in period t, he knows that his opponent will play D for the next 3 periods so he should also play D in those periods. Thus his payoff is:

Repeated Prisoners Dilemma So cooperation is optimal if: What happens as k, the punishment period, gets larger? Delta approaches 1/2, the grim trigger cutoff.

Repeated Prisoners Dilemma Now consider the tit for tat strategy in the Prisoners Dilemma. Suppose player 1 is playing tit for tat and player 2 considers deviating to D in period t. Player 1 will respond with D in all periods until player 2 again chooses C. If player 2 chooses C, we revert to the same situation we started in (and again player 2 should deviate to D). So player 2 will either deviate and then play D forever or will alternate between D and C. Along the equilibrium path of the game, players earn 2/(1-d).

Repeated Prisoners Dilemma If player 2 deviates and then always plays D, his payoff is: If player 2 deviates and alternates C and D, his payoff is: So we need the equilibrium path payoffs to be larger than both of these: d > 1/2

Repeated Prisoners Dilemma So far we have been using trigger type mechanisms to sustain a collusive outcome (ie, Pareto optimal outcome). Can we attain any other payoffs as an equilibrium outcome of the game? Yes! Definition. The set of feasible payoff profiles of a strategic game is the set of all weighted averages of payoff profiles in the game. Eg, the prisoner’s dilemma: u2 (0,3) (2,2) (1,1) (3,0) u1

Repeated Prisoners Dilemma Folk Theorem for the Prisoners Dilemma: For any discount factor, 0<d<1, the discounted average payoff of each player i in any NE of G( ,d) is at least ui(D,D). Ie, players must at least get the NE payoffs of the static one-shot game. Let (x1,x2) be a feasible pair of payoffs in G for which xi > ui(D,D) for each player i. Then there is some d < 1, such that there is a NE of G( ,d) in which the discounted average payoff of each player i is xi. Note for any discount factor, we can always attain at least ui(D,D) as a NE of the infinitely repeated game.

Repeated Prisoners Dilemma Folk Theorem “Region” (or just the Folk Region) for the PD game: For every point in the shaded region, as long as d is high enough, we can generate those payoffs as the average discounted payoffs in a NE of the infinitely repeated game. u2 (0,3) (2,2) (1,1) (3,0) u1

Repeated Prisoners Dilemma See chapter 15 of Osborne for a discussion of general repeated games. The notation is just more general but the ideas, definitions, and theorems are the same. Two remaining topics. Alternating Offers Bargaining Cournot game / Monopoly outcome using grim trigger