05-11-2006 Center for the Study of Rationality Hebrew University of Jerusalem 1/39 n-Player Stochastic Games with Additive Transitions.

Slides:



Advertisements
Similar presentations
Vincent Conitzer CPS Repeated games Vincent Conitzer
Advertisements

Market Institutions: Oligopoly
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
The Big Match in small space Kristoffer Arnsfelt Hansen Rasmus Ibsen-Jensen Michal Koucký.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
Infinitely Repeated Games. In an infinitely repeated game, the application of subgame perfection is different - after any possible history, the continuation.
Chapter 14 Infinite Horizon 1.Markov Games 2.Markov Solutions 3.Infinite Horizon Repeated Games 4.Trigger Strategy Solutions 5.Investing in Strategic Capital.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.5.Repeated Games Lecture
Copyright (c) 2003 Brooks/Cole, a division of Thomson Learning, Inc
© 2015 McGraw-Hill Education. All rights reserved. Chapter 15 Game Theory.
Repeated games with Costly Observations Eilon Solan, Tel Aviv University Ehud Lehrer Tel Aviv University with.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Industrial Organization - Matilde Machado Tacit Collusion Tacit Collusion Matilde Machado.
Randomness for Free Laurent Doyen LSV, ENS Cachan & CNRS joint work with Krishnendu Chatterjee, Hugo Gimbert, Tom Henzinger.
An Introduction to Game Theory Part I: Strategic Games
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
Eponine Lupo.  Game Theory is a mathematical theory that deals with models of conflict and cooperation.  It is a precise and logical description of.
Planning under Uncertainty
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
January 20 Lecture Econ 171. The game of matching pennies has A)two pure strategy Nash equilibria B)One pure strategy Nash equilibrium C)One mixed strategy.
ECON6036 1st semester Format of final exam Same as the mid term
Stochastic Zero-sum and Nonzero-sum  -regular Games A Survey of Results Krishnendu Chatterjee Chess Review May 11, 2005.
INSTITUTO DE SISTEMAS E ROBÓTICA Minimax Value Iteration Applied to Robotic Soccer Gonçalo Neto Institute for Systems and Robotics Instituto Superior Técnico.
Learning in Games. Fictitious Play Notation! For n Players we have: n Finite Player’s Strategies Spaces S 1, S 2, …, S n n Opponent’s Strategies Spaces.
Outline MDP (brief) –Background –Learning MDP Q learning Game theory (brief) –Background Markov games (2-player) –Background –Learning Markov games Littman’s.
Nash Q-Learning for General-Sum Stochastic Games Hu & Wellman March 6 th, 2006 CS286r Presented by Ilan Lobel.
Repeated games - example This stage game is played 2 times Any SPNE where players behave differently than in a 1-time game? Player 2 LMR L1, 10, 05, 0.
TOPIC 6 REPEATED GAMES The same players play the same game G period after period. Before playing in one period they perfectly observe the actions chosen.
PRISONER’S DILEMMA By Ajul Shah, Hiten Morar, Pooja Hindocha, Amish Parekh & Daniel Castellino.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Extending Implicit Negotiation to Repeated Grid Games Robin Carnow Computer Science Department Rutgers University.
1 Sequential Correlated Equilibria in Stopping Games Yuval Heller Tel-Aviv University (Part of my Ph.D. thesis supervised by Eilon Solan)
Game Theoretic Analysis of Oligopoly lr L R 0000 L R 1 22 The Lane Selection Game Rational Play is indicated by the black arrows.
Stochastic Games Krishnendu Chatterjee CS 294 Game Theory.
Constraints in Repeated Games. Rational Learning Leads to Nash Equilibrium …so what is rational learning? Kalai & Lehrer, 1993.
Two-Stage Games APEC 8205: Applied Game Theory Fall 2007.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory.
Learning and Planning for POMDPs Eyal Even-Dar, Tel-Aviv University Sham Kakade, University of Pennsylvania Yishay Mansour, Tel-Aviv University.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Game Theory.
MAKING COMPLEX DEClSlONS
1 Game Theory Sequential bargaining and Repeated Games Univ. Prof.dr. M.C.W. Janssen University of Vienna Winter semester Week 46 (November 14-15)
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
Game theory & Linear Programming Steve Gu Mar 28, 2008.
Equilibrium transitions in stochastic evolutionary games Dresden, ECCS’07 Jacek Miękisz Institute of Applied Mathematics University of Warsaw.
Mixed Strategies and Repeated Games
February 2, 2016 Stochastic Games Mr Sujit P Gujar. e-Enterprise Lab Computer Science and Automation IISc, Bangalore.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
EC941 - Game Theory Prof. Francesco Squintani Lecture 6 1.
Markov Processes What is a Markov Process?
Markov Decision Processes AIMA: 17.1, 17.2 (excluding ), 17.3.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
ECO290E: Game Theory Lecture 10 Examples of Dynamic Games.
Discrete-time markov chain (continuation)
Making complex decisions
Dynamic Games of Complete Information
Vincent Conitzer CPS Repeated games Vincent Conitzer
Information Design: A unified Perspective
Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 10 Stochastic Game Zhu Han, Dusit Niyato, Walid Saad, and.
Equlibrium Selection in Stochastic Games
ECE700.07: Game Theory with Engineering Applications
Multiagent Systems Repeated Games © Manfred Huber 2018.
Vincent Conitzer Repeated games Vincent Conitzer
Definition of Renegotiation-safe
Collaboration in Repeated Games
Eilon Solan, Tel Aviv University
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

Center for the Study of Rationality Hebrew University of Jerusalem 1/39 n-Player Stochastic Games with Additive Transitions Frank Thuijsman János Flesch & Koos Vrieze Maastricht University European Journal of Operational Research 179 (2007) 483–497

Center for the Study of Rationality Hebrew University of Jerusalem 2/39 Outline Model Brief History of Stochastic Games Additive Transitions Examples

Center for the Study of Rationality Hebrew University of Jerusalem 3/39 a s = (a 1 s, a 2 s, , a n s ) joint action r s (a s ) = (r 1 s (a s ), r 2 s (a s ), , r n s (a s )) rewards p s (a s ) = (p s (1|a s ), p s (2|a s ), , p s (z|a s )) transitions Infinite horizon Complete Information Perfect Recall Independent and Simultaneous Choices Finite Stochastic Game 1sz rs(as)rs(as) ps(as)ps(as)

Center for the Study of Rationality Hebrew University of Jerusalem 4/39 3-Player Stochastic Game 0, 0, 0 1, 3, 00, 3, 13, 0, (1, 0, 0, 0) (2/3, 1/3, 0, 0) (2/3, 0, 1/3, 0) (2/3, 0, 0, 1/3) (1/3, 1/3, 0, 1/3) (1/3, 0, 1/3, 1/3) (1/3, 1/3, 1/3, 0) (0, 1/3, 1/3, 1/3) (0, 1, 0, 0) (0, 0, 1, 0)(0, 0, 0, 1) 0, 0, 0 T L F R B N 1 2 3

Center for the Study of Rationality Hebrew University of Jerusalem 5/39 general strategy   i : N × S × H → X i (k, s, h) → X i s Markov strategy f i : N × S → X i (k, s) → X i s stationary strategy x i : S → X i (s) → X i s opponents’ strategy    i, f  i and x  i Strategies mixed actions

Center for the Study of Rationality Hebrew University of Jerusalem 6/39  -Discounted rewards (with 0 <  < 1)  i  s (  ) = E s  ((1  )  k  k  1 R i k ) Limiting average rewards  i s (  ) = E s  (lim K →  K  1  K k=1 R i k ) Rewards

Center for the Study of Rationality Hebrew University of Jerusalem 7/39 MinMax Values  -Discounted minmax v i  s  inf   i sup  i  i  s (  ) Limiting average minmax v i s  inf   i sup  i  i s (  ) Highest rewards player i can defend

Center for the Study of Rationality Hebrew University of Jerusalem 8/39  Equilibrium  = (  i ) i  N is an  -equilibrium if  i s (  i,   i ) ≤  i s (  ) +  for all  i, for all i and for all s.

Center for the Study of Rationality Hebrew University of Jerusalem 9/39 Question (0, 0, 1, 0)(0, 1, 0, 0) (0, 0, 0, 1) (1, 0, 0, 0) (0, 0.5, 0.5, 0) , 0 3,  1 0, 0 Any  -equilibrium ? (0, 0, 0, 1) 4 2,1

Center for the Study of Rationality Hebrew University of Jerusalem 10/39 Highlights from (Finite) Stochastic Games History Shapley, sum, “discounted” Everett, sum, recursive, undiscounted Gillette, sum, big match problem

Center for the Study of Rationality Hebrew University of Jerusalem 11/39 Highlights from (Finite) Stochastic Games History Fink, 1964 & Takahashi, 1964 n-player, discounted Blackwell & Ferguson, sum, big match solution Liggett & Lippmann, sum, perfect inf., undiscounted

Center for the Study of Rationality Hebrew University of Jerusalem 12/39 Highlights from (Finite) Stochastic Games History Kohlberg, sum, absorbing, undiscounted Mertens & Neyman, sum, undiscounted Sorin, player, Paris Match, undiscounted

Center for the Study of Rationality Hebrew University of Jerusalem 13/39 Highlights from (Finite) Stochastic Games History Vrieze & Thuijsman, player, absorbing, undiscounted Thuijsman & Raghavan, 1997 n-player, perfect inf., undiscounted Flesch, Thuijsman, Vrieze, player, absorbing example, undiscounted

Center for the Study of Rationality Hebrew University of Jerusalem 14/39 Highlights from (Finite) Stochastic Games History Solan, player, absorbing, undiscounted Vieille, player, undiscounted Solan & Vieille, 2001 n-player, quitting, undiscounted

Center for the Study of Rationality Hebrew University of Jerusalem 15/39 Additive Transitions p s (a s ) =  n i=1 i s p i s (a i s ) p i s (a i s ) transition probabilities controlled by player i in state s i s transition power of player i in state s 0 ≤ i s ≤ 1 and  i i s = 1 for each s

Center for the Study of Rationality Hebrew University of Jerusalem 16/39 Example for 2-Player Additive Transitions p s (a s ) =  n i=1 i s p i s (a i s ) (1, 0, 0) (0.7, 0, 0.3)(0, 0.7, 0.3) (0.3, 0.7, 0) p 1 1 (2) = (0, 0, 1) p 1 1 (1) = (1, 0, 0) p 2 1 (1)=(1, 0, 0)p 2 1 (2)=(0, 1, 0) 1 1 = = 0.7 1

Center for the Study of Rationality Hebrew University of Jerusalem 17/39 Results 1.0-equilibria for n-player AT games (threats!) 2.0-opt. stationary strat. for 0-sum AT games 3.Stat.  -equilibria for 2-player abs. AT games 4.Result 3 can not be strengthened, neither to 3-player abs. AT games, nor to 2-player non-abs. AT games, nor to give stat. 0-equilibria

Center for the Study of Rationality Hebrew University of Jerusalem 18/39 The Essential Observation Additive Transitions induce a Complete Ordering of the Actions If a i s is “better” than b i s against some strategy, Then a i s is “better” than b i s against any strategy.

Center for the Study of Rationality Hebrew University of Jerusalem 19/39 “Better” Consider strategies a i s and b i s for player i If  for some strategy  a  i s we have  t  S p s (t | a i s, a  i s ) v i t ≥  t  S p s (t | b i s, a  i s ) v i t Then for all strategies b  i s we have  t  S p s (t | a i s, b  i s ) v i t ≥  t  S p s (t | b i s, b  i s ) v i t

Center for the Study of Rationality Hebrew University of Jerusalem 20/39 Because …. If  t  S p s (t | a i s, a  i s )v i t ≥  t  S p s (t | b i s, a  i s )v i t Then i s  t  S p s (t | a i s ) v i t +  j  i j s  t  S p s (t | a  j s ) v i t ≥ i s  t  S p s (t | b i s ) v i t +  j  i j s  t  S p s (t | a  j s ) v i t

Center for the Study of Rationality Hebrew University of Jerusalem 21/39 which implies that …. i s  t  S p s (t | a i s ) v i t +  j  i j s  t  S p s (t | b  j s ) v i t ≥ i s  t  S p s (t | b i s ) v i t +  j  i j s  t  S p s (t | b  j s ) v i t And therefore  t  S p s (t | a i s, b  i s )v i t ≥  t  S p s (t | b i s, b  i s )v i t

Center for the Study of Rationality Hebrew University of Jerusalem 22/39 “Best” The “best” actions for player i in state s are those that maximize the expression  t  S p s (t | a i s ) v i t

Center for the Study of Rationality Hebrew University of Jerusalem 23/39 The Restricted Game Let G be the original AT game and let G* be the restricted AT game, where each player is restricted to his “best” actions.

Center for the Study of Rationality Hebrew University of Jerusalem 24/39 The Restricted Game Now v * i ≥ v i for each player i. In G* :  t  S p s (t | a* s ) v* i t = v* i s  i, s, a* s In G :  t  S p s (t | b i s, a*  i s ) v i t < v i s  i, s,  a*  i s, b i s

Center for the Study of Rationality Hebrew University of Jerusalem 25/39 The Restricted Game If x* i yields at least v* i in G*, then x* i yields at least v* i in G as well.

Center for the Study of Rationality Hebrew University of Jerusalem 26/39 Ex. 1: 2-Player Absorbing AT Game (0, 0, 1)(0, 1, 0) (1, 0, 0)(0.5, 0.5, 0) (0, 0.5, 0.5) (0.5, 0, 0.5) , 0  1, 3  3, 1 (1, 0, 0) (0, 1, 0) (0, 0, 1) 0.5

Center for the Study of Rationality Hebrew University of Jerusalem 27/39 (0, 0, 1)(0, 1, 0) (1, 0, 0)(0.5, 0.5, 0) (0, 0.5, 0.5) (0.5, 0, 0.5) , 0  1, 3  3, 1 T, L B, L B, RT, R 0, 0 -1, 3 -2, 2-3, 1 T, L 0, 0 NO stationary 0-equilibrium Ex. 1: 2-Player Absorbing AT Game

Center for the Study of Rationality Hebrew University of Jerusalem 28/39 (0, 0, 1)(0, 1, 0) (1, 0, 0)(0.5, 0.5, 0) (0, 0.5, 0.5) (0.5, 0, 0.5) , 0 stationary  -equilibrium with  >0 1  1 0 equilibrium rewards ≈ ((  1   1, 3  3, 1 Ex. 1: 2-Player Absorbing AT Game

Center for the Study of Rationality Hebrew University of Jerusalem 29/39 (0, 0, 1)(0, 1, 0) (1, 0, 0)(0.5, 0.5, 0) (0, 0.5, 0.5) (0.5, 0, 0.5) , 0  1, 3  3, 1 non-stationary  -equilibrium Player 1: B, T, T, T, …. Player 2: R, R, R, R, …. equilibrium rewards ((  2  2  3, 1  1, 3  Ex. 1: 2-Player Absorbing AT Game

Center for the Study of Rationality Hebrew University of Jerusalem 30/39 Ex. 2: 2-Player Non-Absorbing AT Game (0, 0, 1, 0)(0, 1, 0, 0) (0, 0, 0, 1) (1, 0, 0, 0) (0, 0.5, 0.5, 0) , 0 3,  1 0, 0 q > 0 p <  q 1  q 1  p p q = 0 p > 1   q > 0 NO stationary  -equilibrium with  >0 (0, 0, 0, 1) 4 2,1

Center for the Study of Rationality Hebrew University of Jerusalem 31/39 non-stationary  -equilibrium Player 1: T, B, B, B, …. Player 2: R, R, R, R, …. (0, 0, 1, 0)(0, 1, 0, 0) (0, 0, 0, 1) (1, 0, 0, 0) (0, 0.5, 0.5, 0) , 0 3,  1 0, 0 (0, 0, 0, 1) 4 2,1 Ex. 2: 2-Player Non-Absorbing AT Game equilibrium rewards ((2, 1), ( 

Center for the Study of Rationality Hebrew University of Jerusalem 32/39 alternative  -equilibrium Player 1: T, T, T, T, …. Player 2: L, R, R, R, …. (0, 0, 1, 0)(0, 1, 0, 0) (0, 0, 0, 1) (1, 0, 0, 0) (0, 0.5, 0.5, 0) , 0 3,  1 0, 0 (0, 0, 0, 1) 4 2,1 Ex. 2: 2-Player Non-Absorbing AT Game equilibrium rewards ((2, 1), ( 

Center for the Study of Rationality Hebrew University of Jerusalem 33/39 Ex. 3: 3-Player Absorbing AT Game 0, 0, 0 1, 3, 00, 1, 33, 0, (1, 0, 0, 0) (2/3, 1/3, 0, 0) (2/3, 0, 1/3, 0) (2/3, 0, 0, 1/3) (1/3, 1/3, 0, 1/3) (1/3, 0, 1/3, 1/3) (1/3, 1/3, 1/3, 0) (0, 1/3, 1/3, 1/3) (0, 1, 0, 0) (0, 0, 1, 0)(0, 0, 0, 1) 0, 0, 0 T L F R B N How to share 4 among three people if only few solutions are allowed? 1 2 3

Center for the Study of Rationality Hebrew University of Jerusalem 34/39 0, 0, 0 1, 3, 0 0, 1, 3 3, 0, 1 1, 3, 00, 1, 33, 0, (1, 0, 0, 0) (2/3, 1/3, 0, 0) (2/3, 0, 1/3, 0) (2/3, 0, 0, 1/3) (1/3, 1/3, 0, 1/3) (1/3, 0, 1/3, 1/3) (1/3, 1/3, 1/3, 0) (0, 1/3, 1/3, 1/3) (0, 1, 0, 0) (0, 0, 1, 0)(0, 0, 0, 1) 1/2, 2, 3/2 2, 3/2, 1/2 3/2, 1/2, 2 4/3, 4/3, 4/3 T L F R B N Ex. 3: 3-Player Absorbing AT Game

Center for the Study of Rationality Hebrew University of Jerusalem 35/39 0, 0, 0 1, 3, 0 0, 1, 3 3, 0, 1 1/3 * 2/3 * 1 * 1/2, 2, 3/2 2, 3/2, 1/2 3/2, 1/2, 2 4/3, 4/3, 4/3 T L F R B N NO stationary  -equilibrium Ex. 3: 3-Player Absorbing AT Game 1, 3, 0 0, 1, 3 3, 0, 1

Center for the Study of Rationality Hebrew University of Jerusalem 36/39 0, 0, 0 1, 3, 0 0, 1, 3 3, 0, 1 1/3 * 2/3 * 1 * 1/2, 2, 3/2 2, 3/2, 1/2 3/2, 1/2, 2 4/3, 4/3, 4/3 T L F R B N Player 1 on B: 1, ¾, 0, 0, 0, 0, 1, ¾, 0, 0, 0, 0, …. Player 2 on R: 0, 0, 1, ¾, 0, 0, 0, 0, 1, ¾, 0, 0, …. Player 3 on F: 0, 0, 0, 0, 1, ¾, 0, 0, 0, 0, 1, ¾, …. non-stationary  -equilibrium Ex. 3: 3-Player Absorbing AT Game

Center for the Study of Rationality Hebrew University of Jerusalem 37/39 0, 0, 0 1, 3, 0 0, 1, 3 3, 0, 1 1/3 * 2/3 * 1 * 1/2, 2, 3/2 2, 3/2, 1/2 3/2, 1/2, 2 4/3, 4/3, 4/3 T L F R B N equilibrium rewards (1, 2, 1) Ex. 3: 3-Player Absorbing AT Game Player 1 on B: 1, ¾, 0, 0, 0, 0, 1, ¾, 0, 0, 0, 0, …. Player 2 on R: 0, 0, 1, ¾, 0, 0, 0, 0, 1, ¾, 0, 0, …. Player 3 on F: 0, 0, 0, 0, 1, ¾, 0, 0, 0, 0, 1, ¾, ….

Center for the Study of Rationality Hebrew University of Jerusalem 38/39 Results 1.0-equilibria for n-player AT games (threats!) 2.0-opt. stationary strat. for 0-sum AT games 3.Stat.  -equilibria for 2-player abs. AT games 4.Result 3 can not be strengthened, neither to 3-player abs. AT games, nor to 2-player non-abs. AT games, nor to give stat. 0-equilibria

Center for the Study of Rationality Hebrew University of Jerusalem 39/39 ?

Center for the Study of Rationality Hebrew University of Jerusalem 40/39 GAME VER

Center for the Study of Rationality Hebrew University of Jerusalem 41/39 GAME VER

Center for the Study of Rationality Hebrew University of Jerusalem 42/39 GAME VER

Center for the Study of Rationality Hebrew University of Jerusalem 43/39 GAME VER