Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
An Introduction to Game Theory Part V: Extensive Games with Perfect Information Bernhard Nebel.
Advertisements

Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
BASICS OF GAME THEORY. Recap Decision Theory vs. Game Theory Rationality Completeness Transitivity What’s in a game? Players Actions Outcomes Preferences.
This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.
Non-Cooperative Game Theory To define a game, you need to know three things: –The set of players –The strategy sets of the players (i.e., the actions they.
1 Chapter 14 – Game Theory 14.1 Nash Equilibrium 14.2 Repeated Prisoners’ Dilemma 14.3 Sequential-Move Games and Strategic Moves.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Short introduction to game theory 1. 2  Decision Theory = Probability theory + Utility Theory (deals with chance) (deals with outcomes)  Fundamental.
Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.
CS 886: Agents and Game Theory Sept 15. Agenthood We use economic definition of agent as locus of self-interest –Could be implemented e.g. as several.
Extensive Games with Imperfect Information
Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
Sep. 5, 2013 Lirong Xia Introduction to Game Theory.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Complexity Results about Nash Equilibria
Harsanyi transformation Players have private information Each possibility is called a type. Nature chooses a type for each player. Probability distribution.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Static Games of Complete Information: Subgame Perfection
Extensive Game with Imperfect Information Part I: Strategy and Nash equilibrium.
Complexity of Mechanism Design Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University Computer Science Department.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
Alternating-Offers Bargaining under One-Sided Uncertainty on Deadlines Francesco Di Giunta and Nicola Gatti Dipartimento di Elettronica e Informazione.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Reading Osborne, Chapters 5, 6, 7.1., 7.2, 7.7 Learning outcomes
Chapter 9 Games with Imperfect Information Bayesian Games.
Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
EC941 - Game Theory Prof. Francesco Squintani Lecture 5 1.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 4.1.Dynamic Games of Incomplete Information Lecture
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
Dynamic Games & The Extensive Form
Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Regret Minimizing Equilibria of Games with Strict Type Uncertainty Stony Brook Conference on Game Theory Nathanaël Hyafil and Craig Boutilier Department.
Extensive Games with Imperfect Information
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
Automated Mechanism Design Tuomas Sandholm Presented by Dimitri Mostinski November 17, 2004.
ECO290E: Game Theory Lecture 13 Dynamic Games of Incomplete Information.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Incomplete information: Perfect Bayesian equilibrium
Intermediate Microeconomics Game Theory. So far we have only studied situations that were not “strategic”. The optimal behavior of any given individual.
Zero-sum Games The Essentials of a Game Extensive Game Matrix Game Dominant Strategies Prudent Strategies Solving the Zero-sum Game The Minimax Theorem.
EC941 - Game Theory Prof. Francesco Squintani Lecture 6 1.
Extensive Form (Dynamic) Games With Perfect Information (Theory)
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
Simultaneous Move Games: Discrete Strategies
Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Communication Complexity as a Lower Bound for Learning in Games
Extensive-form games and how to solve them
Econ 805 Advanced Micro Theory 1
Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.
Multiagent Systems Extensive Form Games © Manfred Huber 2018.
Multiagent Systems Game Theory © Manfred Huber 2018.
Game Theory Chapter 12.
Learning 6.2 Game Theory.
CPS Extensive-form games
Multiagent Systems Repeated Games © Manfred Huber 2018.
Vincent Conitzer Extensive-form games Vincent Conitzer
CPS 173 Extensive-form games
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Normal Form (Matrix) Games
Presentation transcript:

Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University

The heart of the problem In a 1-agent setting, agent’s expected utility maximizing strategy is well-defined But in a multiagent system, the outcome may depend on others’ strategies also

Terminology Agent = player Action = move = choice that agent can make at a point in the game Strategy s i = mapping from history (to the extent that the agent i can distinguish) to actions Strategy set S i = strategies available to the agent Strategy profile (s 1, s 2,..., s |A| ) = one strategy for each agent Agent’s utility is determined after each agent (including nature that is used to model uncertainty) has chosen its strategy, and game has been played: u i = u i (s 1, s 2,..., s |A| )

Game representations Extensive form player 1 1, 2 3, 4 player 2 Up Down Left Right 5, 6 7, 8 player 2 Left Right Matrix form (aka normal form aka strategic form) player 1’s strategy player 2’s strategy 1, 2 Up Down Left, Left Left, Right 3, 4 5, 67, 8 Right, Left Right, Right 3, 41, 2 5, 67, 8 Potential combinatorial explosion

Dominant strategy “equilibrium” Best response s i *: for all s i ’, u i (s i *,s -i ) ≥ u i (s i ’,s -i ) Dominant strategy s i *: s i * is a best response for all s -i –Does not always exist –Inferior strategies are called “dominated” Dominant strategy equilibrium is a strategy profile where each agent has picked its dominant strategy –Does not always exist –Requires no counterspeculation cooperate defect 3, 30, 5 5, 0 1, 1 Pareto optimal? Social welfare maximizing?

Nash equilibrium [Nash50] Sometimes an agent’s best response depends on others’ strategies: a dominant strategy does not exist A strategy profile is a Nash equilibrium if no player has incentive to deviate from his strategy given that others do not deviate: for every agent i, u i (s i *,s -i ) ≥ u i (s i ’,s -i ) for all s i ’ –Dominant strategy equilibria are Nash equilibria but not vice versa –Defect-defect is the only Nash eq. in Prisoner’s Dilemma –Battle of the Sexes game Has no dominant strategy equilibria

Criticisms of Nash equilibrium Not unique in all games, e.g. Battle of the Sexes –Approaches for addressing this problem Refinements (=strengthenings) of the equilibrium concept –Eliminate weakly dominated strategies first –Choose the Nash equilibrium with highest welfare –Subgame perfection … Focal points Mediation Communication Convention Learning Does not exist in all games

Existence of (pure strategy) Nash equilibria Thrm. –Any finite game, –where each action node is alone in its information set (i.e. at every point in the game, the agent whose turn it is to move knows what moves have been played so far) –is dominance solvable by backward induction (at least as long as ties are ruled out) Constructive proof: Multi-player minimax search

Rock-scissors-paper game Sequential moves

Rock-scissors-paper game Simultaneous moves

Mixed strategy Nash equilibrium move of agent 1 move of agent 2 rock scissors paper 0, 0 1, -1 -1, 1 Symmetric mixed strategy Nash eq: Each player plays each pure strategy with probability 1/3 Mixed strategy = agent’s chosen probability distribution over pure strategies from its strategy set Each agent has a best response strategy and beliefs (consistent with each other) In mixed strategy equilibrium, each strategy that occurs in the mix of agent i has equal expected utility to i Information set (the mover does not know which node of the set she is in)

Existence & complexity of mixed strategy Nash equilibria Every finite player, finite strategy game has at least one Nash equilibrium if we admit mixed strategy equilibria as well as pure [Nash 50] –(Proof is based on Kakutani’s fix point theorem) May be hard to compute –Complexity of finding a Nash equilibrium in a normal form game: 2-player 0-sum games can be solved in polytime with LP 2-player games are PPAD-complete (even with 0/1 payoffs) [Chen, Deng & Teng JACM-09; Abbott, Kane & Valiant FOCS-05; Daskalakis, Goldberg & Papadimitriou STOC-06], and NP-complete to find an even approximately good Nash equilibrium [Conitzer & Sandholm GEB-08] 3-player games are FIXP-complete [Etessami & Yannakakis FOCS-07]

Ultimatum game (for distributional bargaining)

Subgame perfect equilibrium [Selten 72] & credible threats Proper subgame = subtree (of the game tree) whose root is alone in its information set Subgame perfect equilibrium = strategy profile that is in Nash equilibrium in every proper subgame (including the root), whether or not that subgame is reached along the equilibrium path of play E.g. Cuban missile crisis Pure strategy Nash equilibria: (Arm,Fold), (Retract,Nuke) Pure strategy subgame perfect equilibria: (Arm,Fold) Conclusion: Kennedy’s Nuke threat was not credible Khrushchev Kennedy Arm Retract Fold Nuke -1, , , -10

Ultimatum game, again

Thoughts on credible threats Could use software as a commitment device –If one can credibly convince others that one cannot change one’s software agent, then revealing the agent’s code acts as a credible commitment to one’s strategy –E.g. nuke in the missile crisis –E.g. accept no less than 60% as the second mover in the ultimatum game Restricting one’s strategy set can increase one’s utility –This cannot occur in single agent settings Social welfare can increase or decrease

Solution concepts Ex post equilibrium = Nash equilibrium for all priors There are other equilibrium refinements too (see, e.g., following slides & wikipedia) Nash eq Dominant strategy eq Strength against collusion Subgame perfect equilibrium Perfect Bayesian equilibrium Bayes-Nash eq Sequential equilibrium Strong Nash eq Coalition-Proof Nash eq

Definition of a Bayesian game [Harsanyi 67/68] N is the set of players. Ω is the set of the states of nature. –For instance, in a card game, it can be any order of the cards. A i is the set of actions for player i. A = A 1 × A 2 × … × A n T i is the type set of player i. For each state of nature, the game will have different types of players (one type per player). –For instance, in a car selling game, it will be how much the player values the car C i A i × T i defines the available actions for player i of some type in T i. u: Ω × A → R is the payoff function for player i. p i is the probability distribution over Ω for each player i, i.e., each player has different views of the probability distribution over the states of nature. –In the game, they may never know the exact state of nature.

Solution concepts for Bayesian games A (Bayesian) Nash equilibrium is a strategy profile and beliefs specified for each player about the types of the other players that maximizes the expected utility for each player given their beliefs about the other players' types and given the strategies played by the other players Perfect Bayesian equilibrium (PBE) –Players place beliefs on nodes occurring in their information sets –A belief system is consistent for a given strategy profile if the probability assigned by the system to every node is computed as the probability of that node being reached given the strategy profile, i.e., by Bayes’ rule –A strategy profile is sequentially rational at a particular information set for a particular belief system if the expected utility of the player whose information set it is is maximal given the strategies played by the other players A strategy profile is sequentially rational for a particular belief system if it satisfies the above for every information set –A PBE is a strategy profile and a belief system such that the strategies are sequentially rational given the belief system and the belief system is consistent, wherever possible, given the strategy profile 'wherever possible' clause is necessary: some information sets might be reached with zero probability given the strategy profile; hence Bayes' rule cannot be employed to calculate the probability of nodes in those sets. Such information sets are said to be off the equilibrium path and any beliefs can be assigned to them Sequential equilibrium [Kreps and Wilson 82]. Refinement of PBE that specifies constraints on beliefs in such zero-probability information sets. Strategies and beliefs must be a limit point of a sequence of totally mixed strategy profiles and associated sensible (in PBE sense) beliefs Extensive-form trembling hand perfect equilibrium [Selten 75]. Require every move at every information set to be taken with non-zero probability. Take limit as tremble probability →0 Extensive-form proper equilibrium [Myerson 78]. Idea: Costly trembles much less likely. At any information set, for any two actions A and B, if the mover’s utility from B is less than from A, then prob(B) ≤ ε prob(A). Take limit as ε→0 More refined

Solution concepts for Bayesian games … Extensive-form perfect / proper equilibrium can involve playing weakly dominated strategies => argument for other solution concepts: Normal-form perfect equilibrium –Normal- and extensive-form perfect equilibria are incomparable –A normal-form perfect equilibrium of an extensive-form game may or may not be sequential (and might not even be subgame perfect) Quasi-perfect equilibrium [van Damme 84] –Informally, a player takes observed as well as potential future mistakes of his opponents into account but assumes that he himself will not make a mistake in the future, even if he observes that he has done so in the past –Incomparable to extensive-form perfect / proper Normal-form proper equilibrium –Always sequential –For 0-sum games, provides a strategy that maximizes the conditional utility (among minmax strategies), conditioned on the opponent making a mistake. (Here a mistake is defined as a pure strategy that doesn’t achieve the value of the game against all minmax strategies.) More refined