Risk attitudes, normal-form games, dominance, iterated dominance Vincent Conitzer

Slides:

Advertisements

Similar presentations

CPS Utility theory Vincent Conitzer

Advertisements

Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.

Strongly based on Slides by Vincent Conitzer of Duke

1 Game Theory. 2 Definitions Game theory -- formal way to analyze interactions among a group of rational agents behaving strategically Agents – players.

1 1 Slide © 2009 South-Western, a part of Cengage Learning Slides by John Loucks St. Edward’s University.

1 Chapter 4: Minimax Equilibrium in Zero Sum Game SCIT1003 Chapter 4: Minimax Equilibrium in Zero Sum Game Prof. Tsang.

MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)

Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.

Economics 202: Intermediate Microeconomic Theory 1.HW #6 on website. Due Thursday. 2.No new reading for Thursday, should be done with Ch 8, up to page.

An Introduction to Game Theory Part I: Strategic Games

Vincent Conitzer CPS Normal-form games Vincent Conitzer

Nash Equilibrium: Theory. Strategic or Simultaneous-move Games Definition: A simultaneous-move game consists of: A set of players For each player, a set.

by Vincent Conitzer of Duke

Lecture 4 on Individual Optimization Risk Aversion

1 Utility Examples Scott Matthews Courses: /

1 Utility Examples Scott Matthews Courses: /

Complexity Results about Nash Equilibria

This Wednesday Dan Bryce will be hosting Mike Goodrich from BYU. Mike is going to give a talk during :30 to 12:20 in Main 117 on his work on UAVs.

Review: Game theory Dominant strategy Nash equilibrium

1 Section 2d Game theory Game theory is a way of thinking about situations where there is interaction between individuals or institutions. The parties.

QR 38, 2/22/07 Strategic form: dominant strategies I.Strategic form II.Finding Nash equilibria III.Strategic form games in IR.

Today: Some classic games in game theory

Games of Incomplete Information. These games drop the assumption that players know each other’s preferences. Complete info: players know each other’s preferences.

Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer

Vincent Conitzer CPS Game Theory Vincent Conitzer

CPS 170: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

CPS 270: Artificial Intelligence Decision theory Instructor: Vincent Conitzer.

Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class

Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.

THE “CLASSIC” 2 x 2 SIMULTANEOUS CHOICE GAMES Topic #4.

The Science of Networks 6.1 Today’s topics Game Theory Normal-form games Dominating strategies Nash equilibria Acknowledgements Vincent Conitzer, Michael.

CPS 270: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.

CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.

1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.

Strategic Behavior in Business and Econ Static Games of complete information: Dominant Strategies and Nash Equilibrium in pure and mixed strategies.

CPS 570: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

CPS Utility theory, normal-form games

Game tree search Thanks to Andrew Moore and Faheim Bacchus for slides!

Vincent Conitzer CPS Learning in games Vincent Conitzer

Games of pure conflict two-person constant sum games.

1 CSC 384 Lecture Slides (c) , C. Boutilier and P. Poupart CSC384: Lecture 11  Last time Costs, Heuristics, LCFS, BeFS, A*, IDS  Today game.

By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.

Games, Strategies, and Decision Making By Joseph Harrington, Jr. First Edition Chapter 4: Stable Play: Nash Equilibria in Discrete Games with Two or Three.

Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.

1 Fahiem Bacchus, University of Toronto CSC384: Intro to Artificial Intelligence Game Tree Search I ● Readings ■ Chapter 6.1, 6.2, 6.3, 6.6 ● Announcements:

Vincent Conitzer CPS Utility theory Vincent Conitzer

Utilities and Decision Theory

Lecture 6: Other Game Models and Solution Concepts

CPS 570: Artificial Intelligence Game Theory

CPS 570: Artificial Intelligence Decision theory

Communication Complexity as a Lower Bound for Learning in Games

Vincent Conitzer CPS Repeated games Vincent Conitzer

Vincent Conitzer Normal-form games Vincent Conitzer

Vincent Conitzer CPS 173 Utility theory Vincent Conitzer

Instructor: Vincent Conitzer

Vincent Conitzer Utility theory Vincent Conitzer

Vincent Conitzer Repeated games Vincent Conitzer

Lecture 20 Linear Program Duality

Vincent Conitzer Extensive-form games Vincent Conitzer

Vincent Conitzer Utility theory Vincent Conitzer

Artificial Intelligence Decision theory

Instructor: Vincent Conitzer

Utilities and Decision Theory

Utilities and Decision Theory

Game theory review for agent design tutorial

Vincent Conitzer CPS Repeated games Vincent Conitzer

Presentation transcript:

Risk attitudes, normal-form games, dominance, iterated dominance Vincent Conitzer

Risk attitudes Which would you prefer? –A lottery ticket that pays out $10 with probability.5 and $0 otherwise, or –A lottery ticket that pays out $3 with probability 1 How about: –A lottery ticket that pays out $100,000,000 with probability.5 and $0 otherwise, or –A lottery ticket that pays out $30,000,000 with probability 1 Usually, people do not simply go by expected value An agent is risk-neutral if she only cares about the expected value of the lottery ticket An agent is risk-averse if she always prefers the expected value of the lottery ticket to the lottery ticket –Most people are like this An agent is risk-seeking if she always prefers the lottery ticket to the expected value of the lottery ticket

Decreasing marginal utility Typically, at some point, having an extra dollar does not make people much happier (decreasing marginal utility) utility money $200$1500$5000 buy a bike (utility = 1) buy a car (utility = 2) buy a nicer car (utility = 3)

Maximizing expected utility Lottery 1: get $1500 with probability 1 –gives expected utility 2 Lottery 2: get $5000 with probability.4, $200 otherwise –gives expected utility.4*3 +.6*1 = 1.8 –(expected amount of money =.4*$ *$200 = $2120 > $1500) So: maximizing expected utility is consistent with risk aversion utility money $200$1500$5000 buy a bike (utility = 1) buy a car (utility = 2) buy a nicer car (utility = 3)

Different possible risk attitudes under expected utility maximization utility money Green has decreasing marginal utility → risk-averse Blue has constant marginal utility → risk-neutral Red has increasing marginal utility → risk-seeking Grey’s marginal utility is sometimes increasing, sometimes decreasing → neither risk-averse (everywhere) nor risk-seeking (everywhere)

What is utility, anyway? Function u: O →  (O is the set of “outcomes” that lotteries randomize over) What are its units? –It doesn’t really matter –If you replace your utility function by u’(o) = a + bu(o), your behavior will be unchanged Why would you want to maximize expected utility? For two lottery tickets L and L’, let pL + (1-p)L’ be the “compound” lottery ticket where you get lottery ticket L with probability p, and L’ with probability 1-p L ≥ L’ means that L is (weakly) preferred to L’ –(≥ should be complete, transitive) Expected utility theorem. Suppose –(continuity axiom) for all L, L’, L’’, {p: pL + (1-p)L’ ≥ L’’} and {p: pL + (1- p)L’ ≤ L’’} are closed sets, –(independence axiom – more controversial) for all L, L’, L’’, p, we have L ≥ L’ if and only if pL + (1-p)L’’ ≥ pL’ + (1-p)L’’ then there exists a function u: O →  so that L ≥ L’ if and only if L gives a higher expected value of u than L’

Normal-form games

Rock-paper-scissors 0, 0-1, 11, -1 0, 0-1, 1 1, -10, 0 Row player aka. player 1 chooses a row Column player aka. player 2 (simultaneously) chooses a column A row or column is called an action or (pure) strategy Row player’s utility is always listed first, column player’s second Zero-sum game: the utilities in each entry sum to 0 (or a constant) Three-player game would be a 3D table with 3 utilities per entry, etc.

“Chicken” 0, 0-1, 1 1, -1-5, -5 D S DS S D D S Two players drive cars towards each other If one player goes straight, that player wins If both go straight, they both die not zero-sum

Rock-paper-scissors – Seinfeld variant 0, 01, -1 -1, 10, 0-1, 1 1, -10, 0 MICKEY: All right, rock beats paper! (Mickey smacks Kramer's hand for losing) KRAMER: I thought paper covered rock. MICKEY: Nah, rock flies right through paper. KRAMER: What beats rock? MICKEY: (looks at hand) Nothing beats rock.

Dominance Player i’s strategy s i strictly dominates s i ’ if –for any s -i, u i (s i, s -i ) > u i (s i ’, s -i ) s i weakly dominates s i ’ if –for any s -i, u i (s i, s -i ) ≥ u i (s i ’, s -i ); and –for some s -i, u i (s i, s -i ) > u i (s i ’, s -i ) 0, 01, -1 -1, 10, 0-1, 1 1, -10, 0 strict dominance weak dominance -i = “the player(s) other than i”

Prisoner’s Dilemma -2, -20, -3 -3, 0-1, -1 confess Pair of criminals has been caught District attorney has evidence to convict them of a minor crime (1 year in jail); knows that they committed a major crime together (3 years in jail) but cannot prove it Offers them a deal: –If both confess to the major crime, they each get a 1 year reduction –If only one confesses, that one gets 3 years reduction don’t confess confess

“Should I buy an SUV?” -10, -10-7, , -7-8, -8 cost: 5 cost: 3 cost: 5 cost: 8cost: 2 purchasing cost accident cost

Mixed strategies Mixed strategy for player i = probability distribution over player i’s (pure) strategies E.g. 1/3, 1/3, 1/3 Example of dominance by a mixed strategy: 3, 00, 0 3, 0 1, 0 1/2

Checking for dominance by mixed strategies Linear program for checking whether strategy s i * is strictly dominated by a mixed strategy: normalize to positive payoffs first, then solve: minimize Σ s i p s i such that: for any s -i, Σ s i p s i u i (s i, s -i ) ≥ u i (s i *, s -i ) Linear program for checking whether strategy s i * is weakly dominated by a mixed strategy: maximize Σ s -i (Σ s i p s i u i (s i, s -i )) – u i (s i *, s -i ) such that: –for any s -i, Σ s i p s i u i (s i, s -i ) ≥ u i (s i *, s -i ) –Σ s i p s i = 1 Note: linear programs can be solved in polynomial time

Iterated dominance Iterated dominance: remove (strictly/weakly) dominated strategy, repeat Iterated strict dominance on Seinfeld’s RPS: 0, 01, -1 -1, 10, 0-1, 1 1, -10, 0 1, -1 -1, 10, 0

Iterated dominance: path (in)dependence 0, 10, 0 1, 0 0, 00, 1 Iterated weak dominance is path-dependent: sequence of eliminations may determine which solution we get (if any) (whether or not dominance by mixed strategies allowed) 0, 10, 0 1, 0 0, 00, 1 0, 0 1, 0 0, 00, 1 Iterated strict dominance is path-independent: elimination process will always terminate at the same point (whether or not dominance by mixed strategies allowed)

Two computational questions for iterated dominance 1. Can a given strategy be eliminated using iterated dominance? 2. Is there some path of elimination by iterated dominance such that only one strategy per player remains? For strict dominance (with or without dominance by mixed strategies), both can be solved in polynomial time due to path-independence: –Check if any strategy is dominated, remove it, repeat For weak dominance, both questions are NP-hard (even when all utilities are 0 or 1), with or without dominance by mixed strategies [Conitzer, Sandholm 05] –Weaker version proved by [Gilboa, Kalai, Zemel 93]