Learning and teaching in games: Statistical models of human play in experiments Colin F. Camerer, Social Sciences Caltech Teck.

Slides:



Advertisements
Similar presentations
PowerPoint Slides by Robert F. BrookerCopyright (c) 2001 by Harcourt, Inc. All rights reserved. Strategic Behavior Game Theory –Players –Strategies –Payoff.
Advertisements

Nagel: Unraveling in Guessing Games: An Experimental Study Economics 328 Spring 2005.
Building Agents for the Lemonade Game Using a Cognitive Hierarchy Population Model Michael Wunder Michael Kaisers Michael Littman John Yaros.
Ultimatum Game Two players bargain (anonymously) to divide a fixed amount between them. P1 (proposer) offers a division of the “pie” P2 (responder) decides.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
3. Basic Topics in Game Theory. Strategic Behavior in Business and Econ Outline 3.1 What is a Game ? The elements of a Game The Rules of the.
1 On the Methodology of Inequity Aversion Theory.
Behavioral game theory* Colin F. Camerer, Caltech Behavioral game theory: –How people actually play games –Uses concepts from psychology.
Belief Learning in an Unstable Infinite Game Paul J. Healy CMU.
Games What is ‘Game Theory’? There are several tools and techniques used by applied modelers to generate testable hypotheses Modeling techniques widely.
GAME THEORY.
1 Duke PhD Summer Camp August 2007 Outline  Motivation  Mutual Consistency: CH Model  Noisy Best-Response: QRE Model  Instant Convergence: EWA Learning.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Convergent Learning in Unknown Graphical Games Dr Archie Chapman, Dr David Leslie, Dr Alex Rogers and Prof Nick Jennings School of Mathematics, University.
1 Teck H. Ho April 8, 2004 Outline  In-Class Experiment and Motivation  Adaptive Experience-Weighted Attraction (EWA) Learning in Games: Camerer and.
Complexity Results about Nash Equilibria
Communication Networks A Second Course Jean Walrand Department of EECS University of California at Berkeley.
B OUNDED R ATIONALITY in L ABORATORY B ARGAINING with A SSYMETRIC I NFORMATION Timothy N. Cason and Stanley S. Reynolds Economic Theory, 25, (2005)
1 Teck H. Ho Duke PhD Summer CampAugust 2007 Outline  Motivation  Mutual Consistency: CH Model  Noisy Best-Response: QRE Model  Instant Convergence:
Behavioral Game Theory: A Brief Introduction Networked Life CSE 112 Spring 2005 Prof. Michael Kearns Supplementary slides courtesy of Colin Camerer, CalTech.
1 Teck-Hua Ho CH Model March – June, 2003 A Cognitive Hierarchy Theory of One-Shot Games Teck H. Ho Haas School of Business University of California, Berkeley.
1 University of Michigan, Ann Arbor Teck H. Ho A Cognitive Hierarchy (CH) Model of Games Teck H. Ho Haas School of Business University of California, Berkeley.
XYZ 6/18/2015 MIT Brain and Cognitive Sciences Convergence Analysis of Reinforcement Learning Agents Srinivas Turaga th March, 2004.
1 Berkeley January 2004 Teck H. Ho A Parsimonious Model of Stock-keeping-Unit (SKU) Choice Teck H. Ho Haas School of Business UC, Berkeley Joint work with.
1 Teck-Hua Ho CH Model April 15, 2004 A Cognitive Hierarchy Model of One-Shot Games Teck H. Ho Haas School of Business University of California, Berkeley.
1 Teck-Hua Ho CH Model April 4, 2004 Outline  In-class Experiment and Examples  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and.
1 Teck H. Ho April 8, 2004 Outline  In-Class Experiment and Motivation  Adaptive Experience-Weighted Attraction (EWA) Learning in Games: Camerer and.
1 Duke Doctoral Summer Camp August, 2007 Behavioral Game Theory Teck H. Ho University of California, Berkeley.
Outline  Motivation and Modeling Philosophy  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and Chong, QJE, 2004)  Empirical Alternative.
1 Teck-Hua Ho CH Model April 4, 2004 Outline  In-class Experiment and Examples  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and.
Electoral agency in the lab Leif Helland & Lars Monkerud NSM BI.
Software Agents in Economic Environments Robert S. Gazzale Ph.D. Candidate, Department of Economics Jeffrey MacKie Mason Professor, Dept. of Economics.
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
Quantal Response Equilibrium APEC 8205: Applied Game Theory Fall 2007.
Outline  In-Class Experiment on Centipede Game  Test of Iterative Dominance Principle I: McKelvey and Palfrey (1992)  Test of Iterative Dominance Principle.
1 Teck H. Ho Spring 2006 A Cognitive Hierarchy (CH) Model of Games Teck H. Ho Haas School of Business University of California, Berkeley Joint work with.
1 Raising Revenue With Raffles: Evidence from a Laboratory Experiment Wooyoung Lim, University of Pittsburgh Alexander Matros, University of Pittsburgh.
Game Theoretic Analysis of Oligopoly lr L R 0000 L R 1 22 The Lane Selection Game Rational Play is indicated by the black arrows.
Outline  Motivation and Modeling Philosophy  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and Chong, QJE, 2004)  Empirical Alternative.
January 29, 2004 Experimental Economics 1 Outline  In-class experiment on IPV First-Price Auctions  Data from Cox, Robertson, and Smith (1982)  Glenn.
A Dynamic Level-k Model in Games Teck Ho and Xuanming Su UC Berkeley April 2011 Teck Hua Ho 1.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Strategic Game Theory for Managers. Explain What is the Game Theory Explain the Basic Elements of a Game Explain the Importance of Game Theory Explain.
Game Theory, Strategic Decision Making, and Behavioral Economics 11 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the.
History-Dependent Graphical Multiagent Models Quang Duong Michael P. Wellman Satinder Singh Computer Science and Engineering University of Michigan, USA.
Introduction: Thinking Like an Economist 1 CHAPTER 2 CHAPTER 12 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the tactics.
Game Theory The Prisoner’s Dilemma Game. “Strategic thinking is the art of outdoing an adversary, knowing that the adversary is trying to do the same.
Prepared by Robert F. Brooker, Ph.D. Copyright ©2004 by South-Western, a division of Thomson Learning. All rights reserved.Slide 1 Managerial Economics.
University of Cagliari, Faculty of Economics, Business Strategy and Policy A course within the II level degree in Managerial Economics year II,
Experimental Economics and Neuroeconomics. An Illustration: Rules.
McGraw-Hill/Irwin Copyright  2008 by The McGraw-Hill Companies, Inc. All rights reserved. GAME THEORY, STRATEGIC DECISION MAKING, AND BEHAVIORAL ECONOMICS.
Experimental Economics NSF short course David Laibson August 11, 2005.
Data Analysis Econ 176, Fall Populations When we run an experiment, we are always measuring an outcome, x. We say that an outcome belongs to some.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
C opyright  2007 by Oxford University Press, Inc. PowerPoint Slides Prepared by Robert F. Brooker, Ph.D.Slide 1.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.
Intermediate Microeconomics Game Theory. So far we have only studied situations that were not “strategic”. The optimal behavior of any given individual.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
Replicator Dynamics. Nash makes sense (arguably) if… -Uber-rational -Calculating.
Lec 23 Chapter 28 Game Theory.
Oligopoly and Game Theory Topic Students should be able to: Use simple game theory to illustrate the interdependence that exists in oligopolistic.
Experiments and “Rational” Behavior, 5/1/07. Beauty Contest Game Each person choose a number from 0 to 100. We will average these numbers. The person.
Game theory Chapter 28 and 29
Correlated equilibria, good and bad: an experimental study
GAME THEORY.
Ekonomi Manajerial dalam Perekonomian Global
Game theory Chapter 28 and 29
Behavioral game theory* Colin F. Camerer, Caltech
BEC 30325: MANAGERIAL ECONOMICS
Volume 24, Issue 18, Pages R867-R871 (September 2014)
Presentation transcript:

Learning and teaching in games: Statistical models of human play in experiments Colin F. Camerer, Social Sciences Caltech Teck Ho, Berkeley (Haas Business School) Kuan Chong, National Univ Singapore How can bounded rationality be modelled in games? Theory desiderata: Precise, general, useful (game theory), and cognitively plausible, empirically disciplined (cog sci) Three components: –Cognitive hierarchy thinking model (one parameter, creates initial conditions) –Learning model (EWA, fEWA) - Sophisticated teaching’ model (repeated games) - Sophisticated teaching’ model (repeated games) Shameless plug: Camerer, Behavioral Game Theory (Princeton, Feb ’03) or see website hss.caltech.edu/~camerer

Behavioral models use some game theory principles, and weaken other principles Principle equilibrium Thinking LearningTeaching concept of a game   strategic thinking   best response  mutual consistency  learning   strategic foresight  

(Typical) experimental economics methods Repeated matrix stage game (Markov w/ 1 state) Repeated with “one night stand” (“stranger”) rematching protocol & feedback (to allow learning without repeated-game reputation- building) Game is described abstractly, payoffs are public knowledge (e.g., read out loud) Subjects paid $ according to choices (~$12/hr) Why this style? Basic question is whether S’s can “compute” equiilibrium *, not meant to be realistic Establish regularity across S’s, different game structures Statistical fitting: Parsimonious (1+ parameters) models, fit (in sample) & predict (out of sample) & compute economic value * Question now answered (No): Would be useful to move to low- information MAL designs

Beauty contest game: Pick numbers [0,100] closest to (2/3)*(average number) wins

“Beauty contest” game (Ho, Camerer, Weigelt Amer Ec Rev 98): Pick numbers x i  [0,100] Closest to (2/3)*(average number) wins $20

Table: Data and estimates of  in pbc games (equilibrium = 0) datasteps of subjects/gamemeanstd devthinking game theorists Caltech newspaper portfolio mgrs econ PhD class Caltech g= high school /2 mean yr olds Germany CEOs game p= Caltech g= PCC g= game p= PCC g= mean1.56 median1.30

EWA learning Attraction A i j (t) for strategy j updated by A i j (t) =(  A i j (t-1) +  i [s i (t),s -i (t)]/ (  (1-  )+1) (chosen j) A i j (t) =(  A i j (t-1) +   i [s i j,s -i (t)]/ (  (1-  )+1) (unchosen j) logit response (softmax) P i j (t)=e^{A i j (t)}/[Σ k e^{A i k (t)}] key parameters:  imagination (weight on foregone payoffs)  decay (forgetting) or change-detection  growth rate of attractions (  =0  averages;  =1  cumulations;  =1  “lock-in” after exploration) “In nature a hybrid [species] is usually sterile, but in science the opposite is often true”-- Francis Crick ’88 Weighted fictitious play (  =1,  =0) Simple choice reinforcement (  =0)

Studies comparing EWA and other learning models

20 estimates of learning model parameters 20 estimates of learning model parameters

Functional EWA learning (“EWA Lite”) Use functions of experience to create parameter values (only free parameter )  i (t) is a change detector:  i (t) is a change detector:  i (t)=1-.5[  k ( s -i k (t) -   =1 t s s -i k (  )/t ) 2 ]  i (t)=1-.5[  k ( s -i k (t) -   =1 t s s -i k (  )/t ) 2 ] Compares average of past freq’s s -i (1), s -i (2)…with s -i (t)  Decay old experience (low  ) if change is detected  =1 when other players always repeat strategies  =1 when other players always repeat strategies  falls after a “surprise”  falls after a “surprise” falls more if others have been highly variable falls less if others have been consistent  =  /(  of Nash strategies) (creates low  in mixed games)  =  /(  of Nash strategies) (creates low  in mixed games)Questions: (now) Do functional values pick up differences across games? (Yes.) (later) Can function changes create sensible, rapid switching in stochastic games?

Example: Price matching with loyalty rewards (Capra, Goeree, Gomez, Holt AER ‘99) Players 1, 2 pick prices [80,200] ¢ Price is P=min(P 1,,P 2 ) Low price firm earns P+R High price firm earns P-R What happens? (e.g., R=50)

Teaching in repeated (partner) games Finitely-repeated trust game (Camerer & Weigelt Econometrica ‘88) borrower action repaydefault lenderloan40,60-100,150 no loan 10,10 1 borrower plays against 8 lenders A fraction (p(honest)) borrowers prefer to repay (controlled by experimenter)

Empirical results (conditional frequencies of no loan and default)

Teaching in repeated trust games (Camerer, Ho, Chong J Ec Theory 02) Some (  =89%) borrowers know lenders learn by fEWA Actions in t “teach” lenders what to expect in t+1  (=.93) is “peripheral vision” weight E.g. entering period 4 of sequence 17 Seq.period Repay Repay Repay Default..... Repay Repay Repay Default.....  look “peripherally” (  weight) look “peripherally” (  weight)  look back Repay No loan Repay Teaching: Strategies have reputations Bayesian-Nash equilibrium: Borrowers have reputations (types)

Heart of the model: Attraction of sophisticated Borrower strategy j after sequence k before period t J t+1 is possible sequence of choices by borrower First term is expected (myopic) payoff from strategy j Second term is summation of expected payoffs in the future (undiscounted) given effect of j and optimal planned future choices (J t+1 )

Empirical results (top) and teaching model (bottom)

Conclusions Learning ( response sensitivity) Hybrid fits & predicts well (20+ games) One-parameter fEWA fits well, easy to estimate Well-suited to Markov games because Φ means players can “relearn” if new state is quite different? Teaching (  fraction of teaching) Retains strategic foresight in repeated games with partner matching Fits trust, entry deterrence better than softmax Bayesian-Nash (aka QRE) Next? Field applications, explore low-information Markov domains…

Parametric EWA learning (E’metrica ‘99) free parameters , , , , N(0) Functional EWA learning functions for parameters parameter (  ) Strategic teaching (JEcTheory ‘02) Reputation-building w/o “types” Two parameters ( ,  ) Thinking steps (parameter  )