1 Teck H. Ho April 8, 2004 Outline  In-Class Experiment and Motivation  Adaptive Experience-Weighted Attraction (EWA) Learning in Games: Camerer and.

Slides:



Advertisements
Similar presentations
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture
Advertisements

Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure John B. Van Huyck, Raymond C. Battalio, Richard O. Beil The American Economic.
Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.
Game Theory and Computer Networks: a useful combination? Christos Samaras, COMNET Group, DUTH.
Calibrated Learning and Correlated Equilibrium By: Dean Foster and Rakesh Vohra Presented by: Jason Sorensen.
Belief Learning in an Unstable Infinite Game Paul J. Healy CMU.
Game Theory 1. Game Theory and Mechanism Design Game theory to analyze strategic behavior: Given a strategic environment (a “game”), and an assumption.
Learning in games Vincent Conitzer
Learning and teaching in games: Statistical models of human play in experiments Colin F. Camerer, Social Sciences Caltech Teck.
1 Duke PhD Summer Camp August 2007 Outline  Motivation  Mutual Consistency: CH Model  Noisy Best-Response: QRE Model  Instant Convergence: EWA Learning.
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Convergent Learning in Unknown Graphical Games Dr Archie Chapman, Dr David Leslie, Dr Alex Rogers and Prof Nick Jennings School of Mathematics, University.
Rational Learning Leads to Nash Equilibrium Ehud Kalai and Ehud Lehrer Econometrica, Vol. 61 No. 5 (Sep 1993), Presented by Vincent Mak
Minority Games A Complex Systems Project. Going to a concert… But which night to pick? Friday or Saturday? You want to go on the night with the least.
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
Communication Networks A Second Course Jean Walrand Department of EECS University of California at Berkeley.
1 Teck H. Ho Duke PhD Summer CampAugust 2007 Outline  Motivation  Mutual Consistency: CH Model  Noisy Best-Response: QRE Model  Instant Convergence:
Outline  In-Class Experiment on a Coordination Game  Test of Equilibrium Selection I :Van Huyck, Battalio, and Beil (1990)  Test of Equilibrium Selection.
1 Teck-Hua Ho CH Model March – June, 2003 A Cognitive Hierarchy Theory of One-Shot Games Teck H. Ho Haas School of Business University of California, Berkeley.
6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.
Outline  In-Class Experiment on Centipede Game  Test of Iterative Dominance Principle I: McKelvey and Palfrey (1992)  Test of Iterative Dominance Principle.
Reinforcement Learning Rafy Michaeli Assaf Naor Supervisor: Yaakov Engel Visit project’s home page at: FOR.
Outline  In-Class Experiment on a Coordination Game  Test of Equilibrium Selection I :Van Huyck, Battalio, and Beil (1990)  Test of Equilibrium Selection.
Outline  In-Class Experiment on a Coordination Game  Test of Equilibrium Selection I :Van Huyck, Battalio, and Beil (1990)  Test of Equilibrium Selection.
Learning in Games. Fictitious Play Notation! For n Players we have: n Finite Player’s Strategies Spaces S 1, S 2, …, S n n Opponent’s Strategies Spaces.
XYZ 6/18/2015 MIT Brain and Cognitive Sciences Convergence Analysis of Reinforcement Learning Agents Srinivas Turaga th March, 2004.
1 Berkeley January 2004 Teck H. Ho A Parsimonious Model of Stock-keeping-Unit (SKU) Choice Teck H. Ho Haas School of Business UC, Berkeley Joint work with.
1 Teck-Hua Ho CH Model April 15, 2004 A Cognitive Hierarchy Model of One-Shot Games Teck H. Ho Haas School of Business University of California, Berkeley.
1 Teck-Hua Ho CH Model April 4, 2004 Outline  In-class Experiment and Examples  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and.
1 Teck H. Ho April 8, 2004 Outline  In-Class Experiment and Motivation  Adaptive Experience-Weighted Attraction (EWA) Learning in Games: Camerer and.
Social Learning. A Guessing Game Why are Wolfgang Puck restaurants so crowded? Why do employers turn down promising job candidates on the basis of rejections.
1 Duke Doctoral Summer Camp August, 2007 Behavioral Game Theory Teck H. Ho University of California, Berkeley.
Outline  Motivation and Modeling Philosophy  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and Chong, QJE, 2004)  Empirical Alternative.
Rationality and information in games Jürgen Jost TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A A A AAA Max Planck.
1 Teck-Hua Ho CH Model April 4, 2004 Outline  In-class Experiment and Examples  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and.
Static Games of Complete Information: Subgame Perfection
Quantal Response Equilibrium APEC 8205: Applied Game Theory Fall 2007.
Outline  In-Class Experiment on Centipede Game  Test of Iterative Dominance Principle I: McKelvey and Palfrey (1992)  Test of Iterative Dominance Principle.
Extensive Game with Imperfect Information Part I: Strategy and Nash equilibrium.
1 Teck H. Ho Spring 2006 A Cognitive Hierarchy (CH) Model of Games Teck H. Ho Haas School of Business University of California, Berkeley Joint work with.
Simulating Normal Random Variables Simulation can provide a great deal of information about the behavior of a random variable.
The Affects on Expected Market Outcomes when People Look for Information Where There is None Drew Marticorena Ashleigh Edwards.
Outline  Motivation and Modeling Philosophy  Empirical Alternative I: Model of Cognitive Hierarchy (Camerer, Ho, and Chong, QJE, 2004)  Empirical Alternative.
EC941 - Game Theory Francesco Squintani Lecture 3 1.
A Dynamic Level-k Model in Games Teck Ho and Xuanming Su UC Berkeley April 2011 Teck Hua Ho 1.
1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory.
1 Economics & Evolution. 2 Cournot Game 2 players Each chooses quantity q i ≥ 0 Player i’s payoff is: q i (1- q i –q j ) Inverse demand (price) No cost.
CPS Learning in games Vincent Conitzer
Chapter 9 Games with Imperfect Information Bayesian Games.
1 A unified approach to comparative statics puzzles in experiments Armin Schmutzler University of Zurich, CEPR, ENCORE.
Game Theory The Prisoner’s Dilemma Game. “Strategic thinking is the art of outdoing an adversary, knowing that the adversary is trying to do the same.
ECO290E: Game Theory Lecture 12 Static Games of Incomplete Information.
Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 4.1.Dynamic Games of Incomplete Information Lecture
Experimental Economics and Neuroeconomics. An Illustration: Rules.
Presenter: Chih-Yuan Chou GA-BASED ALGORITHMS FOR FINDING EQUILIBRIUM 1.
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
Social Cognitive Learning Theory. What factors influence learning? Environmental –examples Cognitive –examples.
Coordination and Learning in Dynamic Global Games: Experimental Evidence Olga Shurchkov MIT The Economic Science Association World Meeting 2007.
Vincent Conitzer CPS Learning in games Vincent Conitzer
Outline 1. An institutional perspective 2.Experimental design 3.Main result: convergence 4."Clumsy" behavior 5.Simulations 6.Final remarks Raluca PARVULESCU.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 0.Game Theory – Brief Introduction Lecture
Replicator Dynamics. Nash makes sense (arguably) if… -Uber-rational -Calculating.
Economics (and Finance) as an Evolutionary Game A Helicopter Tour for Santa Fe Institute Workshop “Beyond Equilibrium and Efficiency,” May 18-20, 2000.
M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 2.1.Dynamic Games of Complete and Perfect Information Lecture
Replicator Dynamics.
實驗經濟學一:行為賽局論 Experimental Economics I: Behavioral Game Theory 第九講:學習理論:制約、計牌與EWA Lecture 9: Learning: Reinforcement, Fictitious Play, and EWA 授課教師:國立臺灣大學.
Economics & Evolution.
Games with Imperfect Information Bayesian Games
Presentation transcript:

1 Teck H. Ho April 8, 2004 Outline  In-Class Experiment and Motivation  Adaptive Experience-Weighted Attraction (EWA) Learning in Games: Camerer and Ho (Econometrica, 1999)  Sophisticated EWA Learning and Strategic Teaching: Camerer, Ho, and Chong (JET, 2002)  Self-tuning EWA Learning (EWA Lite): Ho, Camerer, and Chong (2004)

2 Teck H. Ho April 8, 2004 Median Action Game

3 Teck H. Ho April 8, 2004 Continental Divide Game

4 Teck H. Ho April 8, 2004 The learning setting  Normal-form game where each player is aware of the payoff table  Player i’s strategy space consists of discrete choices indexed by j (e.g., 1, 2,…, 13, 14)  The game is repeated for several rounds  At each round, all players observed:  Strategy or action history of all other players  Own payoff history

5 Teck H. Ho April 8, 2004 Research question  To develop a good descriptive model of adaptive learning to predict the probability of player i (i=1,…,n) choosing strategy j at round t

6 Teck H. Ho April 8, 2004 Criteria of a “good” model  Use (potentially) all available information subjects receive in a sensible way  Satisfies plausible principles of behavior (i.e., conformity with other sciences such as psychology)  Fits and predicts choice behavior well  Ability to generate new insights  As simple as the data allow

7 Teck H. Ho April 8, 2004 Competing Models  Introspection ( P j ): Requires too much human cognition  Nash equilibrium (Nash, 1950)  Quantal response equilibrium (McKelvey and Palfrey, 1995)  Evolution ( P j (t) ): Players are pre-programmed  Replicator dynamics (Friedman, 1991)  Genetic algorithm (Holland, 1975; Ho, 1996)  Learning ( P i j (t) ): Uses about the right level of cognition  Reinforcement (Roth and Erev, 1995)  Belief-based learning  Cournot best-response dynamics (Cournot, 1838)  Simple Fictitious play (Brown, 195)  Weighted Ficitious play (Fudenberg and Levine, 1998)  Directional learning (Selten, 1991)  Imitation (Schlag, 1998; 1999)  Experience-weighted attraction learning (Camerer and Ho, 1999)

8 Teck H. Ho April 8, 2004 Information Usage in Learning  Choice reinforcement learning (Thorndike, 1911; Bush and Mosteller, 1955; Herrnstein, 1970; Erev and Roth, 1998): successful strategies played again  Belief-based Learning (Cournot, 1838; Brown, 1951; Fudenberg and Levine 1998): form beliefs based on opponents’ action history and choose according to expected payoffs  The information used by reinforcement learning is own payoff history and by belief-based models is opponents action history  EWA uses both kinds of information

9 Teck H. Ho April 8, 2004 “Laws” of Effects in Learning  Law of actual effect: successes increase the probability of chosen strategies  Teck is more likely than Colin to “stick” to his previous choice (other things being equal)  Law of simulated effect: strategies with simulated successes will be chosen more often  Colin is more likely to switch to T than M  Law of diminishing effect: Incremental effect of reinforcement diminishes over time  $1 has more impact in round 2 than in round L R T M B Row player’s payoff table Colin chose B and received 4 Teck chose M and received 5 Their opponents chose L

10 Teck H. Ho April 8, 2004 Assumptions of Reinforcement and Belief Learning  Reinforcement learning ignores simulated effect  Belief learning predicts actual and simulated effects are equally strong  EWA learning allows for a positive (and smaller than actual) simulated effect

11 Teck H. Ho April 8, 2004 The EWA Model  Initial attractions and experience (i.e., )  Updating rules  Choice probabilities

12 Teck H. Ho April 8, 2004 EWA Model and Laws of Effects  Law of actual effect: successes increase the probability of chosen strategies (positive incremental reinforcement increases attraction and hence probability)  Law of simulated effect: strategies with simulated successes will be chosen more often (  > 0 )  Law of diminishing effect: Incremental effect of reinforcement diminishes over time ( N(t) >= N(t-1) )

13 Teck H. Ho April 8, 2004 The EWA model: An Example  Period 0:  Period 1: TBTB L R Row player’s payoff table History: Period 1 = (B,L)

14 Teck H. Ho April 8, 2004 Reinforcement Model: An Example  Period 0:  Period 1: TBTB L R Row player’s payoff table History: Period 1 = (B,L)

15 Teck H. Ho April 8, 2004 Belief-based(BB) model: An Example  Period 0:  Period 1: TBTB L R Bayesian Learning with Dirichlet priors

16 Teck H. Ho April 8, 2004 Model Interpretation  Simulation or attention parameter (  ) : measures the degree of sensitivity to foregone payoffs  Exploitation parameter ( ): measures how rapidly players lock-in to a strategy (average versus cumulative)  Stationarity or motion parameter (  ): measures players’ perception of the degree of stationarity of the environment

17 Teck H. Ho April 8, 2004 Model Interpretation Cournot Weighted Fictitious Play Fictitious Play Average Reinforcement Cumulative Reinforcement

18 Teck H. Ho April 8, 2004 New Insight  Reinforcement and belief learning were thought to be fundamental different for 50 years.  For instance, “….in rote [reinforcement] learning success and failure directly influence the choice probabilities. … Belief learning is very different. Here experiences strengthen or weaken beliefs. Belief learning has only an indirect influence on behavior.” (Selten, 1991)  EWA shows that belief and reinforcement learning are related and special kinds of EWA learning

19 Teck H. Ho April 8, 2004 Actual versus Belief-Based Model Frequencies: Median Action Game

20 Teck H. Ho April 8, 2004 Actual versus Reinforcement Model Frequencies: Median Action Game

21 Teck H. Ho April 8, 2004 Actual versus EWA Model Frequencies: Median Action Game

22 Teck H. Ho April 8, 2004 Estimation and Results

23 Teck H. Ho April 8, 2004 Actual versus Belief-Based Model Frequencies: CDG

24 Teck H. Ho April 8, 2004 Actual versus Reinforcement Model Frequencies: CDG

25 Teck H. Ho April 8, 2004 Actual versus EWA Model Frequencies: CDG

26 Teck H. Ho April 8, 2004

27 Teck H. Ho April 8, 2004

28 Teck H. Ho April 8, 2004 Extensions  Heterogeneity (Camerer and Ho, 1999)  Payoff learning (Camerer, Ho, and Wang, 2000)  Sophistication and strategic teaching (Camerer, Ho, and Chong, 2002)  Self-tuning EWA (or EWA Lite) (Ho, Camerer, and Chong, 2004)  Applications:  Signaling games (Anderson and Camerer, 1999)  Auction markets (Camerer, Ho, and Hsia, 2000)  Product Choice at Supermarkets (Ho and Chong, 2003)