Evolutionary Dynamics, Game Theory, and Psychology.

Slides:



Advertisements
Similar presentations
Nagel: Unraveling in Guessing Games: An Experimental Study Economics 328 Spring 2005.
Advertisements

Innuendos and Common Knowledge. Consider the following examples of innuendos: "I hear you're the foreman of the jury in the Soprano trial. It's an important.
Math Intro: Matrix Form Games and Nash Equilibrium.
3. Basic Topics in Game Theory. Strategic Behavior in Business and Econ Outline 3.1 What is a Game ? The elements of a Game The Rules of the.
The National Council on Economic Education/John Templeton Foundation Teaching the Ethical Foundations of Economics Lesson 2: What Is the Difference Between.
Is It Rational to Vote? Political scientists study all aspects of voting behavior. The most interesting question, of course, is who votes for whom and.
EDUC 200 Final Power Point Presentation Scott Reding Spring 2005.
Chapter Twenty-Eight Game Theory. u Game theory models strategic behavior by agents who understand that their actions affect the actions of other agents.
Chapter 14 Infinite Horizon 1.Markov Games 2.Markov Solutions 3.Infinite Horizon Repeated Games 4.Trigger Strategy Solutions 5.Investing in Strategic Capital.
1 Chapter 14 – Game Theory 14.1 Nash Equilibrium 14.2 Repeated Prisoners’ Dilemma 14.3 Sequential-Move Games and Strategic Moves.
Game Theory and Computer Networks: a useful combination? Christos Samaras, COMNET Group, DUTH.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium
 1. Introduction to game theory and its solutions.  2. Relate Cryptography with game theory problem by introducing an example.  3. Open questions and.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
4 Why Should we Believe Politicians? Lupia and McCubbins – The Democratic Dilemma GV917.
An Introduction to Game Theory Part I: Strategic Games
A camper awakens to the growl of a hungry bear and sees his friend putting on a pair of running shoes, “You can’t outrun a bear,” scoffs the camper. His.
Algoritmi per Sistemi Distribuiti Strategici
Cooperating Without Looking. Suppose a friend asks you to proofread a paper… You hesitate while thinking about how big a pain it is and say, “Hmm. Um.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
ECON6036 1st semester Format of final exam Same as the mid term
Why How We Learn Matters Russell Golman Scott E Page.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Job Market Signaling (Spence model)
An introduction to game theory Today: The fundamentals of game theory, including Nash equilibrium.
An introduction to game theory Today: The fundamentals of game theory, including Nash equilibrium.
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
Extensive Game with Imperfect Information Part I: Strategy and Nash equilibrium.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Games in the normal form- An application: “An Economic Theory of Democracy” Carl Henrik Knutsen 5/
EC941 - Game Theory Francesco Squintani Lecture 3 1.
UNIT II: The Basic Theory Zero-sum Games Nonzero-sum Games Nash Equilibrium: Properties and Problems Bargaining Games Bargaining and Negotiation Review.
An introduction to game theory Today: The fundamentals of game theory, including Nash equilibrium.
Introduction to Game Theory
Common Knowledge: The Math. We need a way to talk about “private information”
Game Theory, Strategic Decision Making, and Behavioral Economics 11 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the.
Chapter 12 Choices Involving Strategy Copyright © 2014 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written.
Nash equilibrium Nash equilibrium is defined in terms of strategies, not payoffs Every player is best responding simultaneously (everyone optimizes) This.
By: Isaiah Magpali-Isaac, Tatianna Smith, Viris Colmenero Farrelly, Daniel, Lazarus, John, & Roberts, Gilbert (2007). Altruists Attract. Evolutionary Psychology.
McGraw-Hill/Irwin Copyright  2008 by The McGraw-Hill Companies, Inc. All rights reserved. GAME THEORY, STRATEGIC DECISION MAKING, AND BEHAVIORAL ECONOMICS.
Ethical Rules, Games, and Evolution Ted Bergstrom, Economics Dept, UCSB.
Chapters 29, 30 Game Theory A good time to talk about game theory since we have actually seen some types of equilibria last time. Game theory is concerned.
Lecture 7 Course Summary The tools of strategy provide guiding principles that that should help determine the extent and nature of your professional interactions.
Extensive Games with Imperfect Information
Topic 3 Games in Extensive Form 1. A. Perfect Information Games in Extensive Form. 1 RaiseFold Raise (0,0) (-1,1) Raise (1,-1) (-1,1)(2,-2) 2.
Voter Turnout. Overview Recap the “Paradox” of Voting Incentives and Voter Turnout Voter Mobilization.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
Frank Cowell: Microeconomics Game Theory: Basics MICROECONOMICS Principles and Analysis Frank Cowell March 2004.
Experimental Design Econ 176, Fall Some Terminology Session: A single meeting at which observations are made on a group of subjects. Experiment:
Wrap-Up. Our approach: game theory + evolutionary dynamics = ultimate explanation for people’s puzzling preferences, beliefs, ideologies, and heuristics.
Auctions serve the dual purpose of eliciting preferences and allocating resources between competing uses. A less fundamental but more practical reason.
Chapter 6 Extensive Form Games With Perfect Information (Illustrations)
Chapter 9 Prosocial Behavior: Doing What’s Best for Others © 2014 Wadsworth Cengage Learning Oskar Schindler’s grave. The Hebrew inscription reads: “A.
Replicator Dynamics. Nash makes sense (arguably) if… -Uber-rational -Calculating.
Lec 23 Chapter 28 Game Theory.
In your notebooks: 1.) Write down the following names: 1. Auguste Comte 2. Harriet Martineau 3. Herbert Spencer 4. Emile Durkeim 5. Max Weber 6. Karl Marx.
Indirect Reciprocity in the Selective Play Environment Nobuyuki Takahashi and Rie Mashima Department of Behavioral Science Hokkaido University 08/07/2003.
Game theory Chapter 28 and 29
Correlated equilibria, good and bad: an experimental study
Contents of the Talk Preliminary Materials Motivation and Contribution
Sex Ratios Illustrates both the power of game theory. As well as how to “test’ game theory model. Albeit not our domain.
Replicator Dynamics.
11b Game Theory Must Know / Outcomes:
Conclusion.
Game theory Chapter 28 and 29
Unit 4 SOCIAL INTERACTIONS.
11b – Game Theory This web quiz may appear as two pages on tablets and laptops. I recommend that you view it as one page by clicking on the open book icon.
Chapter 29 Game Theory Key Concept: Nash equilibrium and Subgame Perfect Nash equilibrium (SPNE)
Molly W. Dahl Georgetown University Econ 101 – Spring 2009
Presentation transcript:

Evolutionary Dynamics, Game Theory, and Psychology

Today and next Tues: we will show you some work we are doing at PED -that uses evolutionary dynamics game theory experiments -to understand human psychology, e.g. Altruistic motives Sense of beauty Why we are principled

Will start off philosophical (in order to set the stage) Then will present ongoing research… This is work done at PED, with Erez, Oliver, Carl, Alex, Matthiajs, Martin …

Topic 1: Charitable Giving

We give A LOT ~2% of GDP are donated to charity ~ 2-4% of hours worked are volunteered

But we dont always give in the most effective way…

Habitat for Humanity: college educated 19 year olds whove never held a hammer fly halfway across the world To build new homes in places where there is plenty of cheap, qualified labor!

Is this just because we dont realize how ineffective Habitat for Humanity is? Or do we not care how effective we are?

IF we ACTUALLY care about being effective…we should give more when our gifts are matched, no? And even more so our gifts are tripled, no? Do we?

Some economists ran a study where they collected $ for charity… And they manipulated the matching rate

What impact did the matching rate have? None…

We ask… Why dont we care about effectiveness? How can we make giving more effective? More generally… Why are our altruistic preferences so funny? Can we characterize our altruistic preferences? And can we use this knowledge to increase giving? Or to make giving more impactful?

Topic 2: Beauty

A few facts about beauty…

Fact 1: varies by culture

e.g., lets learn about the ideal body weight in some indigenous populations in Nigeria...

Clearly, ideal body weight isnt the same there as here. Why not? Random cultural variation?

Likewise, ideal skin tone varies by culture… In Eastern countries, un-tanned skin is considered more attractive …

And likewise, in the West in the old days…

But nowadays in the West we seem to prefer tanned skin…

Why did our preferences change? Why do they differ between East and West? Random fluctuations?

Even our ideal finger nail length varies by culture… Heres what finger nails look like on some men among the Khasi in N.E. India

Fact 2: We like art that is authentic (even if looks the same!)

Let me tell you about a study that illustrated this…

Some subjects were told the second painting was purposely designed by a different artist, making the second painting a forgery. Some subjects were told the same artist painted both. Within each group, half the subjects were told painting A was created first.

making the second painting a replica. Source: Newman and bloom (2011) Same artist Different artist making the second painting a replica.

Why do we like our art to be original?

Perhaps this has nothing to do with aesthetics… Well…when the study was repeated with an artifact (e.g. a car) instead of paintings, there was no effect…

Source: Newman and bloom (2011)

Topic 3: Principles

Why do we like people who are principled?

For instance, this statesmen in the West Wing who returns a card that can save his life...out of principlethis statesmen in the West Wing who returns a card that can save his life And we admire him for it…even though turning down the card helps no one

In contrast, people who are strategic and calculating, like this cop from the wire who prefer better stats to solving murders …are repulsive. Even though his strategic actions dont harm you…

Why do we like people who are principled/idealistic and dislike those who are strategic/calculated/Machiavellian, regardless of whether their actions help or harm us?

More generally… Where do our preferences and ideologies come from?

In our research… We try to understand from where such preferences and ideologies come from. Using evolutionary dynamics + game theory + experiments

Quick review: what is game theory?

Game theory models behavior in any social interaction Social interaction=my payoffs depend on what I do as well as what others do. Let me illustrate using a simple game…

5, 6 8, 4 3, 20, -3 U D LR The simplest game can be represented by the following payoff matrix

5,6 8, 4 3, 20, -3 U D LR Player 1 chooses between two actions

5, 6 8, 4 3, 20, -3 U D LR Player 2 simultaneously chooses between 2 actions

5, 6 8, 4 3, 20, -3 U D LR The payoffs to player 1 are determined by her action as well as the action of player 2

5, 6 8, 4 3, 20, -3 U D LR The payoffs to player 2 are determined by her action as well as the action of player 2

The main insight of game theory comes from the Nash equilibria Which often have counterintuitive properties, or allows us to clarify things we already know.

5, 6 8, 4 3, 20, -3 U D LR This game can be solved by finding the Nash equilibrium

5, 6 8, 4 3, 20, -3 U D LR (U, L) is a Nash Equilibrium b/c neither can benefit by unilaterally deviating

Predictions of game theory: If both expected (U,L), both would play (U,L)! (Nash is self enforcing)

5, 6 8, 4 3, 20, -3 U D LR (U,R) is NOT a Nash Equilibrium b/c 2 can benefit by unilaterally deviating to L

Game theory predicts: If both expected (U,R), player 2 would deviate! (I.e. if not Nash, cannot be stable)

Nash makes sense (arguably) if… -Uber-rational -Calculating

Nash also makes sense (as 1 st approx.) if: 1) strategies that yield higher payoffs, reproduce faster 2) evolutionary dynamics Nash

E.g. Zahavis handicap principle

But why would game theory matter for our preferences/ideologies? We dont choose what to find beautiful? We didnt evolve to like long finger nails?

But… If preferences/ideologies learned/evolve… Nash becomes relevant… Heres why:

Our thesis (in a few steps):

1) Reinforcement learning/or prestige biased imitation causes behaviors that do well to grow in frequency…

T=0T=1 More successful behaviors imitated more Prestige Biased Imitation

T=0T=1 Reinforcement Learning More successful behaviors held more tenaciously

2) If one behavior is ALWAYS best…this will eventually lead to that behavior dominating…

T=0T=1T=2T=3

3) If however, the behavior is a strategy in a game…strategies still become more frequent if they fair well… But whether they do well could depend on what others are doing…so things can be a bit more complicated…nevertheless… The dynamics often (Not always! Must Check!) settle on a NE

T=0T=1T=2T=3 L L L L L L L L R R R R R R R L L L L L L L L R R L R R L L L L

4) Suppose now that instead of choosing actions in a game…people merely act in accordance with their ideologies or preferences… But ideologies and preferences ALSO can be (at least partially!) imitated or learned…

E.g. P L is preferences that causes action L to be taken…

T=0T=1 PLPL PLPL PLPL PLPL PLPL PLPL PLPL PLPL PRPR PRPR PRPR PRPR PRPR PRPR PRPR PLPL Feelings/beliefs that do better become more frequent

Then behavior will end up at a NE

T=0T=1T=2T=3 PLPL PLPL PLPL PLPL PLPL PLPL PLPL PLPL PRPR PRPR PRPR PRPR PRPR PRPR PRPR PLPL PLPL PLPL PLPL PLPL PLPL PLPL PLPL PLPL PRPR PLPL PRPR PRPR PLPL PLPL PLPL PLPL

Then behavior will end up at a NE… Notice: -At NE with respect to the original payoffs, not the learned preferences!) -IF dynamics nash!) -no need to be aware of why we have these preferences/ideologies (philosophers can spin their wheels on how these ideologies are right) -IF dynamics nash under weak selection/high mutation, then dont even need to believe preferences/ideologies are THAT responsive to payoffs…just a little bit… And preferences/ideologies will have all sorts of (predictable) quirks that Nash has… -e.g. might like wasteful displays, if preferred partners can display at lower cost… -e.g. might desire to cooperate but not care about effectiveness...

OK… What can this approach teach us? What work needs to be done?

Some game theory modeling, demonstrating the quirky preferences/ideologies consistent with NE Some evolutionary dynamics, demonstrating the NE emerges from dynamic processes. Some experiments, testing the predictions of the models.

We will present examples of each: 1) Evolutionary dynamics costly signals (Which can explain why we evolved/learned to like authentic art…or why Indians like long nails) 2) Game Theory Model + Evolutionary Dynamics of cooperate without looking (which can explain why we like those who are principled. And also why love blinds us…and why we find markets for kidneys gross…And also leads to valuable prescriptions!) 3) Experiment demonstrating that people give more efficiently when efficiency commonly known (which is consistent with the ED+GT model of why we give inefficiently…and also leads to valuable prescription!)

Project 1: Experiment on (In)efficient Giving -(preliminary) experiment demonstrating that people give more efficiently when efficiency commonly known -(brief) discussion of ED+GT model

Heres the idea … If preferences/ideologies that motivate us to give evolved/learned because of reciprocity or partner choice… Then private information about effectiveness cannot matter Why? Because others dont know that information so cant reciprocate, punish, match etc. based on it

Ran a simple MTurk experiment showing that people give more effectively not only when they know efficiency, but when efficiency is commonly known

Design: Ask participants to distribute donations across a list of similar charities (in one of four categories) Subjects were told their contributions would be observed by a third party

We obtained ratings for charities by scraping

Three treatments: No Ratings: subjects provided no information about charity effectiveness Private Ratings: subjects given ratings by an external rating source, but told 3 rd party would not be given ratings Public Ratings: subjects given ratings and told 3 rd party would also be given ratings

Private Information – Condition

Public Information – Condition

Note: can explain why we are not impacted by -matching -or scope -and give to inefficient charities like habitat for humanity… EVEN IF we knew (in)efficiency, since efficiency not commonly known!

Note: NOT just useless theorizing…leads to valuable prescription: To nudge the most impact out of existing prosocial motives, need to make information about effectiveness not just known but ALSO commonly known.

Project 2: Evolutionary Dynamics of Costly Signals

Recall…

First lets argue that long fingernails yield behavior consistent with Nash in a costly signaling model (and also ideal weight, skin tans, and authentic art)

Recall Zahavis explanation for the peacock tail Tail is costly for all, more costly for unfit males. So is NE where females more likely to mate With males with long tails, and only fit males find it worth the extra mating to grow long tail

Similarly for long nails… Long nails costly for all (tail hinders flight) More costly for farmers than teachers (more hindrance for unfit males) Females prefer mating with teachers (peahens prefer fit males)

This, we will argue is WHY Khasi females find long nails beautiful (we will later discuss how the same model can explain our other puzzles about beauty)

However, for this explanation to work, we need to be confident this Nash emerges in a dynamic model. (No one chooses what to find beautiful, they simply learn, via RL or PBI)

We -created stylized costly signaling model, in which there are costly signaling equilibria (separating equilibria … as well as other less interesting equilibria) -We investigate various dynamics and find conditions under which separating equilibrium emerges.

Here is the stylized model…

1 1 High Low P 1 1 S0S0 S1S1 S2S2 S3S3 2 2 Accept Reject

1 1 High Low P

1 1 High Low P

1 1 High Low P 1 1 S0S0 S1S1 S2S2 S3S S n < S n+1 S n< << S n+1 if low

1 1 High Low P 1 1 S0S0 S1S1 S2S2 S3S S n < S n+1 S n< << S n+1 if low

High Low P 1 1 S0S0 S1S1 S2S2 S3S3 2 2 Accept Reject S n < S n+1 S n< << S n+1 if low

High Low P 1 1 S0S0 S1S1 S2S2 S3S3 2 2 Accept Reject S n < S n+1 S n< << S n+1 if low e.g. 0,3,6,9 and 0,1,2,3 e.g. 5,5,-5 e.g. P=1/3

Examples of Strategy Profiles and Payoffs (0,-1,0) (-1,-1,-10/3)

Nash Equilibrium = s.t. none benefit by unilaterally deviating

We will investigate the dynamics E.g. Moran

s2s2 s2s2 s2s2 s2s2 s3s3 s3s3 s0s0 s0s0 s3s3 s3s3 High 1s s0s0 s0s0 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s0s0 s0s0 s1s1 s1s1 s0s0 s0s0 Low 1s {s 2, s 3 } 2s {s 2, s 3 } e.g.N L =100 N H =100 N 2 =150 {s 2, s 3 }

s0s0 s0s0 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s0s0 s0s0 s1s1 s1s1 s0s0 s0s0 Low 1s

s0s0 s0s0 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s0s0 s0s0 s1s1 s1s1 s0s0 s0s0 1-w+w(payoffs) e.g. w=.1.6.9

s0s0 s0s0 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s0s0 s0s0.

s0s0 s0s0 s1s1 s1s1 s1s1 s1s1 s1s1 s1s1 s3s3 s3s3 s1s1 s1s1 s0s0 s0s0. With probably μ choose random strategy

mu w

Efficient separating!

And if we aggregate across time, and many simulation runs?

X X X

Why is this equilibrium emerging? Because it is hard to get out of, compared to the other equilibrium.

As soon as receiver drifts to accepting 2 or 3 Enough receivers must have neutrally drifted to accept 1 so worth for good but not bad types Since good but not bad sending 1, receivers start accepting 1, to point where bad start sending Very quickly After bad start Sending 1, receivers stop Accepting 1 If in meantime Receivers stop Accepting 2 (by drift), then Both good and Bad better Sending 0 As soon as receiver drifts to accepting 1 or 2

Is this result robust?

1)payoffs 2)noise 3)experimentation rate 4)reinforcement learning

Reinforcement Learning

Even works for super high experimentation rates!

Does depend on interesting new condition: Do females prefer to pair with random male? P=1/2 No longer easy to leave pooling!

How can we interpret this condition? What if there arent that many farmers…e.g. in the U.S.? wont be attracted to long finger nails!

Similarly if, signals not costly (art that is replica?) Or if more costly for high type (sun exposure in U.S. today vs past vs China?)

Conclusion: Even though dont choose whats beautiful Some aspects can still be explained by costly signaling… Provided (ever so slightly) more likely to imitate successful peoples notion of beauty, or (even if just a tiny bit ) more likely to adhere to notions of beauty when lead to nice outcomes…

But for this conclusion… We needed to show how evolutionary dynamics work in costly signaling games

Project 3: Game Theory model + Evo Dynamics of Cooperating without Looking (recall: principled vs. strategic)

Suppose a friend asks you to proofread a paper… You hesitate while thinking about how big a pain it is and say, Hmm. Um. Well, OK. You get less credit than if you agreed w/o hesitation

Colleague asks you to attend his talk. You ask, will this benefit my research? before agreeing to attend. You get less credit than if you agreed without asking

Why do you get less credit for cooperating when you deliberate (look)?

Note: cannot be explained by existing models of repeated games, like repeated prisoners dilemma (In such models, players can only attend to your past actions not deliberation process)

Intuitively… Cooperators who dont look (cwol) can be trusted to cooperate even when the temptation to defect is high But how do we know this added trust is worth the cost of losing out on missed opportunities to defect?

We will… 1) Describe a simple model, the envelope game 2) Find (natural, intuitive) conditions under which CWOL is an equilibrium of this game 3) Show that even if agents are not consciously choosing their strategies but instead strategies are learned or evolved (replicator dynamic), cwol still emerges (i.e. has a sizeable basin of attraction) 4) Interpret these results in terms of some less straightforward social applications, such as why we: 1) like politicians who appear principled 2) shun taboo tradeoffs 3) are blinded by love For which… Our equilibrium condition will yield novel predictions This analysis will also lead to some useful prescriptions

Here is our model…

The Envelope Game

First… We model variation in costs of cooperation as follows: With probability p, Low Temptation card is chosen and stuffed in envelope With probability 1-p, High Temptation is chosen

Second… We model player 1s choice of whether to look 1 chooses whether or not to open the envelope Crucially we assume others (player 2) can observe whether the envelope was opened 2 2

2 2 Third… 1 then chooses whether or not to Cooperate 2 is again able to observe

Fourth… We model others trust in Player 1 Player 2 chooses whether to continue the interaction or exit (If he continues, the game repeats, with future payoffs discounted by w)

We assume the payoffs have the following properties: 1)Cooperation is costly for Player 1, especially when the temptation is high 2)Both players like cooperative interactions, but Player 2 would prefer no interaction to one in which Player 1 sometimes defects

a, b c H, dc L, d C D High Temptation Low Temptation We represent this using the following variables:

a, b c H, dc L, d C D High Temptation Low Temptation Our assumptions then amount to: 1) c H >c L > a> 0 2) b*p + d*(1–p) < 0 < b

Result 1: 1 cooperates without looking (CWOL) 2 continues iff 1 CWOL is an equilibrium, provided: a/(1-w) > c L p + c H (1-p)

Intuition: If 1 deviates to look, might as well defect, in which case expect c 1 p + c 2 (1-p) today and 0 ever after If CWOL, get a today and henceforth, i.e. a/(1-w) CWOL is an equilibrium iff a/(1-w) > c L p + c H (1-p) Interpretation: CWOL is an equilibrium iff EXPECTED gains from defecting today are less than the value of maintaining a cooperative interaction

Lets contrast this with equilibrium conditions for cooperate with looking (CWL) to see when looking matters Now player 1 may be tempted to deviate when she knows the temptation is high, in which case she would get c H So we need a/(1-w) > c H I.e. CWL is an equilibrium iff MAXIMAL gains from defecting today are less than the value of maintaining a cooperative interaction Hence we predict Looking will matter when the expected gains from defecting are small but the maximal gains are large

Likewise, if we relax our assumption that b*p+d*(1–p)<0<b, (but retain d<0<b) We get another equilibrium where player 1 looks and defects when the temptation is high and player 2 exits iff 1 defects and the temptation is low For looking to matter we ALSO need that defection is sufficiently bad for 2 that he doesnt want to interact with 1s who even seldomly defect

What if strategies are not consciously calculated? For instance, we might trust people based on a gut feeling or we might refuse to interact with people who disobey our ethics. Or we might just have a heuristic that tells us not to look Thats where the main thesis of last class fits in! We assume that feelings, ethics, heuristics that yield higher payoffs are more likely to be imitated (prestige biased imitation), reproduce (natural selection), or held tenaciously (reinforcement learning) We will model this using the replicator dynamic, and show CWOL also emerges in the relevant parameter region

In the replicator dynamic… -an infinite population of each Player 1s and player 2s -at any point in time, each strategy has a certain frequency -payoffs are determined based on the expected opponents play, given this frequency -strategies reproduce proportional to their payoffs

Note: -Replicator requires few strategies, so we restrict to 7 that include all important deviants -Replicator cannot be solved analytically; we numerically estimate in computer simulation -We will need to classify strategies into those that are behaviorally equivalent

Restricted Strategy Space

Classifying Populations

Simulation For each of many parameter values… For each of 5,000 trials… We seed the population with random mixtures of strategies Numerically estimate the replicator dynamic (which is an ODE) Wait for the population to stabilize Then classify the outcomes (ignoring small errors)

We find… Population ends up at CWOL fairly often in relevant parameter region

a*=c H /c L p + c H (1-p),)a**=c H /(1-w)

Now lets discuss some social applications…

First application: Why do we like politicians who have principles and not those who are strategic (e.g. those who flip flop)? (and more generally, why do we like those who are principled? And when will we care?)

We argue… Someone who is strategic is likely to choose the policy that benefits himself when given the power When will we care if others are strategic? Not if incentives sufficiently aligned that wont ever be in a position where tempted to drastically harm us (i.e. b*p+d*(1–p)<0<b) E.g., crucial that girlfriend/boyfriend is principled, but not so crucial that doubles partner is, because she has no occasions where tempted to really harm you

Second application: Taboo tradeoffs I.e. unwillingness to tradeoff sacred (e.g. life), against mundane (e.g. money) E.g. many find economists perverse for applying cost benefit to value of life Taboo to CONSIDER tradeoff

We find such tradeoffs disgusting because… Such tradeoffs signal a willingness to look at the benefits of defecting disgust signals we wouldnt look, or wouldnt interact with looker When will we find such tradeoffs disgusting? Usually not worth transgressing, but sometimes very beneficial i.e. c H >a/(1-w) > c L p + c H (1-p) (And those transgressions harmful)

This has an important policy implication: While politicians might want to signal that they would never trade lives for money The gains from appearing trustworthy accrue to the politicians while costs accrue to us We should force policymakers and lawyers to tackle these admittedly hard-to-fathom tradeoffs

Third application: love Love has the property that it blinds us to the costs and benefits of doing good to our partner (e.g., dont cheat regardless of opportunity)

Why does love have this property? Because those in love can be trusted, so will make better long-term partners

Gives three new predictions about love

First… Falling in or out of love depends on the distribution of temptations, but not their immediate realizations That is, people may fall out of love when there is a permanent change in opportunities, but not an immediate temptation

Second… Love comes with a cost–the cost of ignored temptations–and suggests that this cost must be compensated with commensurate investment in the relationship Only sometimes is it worthwhile for the recipient of love to compensate a suitor for such missed opportunities, o/w will prefer suitor not fall in love

Third… Just looking at the costs or benefits can hasten the demise of a relationship, even if dont defect Perhaps why partner may get upset if sees you looking even if never act on temptation Or why get upset when partners suggest a pre-nup

General conclusion:

In our research… We try to understand from where such preferences and ideologies come from. Using evolutionary dynamics + game theory + experiments

Some game theory modeling, demonstrating the quirky preferences/ideologies consistent with NE Some evolutionary dynamics, demonstrating the NE emerges from dynamic processes. Some experiments, testing the predictions of the models.

We will present examples of each: 1) Evolutionary dynamics costly signals (Which can explain why we evolved/learned to like authentic art…or why Indians like long nails) 2) Game Theory Model + Evolutionary Dynamics of cooperate without looking (which can explain why we like those who are principled. And also why love blinds us…and why we find markets for kidneys gross…And also leads to valuable prescriptions!) 3) Experiment demonstrating that people give more efficiently when efficiency commonly known (which is consistent with the ED+GT model of why we give inefficiently…and also leads to valuable prescription!)