Adapting de Finetti's Proper Scoring Rules for Measuring Bayesian Subjective Probabilities when Those Probabilities Are not Bayesian Peter P. Wakker (&

Slides:



Advertisements
Similar presentations
Paradoxes in Decision Making With a Solution. Lottery 1 $3000 S1 $4000 $0 80% 20% R1 80%20%
Advertisements

Utility Theory.
Research Methods Chapter 2.
Why Do People Under-Search? —The Effects of Payment Dominance on Individual Search Decisions And Learning Gong, Binglin Shanghai JiaoTong University Ramachandran,
Notes: Use this cover page for internal presentations The Behavioural Components Of Risk Aversion Greg B Davies University College.
STAT 135 LAB 14 TA: Dongmei Li. Hypothesis Testing Are the results of experimental data due to just random chance? Significance tests try to discover.
Rational choice: An introduction Political game theory reading group 3/ Carl Henrik Knutsen.
Helsinki University of Technology Systems Analysis Laboratory 1 Dynamic Portfolio Selection under Uncertainty – Theory and Its Applications to R&D valuation.
Judgment and Decision Making How Rational Are We?.
Introduction Dr. Yan Liu Department of Biomedical, Industrial & Human Factors Engineering Wright State University.
Behavioural Economics A presentation by - Alex Godwin, William Pratt, Lucy Mace, Jack Bovey, Luke Baker and Elise Girdler.
BEE3049 Behaviour, Decisions and Markets Miguel A. Fonseca.
Using Prospect Theory to Study Unknown Probabilities ("Ambiguity") by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint with Mohammed Abdellaoui.
Making Rank-Dependent Utility Tractable for the Study of Ambiguity Peter P. Wakker, June 16, 2005 MSE, Université de Paris I Aim:Make rank-dependent utility.
Topic: Subjective chance estimates, e.g.: - chance of rain tomorrow; - chance of strategy choice of opponent. Application considered first: grading of.
Topic: Our chance estimates of uncertain events; e.g.: Hillary Clinton next president of the US. Adapting Proper Scoring Rules for Measuring Subjective.
Ambiguity will concern Dow Jones & Nikkei indexes today: Topic: Quantitative measurement of capacities under ambiguity. Question: How do people perceive.
Evaluating Hypotheses
Tractable Quantifications of Ambiguity Attitudes by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint with Mohammed Abdellaoui & Aurélien Baillon)
1 Introduction to Decision Analysis for Public Policy Decision-Making Ross D. Shachter Management Science and Engineering Stanford University Introduction.
Example, on Dow Jones & Nikkei indexes today: Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg Domain: Individual Decisions under Ambiguity (events.
On Dow Jones & Nikkei indexes today: Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg This file will be on my homepage on coming Monday. Domain: Decisions.
Combining Bayesian Beliefs and Willingness to Bet to Analyze Attitudes towards Uncertainty by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint.
Chapter 2 – Tools of Positive Analysis
Uncertainty RUD, June 28, 2006, Peter P. Wakker Paper: Chapter for New Palgrave. Its didactical beginning is not optimal for this specialized audience.
Adapting de Finetti's Proper Scoring Rules for Measuring Subjective Beliefs to Modern Decision Theories of Ambiguity Gijs van de Kuilen, Theo Offerman,
Stochastic Dominance Michael H. Birnbaum Decision Research Center California State University, Fullerton.
Making Choquet Expected Utility Tractable for the Study of Ambiguity Peter P. Wakker, December 20, 2005 Tel-Aviv University Aim:Make Choquet-expected utility,
Reconciling Introspective Utility with Revealed Preference: Experimental Arguments Based on Prospect Theory Peter P. Wakker ( & Abdellaoui & Barrios; Ecole.
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005 Rational Decision Making.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Judgment and Decision Making in Information Systems Probability, Utility, and Game Theory Yuval Shahar, M.D., Ph.D.
On the relationship between Keynes’s conception of evidential weight and the Ellsberg paradox Alberto Feduzi University of Cambridge and University of.
Combining Bayesian Beliefs and Willingness to Bet to Analyze Attitudes towards Uncertainty by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Behavior in the loss domain : an experiment using the probability trade-off consistency condition Olivier L’Haridon GRID, ESTP-ENSAM.
The Development of Decision Analysis Jason R. W. Merrick Based on Smith and von Winterfeldt (2004). Decision Analysis in Management Science. Management.
Ambiguity will concern Dow Jones & Nikkei indexes today: Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg Lecture + paper are on my homepage Domain:
STOCHASTIC DOMINANCE APPROACH TO PORTFOLIO OPTIMIZATION Nesrin Alptekin Anadolu University, TURKEY.
Decision making Making decisions Optimal decisions Violations of rationality.
Health State Unable to perform some tasks at home and/or at work Able to perform all self care activities (eating, bathing, dressing) albeit with some.
PARAMETRIC STATISTICAL INFERENCE
On the smooth ambiguity model and Machina’s reflection example Robert Nau Fuqua School of Business Duke University.
Decision Making Under Uncertainty and Risk 1 By Isuru Manawadu B.Sc in Accounting Sp. (USJP), ACA, AFM
New Views on Risk Attitudes Peter P. Wakker Economics University of Amsterdam € 100 € 0€ 0 ½ ½ or € 50 for sure What would you rather have? Such gambles.
Proper Scoring Rules and Prospect Theory August 20, 2007 SPUDM, Warsaw, Poland Topic: Our chance estimates of Hillary Clinton to become next president.
Stochastic choice under risk Pavlo Blavatskyy June 24, 2006.
RISK, AMBIGUITY, GAINS, LOSSES Sujoy Chakravarty Department of Humanities and Social Sciences Indian Institute of Technology, Hauz Khas, New Delhi ,
Modern Ways to Model Risk and Uncertainty Peter P. Topic: prospect theory (  classical expected utility) for modeling risk/uncertainty/ambiguity.
Ambiguity Made Operational by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint with Mohammed Abdellaoui & Aurélien Baillon) ESA, Tucson, Oct.
Ellsberg’s paradoxes: Problems for rank- dependent utility explanations Cherng-Horng Lan & Nigel Harvey Department of Psychology University College London.
1 Chapter 8 Hypothesis Testing 8.2 Basics of Hypothesis Testing 8.3 Testing about a Proportion p 8.4 Testing about a Mean µ (σ known) 8.5 Testing about.
Lecture 3 on Individual Optimization Uncertainty Up until now we have been treating bidders as expected wealth maximizers, and in that way treating their.
1 Behavioral approaches to optimal FDI incentives Authors: Mosi Rosenboim- Ben Gurion University and Sapir Collage Israel Luski- Ben Gurion University.
Measuring the impact of uncertainty resolution Mohammed Abdellaoui CNRS-GRID, ESTP & ENSAM, France Enrico Diecidue & Ayse Onçüler INSEAD, France ESA conference,
Allais Paradox, Ellsberg Paradox, and the Common Consequence Principle Then: Introduction to Prospect Theory Psychology 466: Judgment & Decision Making.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Adapting de Finetti's proper scoring rules for Measuring Subjective Beliefs to Modern Decision Theories ofAmbiguity Peter P. Wakker (& Gijs van de Kuilen,
Markov Decision Process (MDP)
UNSW | BUSINESS SCHOOL | SCHOOL OF ECONOMICS Calling the shots Experimental evidence on significant aversion to non-existing strategic risk Ben Greiner.
1 BAMS 517 – 2011 Decision Analysis -IV Utility Failures and Prospect Theory Martin L. Puterman UBC Sauder School of Business Winter Term
Small Decision-Making under Uncertainty and Risk Takemi Fujikawa University of Western Sydney, Australia Agenda: Introduction Experimental Design Experiment.
Fundamentals of Data Analysis Lecture 3 Basics of statistics.
Does risk aversion give us a good reason to diversify our charitable portfolio? James Snowden.
Adapting Proper Scoring Rules for Measuring Subjective Beliefs
CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY
Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg
to Prospect Theory Peter P. Wakker Theo Offerman Joep Sonnemans
Behavioural Economics
Chapter 15 Decisions under Risk and Uncertainty
Presentation transcript:

Adapting de Finetti's Proper Scoring Rules for Measuring Bayesian Subjective Probabilities when Those Probabilities Are not Bayesian Peter P. Wakker (& Gijs van de Kuilen, Theo Offerman, Joep Sonnemans) April 6, 2006 Rotterdam Topic: How to measure your subjective probability p of event E ("rain tomorrow")? Such that also acceptable to frequentists?

Method 1. Ask directly (introspection). Problem. No clear empirical meaning; no incentive(-compatibility). Economists don't like such things. 2

Method 2. Reveal (binary) preferences. € 0.30  (€ 1 under E)  € 0.20  0.30 > p > 0.20 (if EV = expected value) Problem. Much work. Get only inequalities. (And: EV may be violated.) Method 3. Reveal indifferences. € 0.25 ~ (€ 1 under E)  p = 0.25 (if EV …) Problem. Indifferences are hard to observe (BDM …). (And: EV may be violated.) 3

Method 4. de Finetti's book making. Not explained here. Problem. Not implementable in practice. (And: EV may be violated.) 4

Method 5. Proper scoring rules. Choose 0  r  1 as you like (called reported probability). Next, you receive E E c   1 – (1– r) 2 1 – r 2 EV: Take probability p of E; subjective if must be. Then maximize EV. Optimization of r: 5

EV = p(1 – (1– r) 2 ) + (1 – p)(1 – r 2 ) 1 st order condition: 2p(1– r) – (1 – p)2r = 0 2p(1– r) = (1 – p)2r r = p! Wow! Avoids all problems mentioned above, except Problem. EV may be violated … Proper scoring rules tractable & widely used: See Hanson (Nature, 2002), Prelec (Science 2005). In accounting (Wright 1988), Bayesian statistics (Savage 1971), business (Stael von Holstein 1972), education (Echternacht 1972), medicine (Spiegelhalter 1986), psychology (Liberman & Tversky 1993; McClelland & Bolger 1994). 6

Before analyzing nonEV descriptively, a: Theoretical Example [EV]. Urn K ("known") with 100 balls: 25 G(reen), 25 R(ed), 25 S(ilver), 25 Y(ellow). One ball is drawn randomly. E: ball is not red; E = {G,S,Y}. p = Under expected value, optimal r E is r G + r S + r Y = = 0.75 = r E : r satisfies additivity; as probabilities should! 7 Example reanalyzed later.

p 8 Reported probability R(p) = r E as function of true probability p, under: nonEU 0.69 R(p) EU 0.61 r EV EV r nonEU r nonEUA r nonEUA refers to nonexpected utility for unknown probabilities ("Ambiguity"). (c) nonexpected utility for known probabilities, with U(x) = x 0.5 and with w(p) as common; (b) expected utility with U(x) = x 0.5 (EU); (a) expected value (EV); reward: if E true if not E true EV  r EV =  r EU = r EU  r nonEU =  r nonEUA = next p. go to p. 11, Example EU go to p. 15, Example nonEU go to p. 20, Example nonEUA

Deviation 1 [Utility curvature]. Bernoulli (1738): risk aversion!  U concave (if EU …). Now optimize pU ( 1 – (1– r) 2 ) + ( 1 – p )U (1 – r 2 ) 9

10 U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–p) p + p r = Explicit expression: U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = EV: r is additive. EU: r is nonadditive (U nonsymmetric about 0.5). Gives critical test of EV versus EU

Theoretical Example continued [ Expected Utility ]. (urn K, 25 G, 25 R, 25 S, 25 Y). E: {G,S,Y}; p = EV: r EV = Expected utility, U(x) =  x: r EU = r G + r S + r Y = = 0.93 > 0.69 = r E : additivity violated! Such data prove that expected value cannot hold. 11 Example reanalyzed later. go to p. 8, with figure of R(p)

Deviation 2 from EV: nonexpected utility ( Allais 1953, Machina 1982, Kahneman & Tversky 1979, Quiggin 1982, Schmeidler 1989, Gilboa 1987, Gilboa & Schmeidler 1989, Gul 1991, Tversky & Kahneman 1992, etc.) 12 For two-gain prospects, all theories are as follows: For r  0.5, nonEU(r) = w(p)U ( 1 – (1–r) 2 ) + ( 1–w(p) ) U(1–r 2 ). r < 0.5, symmetry; soit! Different treatment of highest and lowest outcome: "Rank-dependence."

p w(p) /3 Figure. The common weighting fuction w. w(p) = exp(–(–ln(p))  ) for  = w(1/3)  1/3; 13 1/3 w(2/3) .51 2/3.51

Now 14 U´(1–r 2 ) U´(1 – (1–r) 2 ) ( 1–w(p) ) w(p) + w(p) r = U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = Explicit expression: w –1 ( )

Example continued [nonEU]. (urn K, 25 G, 25 R, 25 S, 25 Y). E: {G,S,Y}; p = EV: r EV = EU: r EU = Nonexpected utility, U(x) =  x, w(p) = exp(–(–ln(p)) 0.65 ). r nonEU = r G + r S + r Y = = 1.17 > 0.61 = r E : additivity is strongly violated! 15 go to p. 8, with figure of R(p) Deviations from EV and Bayesianism so far were at level of behavior; not of beliefs. Now for something different; more fundamental.

3 rd violation of EV: Ambiguity (unknown probabilities) Preparation for theoretical example. Random draw from additional urn A. 100 balls, ? G a, ? R a, ? S a, ? Y a. Unknown composition ("Ambiguous"). E a : {G a,S a,Y a }; p = ?. How deal with unknown probabilities? Have to give up Bayesian beliefs. 16

1 st Proposal (trying to maintain Bayesian beliefs) Assign "subjective" probabilities to events. Then behave as if known probs (may be nonEU) ("probabilistic sophistication," Machina & Schmeidler 1992; they did it for normative, not as we do now descriptively ). By symmetry then P(G) = P(R) = P(S) = P(Y) = ¼. Treated same as known urn K. Empirically violated (Ellsberg)! 17 > < Probabilistic soph. Bayesian beliefs!

18 Instead of additive beliefs p = P(E), nonadditive beliefs B(E), with B(E)  B(G) + B(S) + B(Y). (Dempster&Shafer, Tversky&Koehler, etc.) All currently existing decision models: For r  0.5, nonEU(r) = w(B(E))U ( 1 – (1–r) 2 ) + ( 1–w(B(E)) ) U(1–r 2 ). (Commonly written with W(E) for w(B(E)), so B(E) = w –1 (W(E)); matter of notation.)

19 U´(1–r 2 ) U´(1 – (1–r) 2 ) ( 1–w(B(E)) ) w(B(E)) + w(B(E)) r E = U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r B(E) = Explicit expression: w –1 ( )

Example continued [Ambiguity, nonEUA]. (urn A, 100 balls, ? G, ? R, ? S, ? Y). E: {G,S,Y}; p = ? r EV = r EU = r nonEU = 0.61 (under plausible assumptions). Typically, w(B(E)) << w(B(E)) (E from known urn). r nonEUA is, say, r G + r S + r Y = = 1.44 > 0.52 = r E : additivity is very extremely violated! r's are close to always saying fifty-fifty. Belief component B(E) = go to p. 8, with figure of R(p)

B(E): ambiguity attitude  /=/  beliefs?? Before entering that debate, first: How measure B(E)? Our contribution: through proper scoring rules with "risk correction." Earlier proposals for measuring B: 21

Proposal 1 (common in decision theory). Measure U,W, and w from behavior. Derive B(E) = w –1 (W(E)) from it. Problem: Much and difficult work!!! 22

Proposal 2 (common in decision analysis of the 1960s, and in modern experimental economics). Measure "canonical probabilities": For ambiguous event E a, find objective prob. p such that (€100 if E a ) ~ (€100 with prob. p). Then (algebra …) B(E) = p. Problem: measuring indifferences is difficult. 23

Proposal 3 (common in proper scoring rules): Calibration … Problem: Need many repeated observations. 24

25 Our proposal: Take the best of all worlds! Get B(E) = w –1 (W(E)) without measuring U,W, and w from decision making. Get canonical probability without measuring indifferences, or BDM. Calibrate without needing long repeated observations. Do all that with no more than simple proper- scoring-rule questions.

26 We reconsider explicit expressions: U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = w –1 ( ) U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r B(E) = w –1 ( ) Corollary. p = B(E) if related to the same r!!

27 Because p = B(E) if related to same: We simply measure the R(p) curves, and use their inverses: B(E) = R –1 (r E ). Appling R –1 is called risk correction. Directly implementable empirically. We did so in an experiment, and found plausible results.

Experimental Test of Our Correction Method 28

Method Participants. N = 93 students. Procedure. Computarized in lab. Groups of 15/16 each. 4 practice questions. 29

30 Stimuli 1. First we did proper scoring rule for unknown probabilities. 72 in total. For each stock two small intervals, and, third, their union. Thus, we test for additivity.

31 Stimuli 2. Known probabilities: Two 10-sided dies thrown. Yield random nr. between 01 and 100. Event E: nr.  75 (etc.). Done for all probabilities j/20. Motivating subjects. Real incentives. Two treatments. 1. All-pay. Points paid for all questions. 6 points = €1. Average earning € One-pay (random-lottery system). One question, randomly selected afterwards, played for real. 1 point = €20. Average earning: €15.30.

32 Results

33 Average correction curves.

ρ F(ρ) treatment one treatment all Individual corrections.

35

Summary and Conclusion Modern decision theories show that proper scoring rules are heavily biased. We present ways to correct for these biases (get right beliefs even if EV violated). Experiment shows that correction improves quality and reduces deviations from ("rational"?) Bayesian beliefs. Correction does not remove all deviations from Bayesian beliefs. Beliefs seem to be genuinely nonadditive/nonBayesian/sensitive- to-ambiguity. 36