Topic: Subjective chance estimates, e.g.: - chance of rain tomorrow; - chance of strategy choice of opponent. Application considered first: grading of.

Slides:



Advertisements
Similar presentations
Design of Experiments Lecture I
Advertisements

Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Decision Making Under Uncertainty CSE 495 Resources: –Russell and Norwick’s book.
© 2011 Pearson Education, Inc
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Evaluating Non-EU Models Michael H. Birnbaum Fullerton, California, USA.
Part 3: The Minimax Theorem
Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions.
Judgment and Decision Making How Rational Are We?.
Using Prospect Theory to Study Unknown Probabilities ("Ambiguity") by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint with Mohammed Abdellaoui.
Topic: Our chance estimates of uncertain events; e.g.: Hillary Clinton next president of the US. Adapting Proper Scoring Rules for Measuring Subjective.
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
1 Stochastic Dominance Scott Matthews Courses: /
1 Introduction to Decision Analysis for Public Policy Decision-Making Ross D. Shachter Management Science and Engineering Stanford University Introduction.
Combining Bayesian Beliefs and Willingness to Bet to Analyze Attitudes towards Uncertainty by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint.
Chapter 2 – Tools of Positive Analysis
Uncertainty RUD, June 28, 2006, Peter P. Wakker Paper: Chapter for New Palgrave. Its didactical beginning is not optimal for this specialized audience.
Adapting de Finetti's Proper Scoring Rules for Measuring Subjective Beliefs to Modern Decision Theories of Ambiguity Gijs van de Kuilen, Theo Offerman,
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005 Rational Decision Making.
Determining the Size of
Copyright © 2005 by Evan Schofer
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
The Marriage Problem Finding an Optimal Stopping Procedure.
Chapter 3: Equations and Inequations This chapter begins on page 126.
On the relationship between Keynes’s conception of evidential weight and the Ellsberg paradox Alberto Feduzi University of Cambridge and University of.
Combining Bayesian Beliefs and Willingness to Bet to Analyze Attitudes towards Uncertainty by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Ambiguity will concern Dow Jones & Nikkei indexes today: Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg Lecture + paper are on my homepage Domain:
Decision making Making decisions Optimal decisions Violations of rationality.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
PARAMETRIC STATISTICAL INFERENCE
Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.
Adapting de Finetti's Proper Scoring Rules for Measuring Bayesian Subjective Probabilities when Those Probabilities Are not Bayesian Peter P. Wakker (&
Basics of Probability. A Bit Math A Probability Space is a triple, where  is the sample space: a non-empty set of possible outcomes; F is an algebra.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.
New Views on Risk Attitudes Peter P. Wakker Economics University of Amsterdam € 100 € 0€ 0 ½ ½ or € 50 for sure What would you rather have? Such gambles.
Proper Scoring Rules and Prospect Theory August 20, 2007 SPUDM, Warsaw, Poland Topic: Our chance estimates of Hillary Clinton to become next president.
Stochastic choice under risk Pavlo Blavatskyy June 24, 2006.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Modern Ways to Model Risk and Uncertainty Peter P. Topic: prospect theory (  classical expected utility) for modeling risk/uncertainty/ambiguity.
Ambiguity Made Operational by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint with Mohammed Abdellaoui & Aurélien Baillon) ESA, Tucson, Oct.
Ellsberg’s paradoxes: Problems for rank- dependent utility explanations Cherng-Horng Lan & Nigel Harvey Department of Psychology University College London.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Uncertainty Management in Rule-based Expert Systems
Research Methodology and Methods of Social Inquiry Nov 8, 2011 Assessing Measurement Reliability & Validity.
Auctions serve the dual purpose of eliciting preferences and allocating resources between competing uses. A less fundamental but more practical reason.
©2005, Pearson Education/Prentice Hall CHAPTER 1 Goals and Methods of Science.
Sampling and estimation Petter Mostad
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Adapting de Finetti's proper scoring rules for Measuring Subjective Beliefs to Modern Decision Theories ofAmbiguity Peter P. Wakker (& Gijs van de Kuilen,
UNSW | BUSINESS SCHOOL | SCHOOL OF ECONOMICS Calling the shots Experimental evidence on significant aversion to non-existing strategic risk Ben Greiner.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Scoring Rules, Generalized Entropy and Utility Maximization Victor Richmond R. Jose Robert F. Nau Robert L. Winkler The Fuqua School of Business Duke University.
Small Decision-Making under Uncertainty and Risk Takemi Fujikawa University of Western Sydney, Australia Agenda: Introduction Experimental Design Experiment.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics 19 Confidence Intervals for Proportions.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Adapting Proper Scoring Rules for Measuring Subjective Beliefs
CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY
Session II: Reserve Ranges Who Does What
Ascertaining certain certainties in choices under uncertainty
Peter P. Wakker & Enrico Diecidue & Marcel Zeelenberg
to Prospect Theory Peter P. Wakker Theo Offerman Joep Sonnemans
Psych 231: Research Methods in Psychology
Presentation transcript:

Topic: Subjective chance estimates, e.g.: - chance of rain tomorrow; - chance of strategy choice of opponent. Application considered first: grading of students. Adapting Proper Scoring Rules for Measuring Subjective Beliefs to Nonexpected Utility A Truth Serum for Non-Bayesians March 14, 2007 Nottingham Peter P. Wakker Theo Offerman Joep Sonnemans Gijs van de Kuilen

Say, you grade a multiple choice exam in geography to test students' knowledge about Statement E: Capital of North Holland = Amsterdam. 2 Reward: if E true if not E true  E  not-E Assume: Two kinds of students. 1.Those who know. They answer correctly. 2.Those who don't know. Answer Problem: Correct answer does not completely identify student's knowledge. Some correct answers, and high grades, are due to luck. There is noise in the data.

One way out: oral exam, or ask details. Too time consuming. Now comes a perfect way out: Exactly identify state of knowledge of each student. Still multiple choice, taking no more time! Best of all worlds! 3

4 For those of you who do not know answer to E. What would you do? Best to say "don't know." System perfectly well discriminates between students! Reward: if E true if not E true  E  not-E  don't know

New Assumption: Students have all kinds of degrees of knowledge. Some are almost sure E is true but are not completely sure; others have a hunch; etc. Above system does not discriminate perfectly well between such various states of knowledge. One solution (too time consuming): Oral exams etc. Second solution is to let them make many binary choices: Expanation in some detail follows; solution will be too time consuming. 5

6  don't know 0.20  E Reward: if E true if not E true 1 0 choice 2  don't know 0.10  E Reward: if E true if not E true 1 0 choice 1  don't know 0.30  E Reward: if E true if not E true 1 0 choice  don't know 0.90  E Reward: if E true if not E true 1 0 choice 9

7 Etc. Can get approximation of true subjective probability p, i.e. degree of belief (!?). Binary-choice ("preference") solutions are popular in decision theory. This method works under subjective expected value (linear utility!). Justifiable by de Finetti's (1931) famous book making argument. Too time consuming for us. Rewards for students are somewhat noisy this way.

8  partly know E, to degree r r r  E Reward: if E true if not E true 1 0 Just ask students for "indifference r," i.e. ask: "Which r makes lower row equally good as upper row?" Problems: Why would student give true answer r = p? What at all is the reward (grade?) for the student? Does reward, whatever it is, give an incentive to give true answer? Third solution [introspection] Student chooses number r ("reported probability"). Then:

9 I now promise a perfect way out: de Finetti's dream-incentives. Exactly identifies state of knowledge of each student, no matter what it is. Still multiple choice, taking no more time. Rewards students fairly, with little noise. Best of all worlds. For the full variety of degrees of knowledge. Student can choose reported probability r for E from the [0,1] continuum, as follows.

10 Reward: if E true if not E true Claim: Under "subjective expected value," optimal reported probability r = true subjective probability p. 1 0 r=1 0 1r=0 1 – (1–r) 2 1–r 2 r (don't know!?) r=0.5: : (E = sure!?) : (not-E is sure!?) degree of belief in E :

11 Figure:

Proof of claim. p true probability; r reported probability. Optimize EV = p ( 1 – (1–r) 2 ) + (1–p)(1–r 2 ). 1 st order optimality: 2p(1–r) – 2r ( 1–p ) = 0. r = p! Figure!?  12 Reward: if E true if not E true 1 – (1–r) 2 1–r 2  r: degree of belief in E To help memory:

Easy in algebraic sense. Conceptually: !!! Wow !!! Incentive compatible... Many implications... de Finetti (1962) and Brier (1950) were the first neuro-economists! 13

"Bayesian truth serum" (Prelec, Science, 2005). Superior to elicitations through preferences . Superior to elicitations through indifferences ~ (BDM). Widely used: Hanson (Nature, 2002), Prelec (Science 2005). In accounting (Wright 1988), Bayesian statistics (Savage 1971), business (Stael von Holstein 1972), education (Echternacht 1972), finance (Shiller, Kon-Ya, & Tsutsui 1996), medicine (Spiegelhalter 1986), psychology (Liberman & Tversky 1993; McClelland & Bolger 1994), experimental economics (Nyarko & Schotter 2002). We want to introduce these very nice features into theories for risk and ambiguity. 14

Survey Part I. Deriving reported prob. r from theories:  expected value;  expected utility;  nonexpected utility for probabilities;  nonexpected utility for ambiguity. Part II. Deriving theories from observed r. In particular: Derive beliefs/ambiguity attitudes. Will be surprisingly easy. Proper scoring rules risk & ambiguity: Mutual benefits. Part III. Implementation in an experiment. 15

Part I. Deriving r from Theories (EV, and then 3 deviations). 16 Event E: Blair will keep his promise and will step down before September. We quantitatively measure your subjective belief about this event (subjective probability?), i.e. how much you trust Blair.

Let us assume that you have doubts about Blair's promise (politicians …) Your "true" subj. prob.(E: Blair keeps this promise) = EV: Then your optimal r E =

18 Reported probability R(p) = r E as function of true probability p, under: nonEU 0.69 EU 0.61 r EV EV r nonEU r nonEUA r nonEUA : nonexpected utility for unknown probabilities ("Ambiguity"). (c) nonexpected utility for known probabilities, with U(x) = x 0.5 and with w(p) as common; (b) expected utility with U(x) =  x (EU); (a) expected value (EV); r EU next p. go to p. 21, Example EU go to p. 25, Example nonEU p R(p) go to p. 29, Example nonEUA reward: if E true if not E true EV  r EV =  r EU =  r nonEU =  r nonEUA =

So far we assumed EV (as does everyone using proper scoring rules, but as no-one does in modern risk-ambiguity theories...) Deviation 1 from EV: EU with U nonlinear Now optimize pU ( 1 – (1– r) 2 ) + ( 1 – p )U (1 – r 2 ) r = p need no more be optimal. 19

Theorem. Under expected utility with true probability p, 20 U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–p) p + p r = U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = Reversed (and explicit) expression: A corollary to distinguish from EV: r is nonadditive as soon as U is nonlinear.

How bet on Blair? [ Expected Utility ]. EV: r EV = Expected utility, U(x) =  x: r EU = You now bet less on Blair. Closer to safety (Winkler & Murphy 1970). 21 go to p. 18, with figure of R(p)

Deviation 2 from EV : nonexpected utility for probabilities ( Allais 1953, Machina 1982, Kahneman & Tversky 1979, Quiggin 1982, Schmeidler 1989, Gilboa 1987, Gilboa & Schmeidler 1989, Gul 1991, Luce & Fishburn 1991, Tversky & Kahneman 1992, Birnbaum 2005; survey: Starmer 2000) 22 For two-gain prospects, virtually all those theories are as follows: For r  0.5, nonEU(r) = w(p)U ( 1 – (1–r) 2 ) + ( 1–w(p) ) U(1–r 2 ). r < 0.5, symmetry; soit! Different treatment of highest and lowest outcome: "rank-dependence."

23 p w(p) Figure. The common weighting function w. w(p) = exp(–(–ln(p))  ) for  = w(1/3)  1/3; 1/3 w(2/3) .51 2/3.51

Theorem. Under nonexpected utility with true probability p, 24 U´(1–r 2 ) U´(1 – (1–r) 2 ) ( 1–w(p) ) w(p) + w(p) r = U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = Reversed (explicit) expression: w –1 ( )

How bet on Blair now? [nonEU with probabilities ]. EV: r EV = EU: r EU = Nonexpected utility, U(x) =  x, w(p) = exp(–(–ln(p)) 0.65 ). r nonEU = You bet even less on Blair. Again closer to safety. 25 go to p. 18, with figure of R(p) Deviations were at level of behavior so far, not of be- liefs. Now for something different; more fundamental.

3 rd violation of EV: Ambiguity (unknown probabilities; belief/decision-attitude? Yet to be settled). No objective data on probabilities. How deal with unknown probabilities? Have to give up Bayesian beliefs descriptively. According to some even normatively. 26

27 Instead of additive beliefs p = P(E), nonadditive beliefs B(E) (Dempster&Shafer, Tversky&Koehler; etc.) All currently existing decision models: For r  0.5, nonEU(r) = W (E)U ( 1 – (1–r) 2 ) + ( 1–W(E) ) U(1–r 2 ). I s '92 prospect theory, = Schmeidler (1989). Can always write B(E) = w –1 (W(E)), so W(E) = w(B(E)). Then w(B(E))U ( 1 – (1–r) 2 ) + ( 1–w(B(E)) ) U(1–r 2 ). For binary gambles: Pfanzagl 1959; Luce ('00 Chapter 3); Ghirardato & Marinacci ('01, "biseparable").

28 U´(1–r 2 ) U´(1 – (1–r) 2 ) ( 1–w(B(E)) ) w(B(E)) + w(B(E)) r E = U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r B(E) = Reversed (explicit) expression: w –1 ( ) Theorem. Under nonexpected utility with ambiguity,

How bet on Blair now? [Ambiguity, nonEUA]. r EV = r EU = r nonEU = Similarly, r nonEUA = 0.52 (under plausible assumptions). r's are close to always saying fifty-fifty. "Belief" component B(E) = w –1 (W) = go to p. 18, with figure of R(p)

B(E): ambiguity attitude  /=/  beliefs?? Before entering that debate, first: How measure B(E)? Our contribution: through proper scoring rules with "risk correction." Other proposals for measuring B and W considered in the literature (that we will improve upon): 30

Proposal 1 (common in decision theory): Measure U,W, and w from behavior, and derive B(E) = w –1 (W(E)) from it. Problem: Much and difficult work!!! Proposal 2 (common in decision analysis of the 1960s, and in modern experimental economics): measure canonical probabilities, that is, for E, find event E p with objective probability p such that (E:100) ~ (E p :100) = (p:100). Then B(E) = p. Problem: measuring indifferences is difficult. Proposal 3 (common in proper scoring rules): Calibration … Problem: Need many repeated observations. 31

32 We reconsider reversed explicit expressions: U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r p = w –1 ( ) U´(1–r 2 ) U´(1 – (1–r) 2 ) (1–r) r + r B(E) = w –1 ( ) Corollary. p = B(E) if related to the same r!! Part II. Deriving Theoretical Things from Empirical Observations of r

33 Our proposal takes the best of several worlds! Need not measure U,W, and w. Get "canonical probability" without measuring indifferences (BDM …; Holt 2006). Calibration without needing many repeated observations. Do all that with no more than simple proper- scoring-rule questions.

34 Example (participant 25) stock 20, CSM certificates dealing in sugar and bakery- ingredients. Reported probability: r = For objective probability p=0.70, also reports r = Conclusion: B(elief) of ending in bar is 0.70! We simply measure the R(p) curves, and use their inverses: is risk correction.

35 Mutual benefits prospect theory proper scoring rules We bring insights of modern prospect theory to proper scoring rules. Make them empirically more realistic. (EV is not credible in 2006 …) We bring insights of proper scoring rules to prospect theory. Make B very easy to measure and analyze. Directly implementable empirically. We did so in an experiment, and found plausible results.

Part III. Experimental Test of Our Correction Method 36

Method Participants. N = 93 students. Procedure. Computarized in lab. Groups of 15/16 each. 4 practice questions. 37

38 Stimuli 1. First we did proper scoring rule for unknown probabilities. 72 in total. For each stock two small intervals, and, third, their union. Thus, we test for additivity.

39 Stimuli 2. Known probabilities: Two 10-sided dies thrown. Yield random nr. between 01 and 100. Event E: nr.  75 (p = 3/4 = 15/20) (etc.). Done for all probabilities j/20. Motivating subjects. Real incentives. Two treatments. 1. All-pay. Points paid for all questions. 6 points = €1. Average earning € One-pay (random-lottery system). One question, randomly selected afterwards, played for real. 1 point = €20. Average earning: €15.30.

40 Results

41 Average correction curves.

ρ F(ρ) treatment one treatment all Individual corrections

43

Summary and Conclusion  Modern risk&ambiguity theories: proper scoring rules are heavily biased.  We correct for those biases. Benefits for proper-scoring rule community and for risk- and ambiguity theories.  Experiment: correction improves quality; reduces deviations from ("rational"?) Bayesian beliefs.  Do not remove all deviations from Bayesian beliefs. Beliefs are genuinely nonadditive/ nonBayesian/sensitive-to-ambiguity.  Proper scoring rules: the post-neuroeconomics approach for mind-reading. 44