When and how can social scientists add value to data scientists?

Slides:



Advertisements
Similar presentations
On the Effects of Immediate Feedback Ido Erev, Adi Luria & Annan Erev Technion Israeli Institute of Technology The Max-Wertheimer Minerva Center for Cognitive.
Advertisements

Paradoxes in Decision Making With a Solution. Lottery 1 $3000 S1 $4000 $0 80% 20% R1 80%20%
“The Plan” From Roth & Erev (1995) to Erev & Barron (2005) Experience-based decisions Empirical data Reinforcement learning among cognitive strategies.
1 Learning and the Economics of Small Decisions Ido Erev, Technion Based on a chapter written with Ernan Haruvy for the 2 nd Vol. of the Handbook of Experimental.
Tversky and Kahnemann: Framing of Decisions
CHAPTER 14 Utility Axioms Paradoxes & Implications.
Prospect Theory, Framing and Behavioral Traps Yuval Shahar M.D., Ph.D. Judgment and Decision Making in Information Systems.
Decision making and economics. Economic theories Economic theories provide normative standards Expected value Expected utility Specialized branches like.
Judgment and Decision Making How Rational Are We?.
Testing Lexicographic Semi- Order Models: Generalizing the Priority Heuristic Michael H. Birnbaum California State University, Fullerton.
Testing Heuristic Models of Risky Decision Making Michael H. Birnbaum California State University, Fullerton.
1 A Brief History of Descriptive Theories of Decision Making Kiel, June 9, 2005 Michael H. Birnbaum California State University, Fullerton.
Some New Approaches to Old Problems: Behavioral Models of Preference Michael H. Birnbaum California State University, Fullerton.
BEE3049 Behaviour, Decisions and Markets Miguel A. Fonseca.
Decision-making II choosing between gambles neural basis of decision-making.
Do we always make the best possible decisions?
Testing Transitivity (and other Properties) Using a True and Error Model Michael H. Birnbaum.
1 A Brief History of Descriptive Theories of Decision Making: Lecture 2: SWU and PT Kiel, June 10, 2005 Michael H. Birnbaum California State University,
I’m not overweight It just needs redistribution Michael H. Birnbaum California State University, Fullerton.
Personality, 9e Jerry M. Burger
1 Efrat Aharonov Majar Michael Sobolev Ori Plonsky Michaela Wänke Yefim Roth Kinneret Teodorescu Eyal Ert Alvin E. Roth On the decision to explore by doctors.
Chapter 5 Research Methods in the Study of Abnormal Behavior Ch 5.
Combining Bayesian Beliefs and Willingness to Bet to Analyze Attitudes towards Uncertainty by Peter P. Wakker, Econ. Dept., Erasmus Univ. Rotterdam (joint.
Chapter 1: Introduction to Statistics
Behavior in the loss domain : an experiment using the probability trade-off consistency condition Olivier L’Haridon GRID, ESTP-ENSAM.
1 On Psychology, Economics, and the Prediction of Human Behavior Ido Erev (Technion) and Ben Greiner (U of New South Wales) The main differences between.
Risk Attitudes of Children and Adults: Choices Over Small and Large Probability Gains and Losses WILLIAM T. HARBAUGH University of Oregon KATE KRAUSE University.
Decision making Making decisions Optimal decisions Violations of rationality.
Agata Michalaszek Warsaw School of Social Psychology Information search patterns in risk judgment and in risky choices.
Why do rational people vote in large elections with costs to vote ? Serge Blondel (GRANEM Univ. Angers & INH, CES) Louis Lévy-Garboua ( CES University.
Thinking and Decision Making
Types of Research (Quantitative and Qualitative) RCS /11/05.
Chapter 2 Research in Abnormal Psychology. Slide 2 Research in Abnormal Psychology  Clinical researchers face certain challenges that make their investigations.
Sequential Expected Utility Theory: Sequential Sampling in Economic Decision Making under Risk Andrea Isoni Andrea Isoni (Warwick) Graham Loomes Graham.
Prospect Theory. 23A i 23B, reference point 23A) Your country is plagued with an outbreak of an exotic Asian disease, which may kill 600 people. You.
Stochastic choice under risk Pavlo Blavatskyy June 24, 2006.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 3: The Foundations of Research 1.
RISK, AMBIGUITY, GAINS, LOSSES Sujoy Chakravarty Department of Humanities and Social Sciences Indian Institute of Technology, Hauz Khas, New Delhi ,
Experiments on Risk Taking and Evaluation Periods Misread as Evidence of Myopic Loss Aversion Ganna Pogrebna June 30, 2007 Experiments on Risk Taking and.
A Stochastic Expected Utility Theory Pavlo R. Blavatskyy June 2007.
Ellsberg’s paradoxes: Problems for rank- dependent utility explanations Cherng-Horng Lan & Nigel Harvey Department of Psychology University College London.
How Could The Expected Utility Model Be So Wrong?
On rare events and the economics of small decisions Ido Erev, Technion Examples: Using safety devices, cheating in exams, selecting among websites, stopping.
1 Behavioral approaches to optimal FDI incentives Authors: Mosi Rosenboim- Ben Gurion University and Sapir Collage Israel Luski- Ben Gurion University.
Expected Value, Expected Utility & the Allais and Ellsberg Paradoxes
Allais Paradox, Ellsberg Paradox, and the Common Consequence Principle Then: Introduction to Prospect Theory Psychology 466: Judgment & Decision Making.
The preference reversal with a single lottery: A Paradox to Regret Theory Serge Blondel (INH Angers & CES Paris 1) Louis Lévy-Garboua (CES Paris 1) ESA.
1 BAMS 517 – 2011 Decision Analysis -IV Utility Failures and Prospect Theory Martin L. Puterman UBC Sauder School of Business Winter Term
마스터 제목 스타일 편집 마스터 텍스트 스타일을 편집합니다 둘째 수준 셋째 수준 넷째 수준 다섯째 수준 The Framing of Decisions and the Psychology of Choice - Amos Tversky and Daniel Kahneman.
Experiments and “Rational” Behavior, 5/1/07. Beauty Contest Game Each person choose a number from 0 to 100. We will average these numbers. The person.
Small Decision-Making under Uncertainty and Risk Takemi Fujikawa University of Western Sydney, Australia Agenda: Introduction Experimental Design Experiment.
The Representativeness Heuristic then: Risk Attitude and Framing Effects Psychology 355: Cognitive Psychology Instructor: John Miyamoto 6/1/2016: Lecture.
Behavioral Issues in Multiple Criteria Decision Making Jyrki Wallenius, Aalto University School of Business Summer School on Behavioral Operational Research:
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
Copyright © 2009 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Psychology and Personal Finance
Economics 430/530 EXPERIMENTAL ECONOMICS Spring 2012
The Allais Paradox & the Ellsberg Paradox
1st: Representativeness Heuristic and Conjunction Errors 2nd: Risk Attitude and Framing Effects Psychology 355:
These slides are preview slides
Behavioral Finance Economics 437.
CSc4730/6730 Scientific Visualization
Empirical Tools of Public Finance
Task Force on Victimization Eurostat, October 2011 Guillaume Osier
By the completion of this presentation, the participant will be able to:
Types of Research (Quantitative and Qualitative)
Statistical Data Analysis
Chapter 15 Decisions under Risk and Uncertainty
Experimental Condition
Prospect Theory.
Presentation transcript:

When and how can social scientists add value to data scientists? Moshe Tennenholtz Technion Eyal Ert Hebrew U Ido Erev & WBS Reut Apel Ori Plonsky Duke U The current research tries to facilitate synergy between research in behavioral economics and machine learning. The first part highlights a difficulty (the classical behavioral studies cannot be easily used to derive effective features for predictions), and presents a choice prediction competition (CPC15) designed to address this problem. The second part challenges you to participate in a new choice prediction competition (CPC18) designed to clarify the results of the first competition. The submission deadline is May 8, 2018.

Part 1: The difficulty and the 2015 competition (CPC15, Erev et al Classical research in behavioral economics focuses on clarifying deviations from rational choice. The phenomena highlighted by this research include: The St. Petersburg paradox (Bernoulli, 1734) Buying both lotteries and insurance (Friedman and Savage, 1948) The Allais paradox (Allais, 1953) Satisfycing (Simon, 1955) The Ellsberg paradox (Ellsberg, 1961) Loss aversion (Kahneman & Tversky, 1979) The break even effect (Thaler & Johnson, 1990) Underweighting of rare events (Barron & Erev, 2013) Each of these phenomena was captured with an elegant model. However, the relationship between the different models is not clear. Al Roth clarified this problem with the 1-800 critique.

Our initial effort to facilitate the development of general behavioral models involves three main steps: 1. A clarification of the relationship between the different choice phenomena (anomalies) by replicating them in a single experimental paradigm. Our results show that the classical description anomalies are replicable, but experience reverses or eliminates most of them. 2. A demonstration that it is possible to reproduce the classical anomalies with a single model. 3. An organization of a choice prediction competition in which we challenged other researchers to develop a better model. The models were compared based on their prediction of a new set of 60 randomly selected choice problems.

Our analysis focused on an 11 dimensions space of choice task. Each problem in our space is a choice between two basic prospects: Option A: HA with probability pHA; or LA otherwise. Option B: Up to 10 outcomes, defined by 5 parameters A 9th parameter, Corr, captures the correlation between the prospects The 10th parameter, Amb, captures ambiguity The 11th parameter FB captures the DMs’ feedback. This parameter was studied within problem. The DMs faced each problem for 25 trials, and got feedback after each choice from the 6th trial.

Example of a basic experimental task, trial 1, initial screen Please select one of the following options: B: 4 with p = 0.8 0 with p = 0.2 A: 3 with certainty B A

Example of a basic experimental task, trial 1, limited feedback 4 with p = 0.8 0 with p = 0.2 A: 3 with certainty B A You selected B

Example of a basic experimental task, trial 6, initial screen Please select one of the following options: B: 4 with p = 0.8 0 with p = 0.2 A: 3 with certainty B A

Example of a basic experimental task, trial 6, feedback screen 4 with p = 0.8 0 with p = 0.2 A: 3 with certainty B 4 A 3 You selected B, your payoff is 4 Had you selected A your payoff would be 3

Example of an ambiguous task, trial 1, initial screen Please select one of the following options: B: 10 with p = q1 0 with p = q2 A: 10 with p =0.5 0 with p = 0.5 B A

Two reasons for the focus on choice among gambles: The best studied topic in behavioral economics. The study of gambles captures human reaction to incentives, and this reaction is one of the challenges for machine learning tools. For example, designers of autonomous vehicles should worry about the possibility that after the development of cars that stops when pedestrian step on the road, pedestrian will learn to use this behavior more often.

The Allais (common ratio) paradox/certainty effect (Allais, 1953, K&T, 1979) 5, with FB 1, No FB Problem 1 Block (58%) 3 with certainty A 42% 4 with p= .8, 0 otherwise B Problem 2 Block (39%) 3 with p =.25, 0 otherwise A’ 61% 4 with p= .2, 0 otherwise B’ Certainty effect from description.

The Allais (common ratio) paradox/certainty effect (Allais, 1953, K&T, 1979) 5, with FB 1, No FB Problem 1 Block (58%) 3 with certainty A 65% 42% 4 with p= .8, 0 otherwise B Problem 2 Block (39%) 3 with p =.25, 0 otherwise A’ 62% 61% 4 with p= .2, 0 otherwise B’ Certainty effect from description. The addition of feedback increases maximization and eliminates the paradox

The reflection effect (kahneman & Tversky, 1979; Simon, 1955) 5 (with FB) 1 (No FB) Block 3 with certainty A 42% 4 with p= .8, 0 otherwise B -3 with certainty A 49% -4 with p= .8, 0 otherwise B

The reflection effect (kahneman & Tversky, 1979; Simon, 1955) 5 (with FB) 1 (No FB) Block 3 with certainty A 65% 42% 4 with p= .8, 0 otherwise B -3 with certainty A 36% 49% -4 with p= .8, 0 otherwise B Risk aversion in the gain and weak risk seeking in the loss domain, feedback eliminates this pattern and increase maximization

Insurance, lotteries, Over and under-weighting of rare events (Savage & Friedman, 1948; Kahneman & Tversky, 1979; Barron & Erev, 2003) 5 (with FB) 1 (No FB) Block 2 with certainty A 55% 101 with p= .01, 1 otherwise B Buy lottery Buy insurance (52%) -1 with certainty A 48% -20 with p= .05, 0 otherwise B Some overweighting of rare events before feedback

Insurance, lotteries, Over and under-weighting of rare events (Savage & Friedman, 1948; Kahneman & Tversky, 1979; Barron & Erev, 2003) 5 (with FB) 1 (No FB) Block 2 with certainty A 42% 55% 101 with p= .01, 1 otherwise B Buy lottery Buy insurance (52%) -1 with certainty A 63% 48% -20 with p= .05, 0 otherwise B Some overweighting of rare events before feedback, and robust underweighting with feedback

Loss aversion 5 (with FB) 1 (No FB) Block 0 with certainty A 34% +50 with p = 0.5; -50 otherwise (EV = 0) B 13 with certainty A 35% 50 with p= .6; -45 otherwise (EV = 12) B

Loss aversion 5 (with FB) 1 (No FB) Block 0 with certainty A 38% 34% +50 with p = 0.5; -50 otherwise (EV = 0) B 13 with certainty A 50% 35% 50 with p= .6; -45 otherwise (EV = 12) B P(B)

The St. Petersburg “paradox” (after Bernoulli, 1738) 5 (with FB) 1 (No FB) Block 9 with certainty A 36% 38% 2 with p=.5 4 with p=.25 8 with p=.125 16 with p=.0625 32 with p=.03125 64 with p=.0015625 128 with p=.00078125 256 with p=.00078125 B Risk aversion.

The Ellsberg paradox (Ellsberg, 1961) 5, with FB 1, No FB Block 10 with p= 0.5; 0 otherwise A 51% 37% 10 with p = q1; 0 otherwise (q1 = 0.5) B Initial ambiguity aversion, and experience eliminates this effect

Regret (Loomes & Sugden, 1982) and correlation (Diederich & Busemeyer, 1999) effects. P(E) = 0.5 5 (with FB) 1 (No FB) Block 6 if E, 0 otherwise A 99% 97% 8 if E, 0 otherwise B 6 if E, 0 otherwise A 85% 96% 9 if not-E, 0 otherwise B Weak sensitivity for regret/correlation without feedback, feedback increases the regret correlation/effect.

Summary of Study 1 Our results show that the classical description anomalies are replicable, but experience reverses or eliminates all of them, but risk aversion in the St. Petersburg problem. Experience, in all cases, decreased the impact or rare events (only in St. Petersburg problem the original deviation from maximization reflect underweighting of rare event). Similar results were observed in Study 2 (that examined 60 randomly selected problems)

BEAST (Best Estimate And Sampling Tools) the best model that we could find assumes very different processes than assumed by the leading models. It does not assume subjective weighting of subjective values (like prospect theory), and does not assume cognitive shortcuts (like the priority Heuristic) Rather it assumes the approximation of the EV plus four extra processes that involve sampling from memory using the following tools: Pessimism (sample the worst outcome) Uniform (all outcomes are equally likely) Sign (implies high sensitivity to the payoff sign). Unbiased (implies minimizing probability of regret) Feedback increases the probability of the unbiased sampling, but the samples stay small. Reliance on small samples implies a bias toward the option that minimizes the probability of regret, and underweighting of rare events  

The CPC15 competition http://departments.agri.huji.ac.il/economics/teachers/ert_eyal/competition.htm On December 2014 we posted the results of Study 1 and 2 (90 choice problems) on the web, and challenged decision scientists to participate in a competition to predict the results of study 3. We offered BEAST and challenged the participants to offer BEAUTY Study 3 was run on April, 2015. The submission deadline was May 17 2015.

Competition participants 53 registered teams 25 submissions from 5 continents Three classes of submissions: 4 Subjective functions models (CPT-like) 15 BEAST-like (EV plus sampling tools) 6 Machine learning models Main results: All 12 top models were variants of BEAST. The difference between these models were statistically insignificant. The winner is Cohen’s BEAST. The best machine learning model, submitted By Noti, Levi, Kolumbus, & Daniely, was almost as good as the BEASTs

The new competitions: CPC18 (Plonsky, Apel, Erev, Ert, & Tennenholtz, in prep) Three explanations of the fact that the machine learning (ML) submissions did not win CPC15: 1.The ML tools were designed to predict future behavior in a familiar setting (like the Netflix recommending system) . They do not have advantage over traditional models in predicting behavior in new environments. 2. The ML submissions used ineffective features. With more careful features selection, they could have won the competition (Plonsky, Erev, Hazan & Tennenholtz, AAAI17). 3. The ML submissions used suboptimal tools.

In order to compare these hypotheses, we invite you to participate in two related competitions. The first, CPC18a, will be extension of CPC15: The same goal (predicting the aggregate choice rate in new problem), with a larger space of choice task. In CPC18a both prospects will have up to 10 outcomes. The second. CPC18i will focus on prediction the behavior of specific individuals in 5 specific choice problems, after observing their behavior in 25 other problems. This task is similar to the best known application of ML (like the Netflix model), and we hope to learn if how behavioral features can help in this setting. We accept submissions in R, MATLAB, Python and SAS

The description is on the web https://cpc18.wordpress.com/  It includes the best models that we could find as baselines. Participants are allowed to use these baselines are starting points. Both baselines for the CPC18a (predicting the aggregate) are variants of BEAST. BEAST.sd is a refinement of BEAST that assumes subjective detection of dominance (the error rate decreases when the same option maximizes EV, and minimizes Probability of regret). Psych Forest is a random forest based model that uses BEAST as one of the features. The best baselines we found for CPC18i (predicting individual choice) are “collaborative flittering” models and use shallower psychology

Summary Classical research in behavioral economics uses different elegant models to capture different anomalies. As a result, it is not easy to use this research to predict behavior. CPC15 shows that it is not difficult to develop more general behavioral models that allow useful predictions. Interestingly, the processes assumed by the winning models are very different than the processes assumed by the classical models. Rather than assuming the weighting of subjective values, or the use of simple heuristic, the winners assume high sensitivity to expected value and four sampling tools. Theory free ML tools provide much better predictions than the classical behavioral models, but they do not outperform the competition winners. We hope that our next competition will clarify the best way to combine behavioral insights with ML tools.