The Cycling of a Decision Threshold: A System Dynamics Model of the Taylor Russell Diagram Elise Axelrad Weaver, Ph.D. and George Richardson, Ph.D. Center.

Slides:



Advertisements
Similar presentations
Group 5: Historical control data Follow-up of 2005 IWGT where the use of historical control data in interpretation of in vitro results was identified as.
Advertisements

Presentation By: Lauree Francis
Chapter 8 Flashcards.
L5: Dynamic Portfolio Management1 Lecture 5: Dynamic Portfolio Management The following topics will be covered: Will risks be washed out over time? Solve.
Chapter Ten Making Decisions. Chapter Ten Making Decisions.
Risk, Uncertainty, and Sensitivity Analysis How economics can help understand, analyze, and cope with limited information.
1 COMM 301: Empirical Research in Communication Kwan M Lee Lect4_1.
Scenario Discovery Robert Lempert Director RAND Pardee Center on Longer-Range Global Policy and the Future Human Condition CEDM Project Meeting May 21,
Chapter 10 Decision Making © 2013 by Nelson Education.
Decision Errors and Power
Causal Loop Diagrams Esmaeil Khedmati Morasae Center for Community-Based Participatory Research in Health Tehran University of Medical Sciences January.
Karen L. Mapp, Ed.D. Deputy Superintendent, Boston Public Schools
Assessing a New Approach to Class-Based Affirmative Action Matthew N. Gaertner University of Colorado at Boulder.
The Modeling Process Esmaeil Khedmati Morasae Center for Community-Based Participatory Research in Health Tehran University of Medical Sciences Winter.
Consensus problem Agreement. All processes that decide choose the same value. Termination. All non-faulty processes eventually decide. Validity. The common.
SCREENING FOR DISEASE Nigel Paneth. THREE KEY MEASURES OF VALIDITY 1.SENSITIVITY 2.SPECIFICITY 3.PREDICTIVE VALUE.
Scatterplots, Association, and Correlation 60 min.
Public Budget As Decision- Making Process  Decision - Making Models:  Incremental Change Model  Satisfying Model  Ideal Rational Model  Stages of.
Should policy be active or passive?
Assessing firm risk September 2012 Andrew Garbutt Director of Risk, SRA.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Duality of error Thomas R. Stewart, Ph.D. Center for Policy Research Rockefeller College of Public Affairs and Policy University at Albany State University.
Planning for Inquiry The Learning Cycle. What do I want the students to know and understand? Take a few minutes to observe the system to be studied. What.
International Finance Chapter 5 Part 2: Forecasting Exchange Rates.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 4: Modeling Decision Processes Decision Support Systems in the.
Engineering Economics in Canada
Faculty Council Student Enrollment Dr. Bernadette Gray-Little, Executive Vice Chancellor and Provost September 15, 2006 The University of North Carolina.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Is scientific knowledge useful for decision making? CRICS 5 La Habana, April 2001.
Hypothesis Testing. Outline The Null Hypothesis The Null Hypothesis Type I and Type II Error Type I and Type II Error Using Statistics to test the Null.
Biofuels, Energy Security, and Future Policy Alternatives Wally Tyner Purdue University.
Andrew Howard Nichols, Ph.D. Senior Research Analyst The Pell Institute Student Financial.
Chapter 15 – Elaborating Bivariate Tables
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Analytical Techniques of Political Science Clip art.
Copyright ©2011 Pearson Education
A Discipline-Based Diversity Model George C. Johnson University of California, Berkeley Numbers Infrastructure Diversity Officers AGEP in Relation to Infrastructure.
Engineering Economic Analysis Canadian Edition
{ Principal Leadership Evaluation. The VAL-ED Vision… The construction of valid, reliable, unbiased, accurate, and useful reporting of results Summative.
CONFIDENTIAL AND PROPRIETARY. Permission to reprint or distribute any content from this presentation requires the written approval of Standard & Poor’s.
DEGREE AND ENROLLMENT PLANNING UPDATE Board of Governors Strategic Planning Committee November 17, 2005.
Chapter 5 Choice Under Uncertainty. Chapter 5Slide 2 Topics to be Discussed Describing Risk Preferences Toward Risk Reducing Risk The Demand for Risky.
Copernicus Institute Universiteit Utrecht Taking uncertainty on board in decision making The example of adaptation to climate change.
An overview of multi-criteria analysis techniques The main role of the techniques is to deal with the difficulties that human decision-makers have been.
Voluntary Disclosure Not Covered in Textbook. You’re on a job interview and the interviewer knows what the distribution of GPAs are for MBA students at.
© Farhan Mir 2007 IMS Management Thoughts & Practices MBA & BBA Lecture 6 (Decision Making the Essence of Managerial Job) By: Farhan Mir.
1 Lesson 4 Attitudes. 2 Lesson Outline   Last class, the self and its presentation  What are attitudes?  Where do attitudes come from  How are they.
1 Scenarios and More California Water Plan Advisory Committee Meeting April 14 th, 2005.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
1 THE DESIGN OF INTELLECTUAL MOVEMENTS Stuart Umpleby The George Washington University Washington, DC.
David Mitchell, Professional Services Manager Baseline Risk and Vulnerabilities Indicator Tool.
MAP the Way to Success in Math: A Hybridization of Tutoring and SI Support Evin Deschamps Northern Arizona University Student Learning Centers.
Lecture 16: Money and the Price Level I L11200 Introduction to Macroeconomics 2009/10 Reading: Barro Ch.10 2 March 2010.
SD modeling process One drawback of using a computer to simulate systems is that the computer will always do exactly what you tell it to do. (Garbage in.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
SQO 4/7/05 INCORPORATING MULTIPLE LINES OF EVIDENCE INTO SEDIMENT QUALITY OBJECTIVES Stephen B. Weisberg Southern California Coastal Water Research Project.
How to measure the impact of R&D on SD ? Laurence Esterle, MD, PhD Cermes and Ifris France Cyprus, 16 – 17 October L. ESTERLE Linking science and.
Structural Vulnerability Assessments An example of theory-informed, data- driven research to support for the formulation of actionable policy options March.
Hypothesis Testing.
Conceptual issues in legislative bodies
Colleen Carroll University of Nevada, Las Vegas
Propensity Score Adjustments for Internet Survey of Voting Behavior:
ACT/SAT Concordance What You Need to Know Speakers:
PRINCIPLES OF WRITING AND CLASSIFICATION OF QUESTIONS
Pull 2 samples of 10 pennies and record both averages (2 dots).
Beginning the conversation
MELD Exception Scores During NLRB Transition
Associate Vice President
Presentation transcript:

The Cycling of a Decision Threshold: A System Dynamics Model of the Taylor Russell Diagram Elise Axelrad Weaver, Ph.D. and George Richardson, Ph.D. Center for Policy Research April 20, 2001

The Cycling of Decision Thresholds In his book, Hammond (1996) presents the following ideas: Any decision threshold based on a statistically uncertain measure will inevitably yield some error and injustice in policy outcomes (a duality of error: false positives and false negatives) Oscillations in public and professional attitudes (with implicit policy thresholds) exist: Schlesinger’s (1986) proposal of “regular oscillations” in the dominance of political parties Oscillations between cautious conservatism and risky innovation in bridge design, as “the accumulation of successful experience” makes designers bold until a “collosal failure” takes everyone by surprise Cycles tend to be around 30 years long, across decision domains Hammond suggests that oscillations may be a result of the duality of error

Hammond’s exhortation: “For if such oscillations can be shown to exist, and if they can be shown to have a definite period...then we have at hand not only a means for predicting our future political climate far in advance, but an important phenomenon that strongly invites, indeed, demands, analysis and interpretation.” We propose a system dynamics model to represent and explore Hammond’s idea in a rigorous way

Another Example of a Reversal in Policy Formation Use of SAT testing in admissions 1967 University of California began using SAT scores in admissions decisions 1999 University of California faculty voted their preference to exclude SAT scores from admissions decisions 2001 Richard Atkinson, University of California President, publicly advocates removing SATs from admissions.

Example of a Decision Threshold for Policy Formation Illustration: Students are admitted to an academic program partially according to SAT score set at a given threshold The SAT score is used to predict academic success The GPA at graduation is the measure of true academic success

Taylor Russell Diagram r =.5, cutoff = 50 Positive on testNegative on test Graduates Non- Graduates Decision Threshold False + False -

Taylor Russell Diagram r =.5, cutoff = 80 Positive on testNegative on test Decision Threshold False + False - Graduates Non- Graduates

Taylor Russell Diagram r =.5, cutoff = 20 Positive on testNegative on test Decision Threshold False + False - Graduates Non- Graduates

Taylor Russell Diagram Highly Certain Test (r =.99) Positive on testNegative on test Decision Threshold False + False - Graduates

They want to reduce the number of individuals for whom the decision to accept was falsely negative: high potential for success but unacceptable SAT scores Stakeholders in the Duality of Error Constituency Concerned with Unfair Disadvantage: Constituency Concerned with Maintaining Standards: They want to reduce the number of individuals for whom the decision to accept was falsely positive: low potential for success but acceptable SAT scores

Duality of Error Due to the Decision Threshold Decision Threshold False + False -

Decision Threshold as a Stock (accumulating increases and decreases)

Decision Threshold Responding to Stakeholder Pressure Disadvantaged Constituency (DC) High Standards Constituency (HSC)

Cycling of Policy Threshold: Historic Discontent Disadvantaged Constituency (DC) High Standards Constituency (HSC)

Cycling of Policy Threshold: Key Parameters For both constituencies: Tolerated Number of False Cases Relative Weight on History Time to Respond to Pressure Threshold Change per Unit Pressure Disadvantaged Constituency (DC) High Standards Constituency (HSC)

Cycling of Policy Threshold: More Key Parameters Time Constants (Forgetting, Retention) Initial value of Threshold Lookup functions (correlation) Dissatisfaction per Error

Unexplored Parameters Lookup Function –translation of threshold choice to number of errors –involves correlation, thresholds for judgment and success –currently set at r = 0.7, variable threshold for judgment, and threshold for success fixed. Pace of Error Generation and Error Detection –Baseline model assumes that in every month, there is an assessment of number of errors due to threshold choice Threshold Associated with “Success” –Baseline model assumes a fixed threshold of “success”

Graph of Cycling of Policy Threshold SAT Score Baseline Case

Threshold Change per Unit Pressure Time to Respond to Pressure - represents policy makers’ responsiveness to constituent pressure: more responsiveness means wider amplitude and very slightly lower frequency

Initial Threshold Value - represents policy makers’ initial setting for decision threshold: no long term effect

Tolerated Number of False Cases - represents constituents’ sensitivity to errors not a key difference in the long run for either frequency or amplitude

Time Constant for Forgetting - represents time it takes for an error to dissipate from constituent memory more time to forget lowers frequency and raises amplitude

Relative Weight of History - represents constituents’ memory or weighting for accumulated past errors relative to current error more weight on history means lower frequency and higher amplitude of cycling; no weight on history means no cycling at all.

Different Weight on History for Each Constituency - represents differential weight on history by constituents - if one group puts more weight on history, it doesn’t change the center of the cycling, but it does reduce the amplitude.

Key Parameter Summary Parameters not affecting frequency: initial decision threshold policy maker sensitivity: increases amplitude,but doesn’t affect frequency much in the long run tolerated number of errors by constituents: does not have much effect on frequency or amplitude Parameters affecting frequency relative weight on history & time to forget: longer memories lower frequency and raise amplitude.

Next Steps Ground the conceptual model in data from one or more institutions (especially look for cases where false negatives could be detected) Incorporate the effects of correlations between the judgment and success measures, time to generate and detect errors, and the threshold for success Explore and operationalize the conversion variables (from errors to stakeholder pressure to changes in threshold) Consider whether there is any impact of true negatives and true positives on cycling Explore limit cycle and open loop structure of model

Summary A systems dynamics model is under development. According to Hammond (1996), any uncertain test where a threshold is used as a policy decision tool leads to unavoidable injustice to some constituency. The pressure on the decision threshold from stakeholders representing the false positives and the false negatives will oppose. These opposing pressures will cause a cycling of the decision threshold over time. A key parameter affecting frequency and amplitude of cycling is constituent weighting of past errors.