STA 320: Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.

Slides:



Advertisements
Similar presentations
AP Statistics Exam Review. AP House Rules EVERYONE participates Regardless of whether or not youre taking the exam Actively engaged… in chapter packets,
Advertisements

STAT 101 Dr. Kari Lock Morgan
Inference: Fishers Exact p-values STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke.
Control in Experimental Designs. Control u Key element of experimental and quasi- experimental designs –Subjects (one group or several) on which no variable.
Observational Studies and Propensity Scores STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical.
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Today’s Agenda  Syllabus CS2336: Computer Science II.
Inference: Neyman’s Repeated Sampling STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science.
STAT 101: Data Analysis and Statistical Inference
Assignment Mechanisms STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
1 BA 275 Quantitative Business Methods Housekeeping Introduction to Statistics Elements of Statistical Analysis Concept of Statistical Analysis Exploring.
Specifying a Purpose, Research Questions or Hypothesis
1 Design Approaches to Causal Inference Statistical mediation analysis answers the following question, “How does a researcher use measures of the hypothetical.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Course Introduction Computer Science Department CS141:Computer Programming 1 Fall 2014 Dr. Hamid Al-Hamadi.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Specifying a Purpose, Research Questions or Hypothesis
Weighting STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Subclassification STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Randomized Experiments STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
AREC 514 Cost-Benefit Analysis Instructor: Mark Langworthy.
EVAL 6970: Cost Analysis for Evaluation Dr. Chris L. S. Coryn Nick Saxton Fall 2014.
Bell ringer! On a sheet of your own paper…. – What is Integrated Science???
Essential Synthesis SECTION 4.4, 4.5, ES A, ES B
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
1 BA 275 Quantitative Business Methods Housekeeping Introduction to Statistics Elements of Statistical Analysis Concept of Statistical Analysis Statgraphics.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/4/12 Bayesian Inference SECTION 11.1, 11.2 More probability rules (11.1)
SUTVA, Assignment Mechanism STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
CS 140 Computer Programming (I) Second semester (3 credits) Imam Mohammad bin Saud Islamic University College of Computer Science and Information.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 21.
Is the association causal, or are there alternative explanations? Epidemiology matters: a new introduction to methodological foundations Chapter 8.
Comments on Midterm Comments on HW4 Final Project Regression Example Sensitivity Analysis? Quiz STA 320 Design and Analysis of Causal Studies Dr. Kari.
Bayesian Inference, Review 4/25/12 Frequentist inference Bayesian inference Review The Bayesian Heresy (pdf)pdf Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Please turn off cell phones, pagers, etc.. Welcome to Stat Joe Schafer Associate Professor Department of Statistics and The Methodology Center.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Chapter 10 Finding Relationships Among Variables: Non-Experimental Research.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
Education Intervention in the Clinical Setting for Inappropriate Use of Antibiotics in Children Katie Butterfield.
Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.
PA 221 Wills, Trusts, and Estate Planning Unit One.
Probability 4/18/12 Probability Conditional probability Disjoint events Independent events Section 11.1 (pdf)pdf Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Synthesis and Review for Exam 1.
MAT 540 Entire Course FOR MORE CLASSES VISIT MAT 540 Week 1 DQ 1 Gallup Poll MAT 540 Week 1 DQ 2 Qualitative vs. Quantitative MAT.
Reasoning in Psychology Using Statistics Psychology
Subhorn Khonthapagdee (TA)
Introduction to Statistical Signal Processing
Week 1 Tutorial: Foundation Mathematics for Business Statistics
Explanation of slide: Logos, to show while the audience arrive.
PSY 325 TUTOR Lessons in Excellence -- psy325tutor.com.
PSY 325 TUTOR Education for Service-- psy325tutor.com.
Statistics for the Social Sciences
INTRODUCTION TO INFORMATION SYSTEMS AND TECHNOLOGY (NET 201)
Physics 101: Lecture 29 Exam Results & Review
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Lesson Using Studies Wisely.
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Counterfactual models Time dependent confounding
Chapter 4: Designing Studies
Presentation transcript:

STA 320: Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

Office Hours and Support Dr. Kari Lock Morgan, o Office Hours: M 3-4pm, W 3-4pm, F 1-3pm in Old Chem 216 Dr. Fan Li, o Office Hours: M 10: am, W 2-3pm in Old Chem 122 TA: Wenjing Shi, o Office Hours: tbd in Old Chem 211A Statistics Education Center o Sun, Mon, Tues, Wed, Thurs 4 – 9 pm in Old Chem 211A

Reading Causal Inference for Statistics and Social Sciences o By Guido W. Imbens and Donald B. Rubin o Not yet published o Draft pdf available on Sakai (do not share) Will distribute other relevant papers

Assessment Most weeks will either have a homework due or a quiz One midterm exam: 3/5 Final project Grading o Homework: 20% o Quizzes: 20% o Midterm: 30% o Final project: 30%

Course Website stat.duke.edu/courses/Spring14/sta320.01/ Class materials will be posted here Textbook, relevant papers, etc. will be posted on Sakai

Causality and Potential Outcomes

Causality? What is causality??? In this class we will learn… o a formal framework for causal effects o how to design studies to estimate causal effects o how to analyze data to estimate causal effects

Rubin Causal Model In the class, we’ll use the Rubin Causal Model framework for causal effects Key points: o Causality is tied to an action (intervention) o Causal effect as a comparison of potential outcomes o Assignment mechanism

Causal Question What is the effect of the treatment on the outcome? If the opposite treatment had been received, how would the outcome differ? Example: o treatment: choosing organic produce o outcome: get cancer? (yes or no) o Question: Does choosing organic produce decrease risk of cancer?

Treatment For most of this course, we will consider only two possible treatments: o Active treatment (“treatment”) o Control treatment (“control”) – often just not getting the treatment Two treatments (including the alternative to specified treatment) must be well defined

Causal Questions What is the effect of studying on test scores? Does texting while driving cause accidents? Does exercising in the morning give you more energy during the day? Do students learn better in smaller classes? Did the hook-up happen because alcohol was involved? Come up with your own!

Intervention Causality is tied to an action (or manipulation, treatment, or intervention) “no causation without manipulation” – manipulation need not be performed, but should be theoretically possible Treatments must be plausible as a (perhaps hypothetical) intervention Gender? Age? Race?

Not Clearly Defined Causal Questions Are parents more conservative then their children because they are older? What is the causal effect of majoring in statistical science on future income? Did she get hired because she is female? Can you make these into well-defined causal questions?

Potential Outcomes Key question: what would have happened, under the opposite treatment? A potential outcome is the value of the outcome variable for a given value of the treatment Outcome variable: Y Y(treatment): outcome under treatment Y(control): outcome under control

Potential Outcomes Y(organic) = cancer or no cancer Y(non-organic) = cancer or no cancer Possibilities: Y(organic) = no cancer, Y(non-organic) = cancer Y(organic) = Y(non-organic) = cancer Y(organic) = Y(non-organic) = no cancer Y(organic) = cancer, Y(non-organic) = no cancer Formulate the potential outcomes for your example

Causal Effect The causal effect is the comparison of the potential outcome under treatment to the potential outcome under control For quantitative outcomes, we often take a difference: Causal Effect = Y(treatment) – Y(control)

Causal Effect Possibility 1: Y(organic) = no cancer, Y(non-organic) = cancer Causal effect for this individual: choosing organic produce prevents cancer, which he otherwise would have gotten eating non-organic Possibility 2: Y(organic) = cancer, Y(non-organic) = cancer Causal effect for this individual: he would get cancer regardless, so no causal effect

Causal Effect Y = test score o Y(study) – Y(don’t study) o Example: Y(study) = 90, Y(don’t study) = 60 o Causal effect = 90 – 60 = 30. Studying causes a 30 point gain in test score for this unit. Y = accident (y/n) o Y(texting) vs Y(not texting) Y = energy during the day o Y(morning exercise) – Y(no morning exercise) Y = hook-up (y/n) o Y(alcohol) vs Y(no alcohol)

Units These are unit-level causal effect Unit: the person, place, or thing upon which the treatment will operate, at a particular point in time Note: the same person at different points in time = different units The causal effect will probably not be the same for each unit

Causal Effects The definition of a causal effect does not depend on which treatment is observed Causal effects, in their definition, do not relate to probability distribution for subjects “who got different treatments”, or to coefficients of models Not a before-after comparison. Potential outcomes are a “what-if” comparison. Potential outcomes may depend on other variables (not just the treatment)

Causal Effects Fundamental problem in estimating causal effects: at most one potential outcome observed for each unit The other potential outcome lies in an unobserved counterfactual land… what would have happened, under a different treatment For treated units: Y(treatment) = observed, Y(control) = ??? For control units: Y(treatment) = ???, Y(control) = observed

Estimation For the definition of a causal effect: compare potential outcomes for a single unit However, in reality, we can only see one potential outcome for each unit For estimation of a causal effect we will need to consider multiple units, some who have been exposed to the treatment, and some to the control

Multiple Units Need to compare similar units, some exposed to active treatment, and some to control Can be same units at different points in time, or different units at the same point in time In this class we will focus primarily on comparing different (but similar) units at the same point in time

Summary Causality is tied to an action (treatment) Potential outcomes represent the outcome for each unit under treatment and control A causal effect compares the potential outcome under treatment to the potential outcome under control for each unit In reality, only one potential outcome observed for each unit, so need multiple units to estimate causal effects