The Question of Causation

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

The Question of Causation YMS3e 4.3:Establishing Causation AP Statistics Mr. Molesky.
Chapter 4 Review: More About Relationship Between Two Variables
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Aim: How do we establish causation?
AP Statistics Section 4.3 Establishing Causation
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
More on Two-Variable Data. Chapter Objectives Identify settings in which a transformation might be necessary in order to achieve linearity. Use transformations.
Lecture 18 Simple Linear Regression (Chapters )
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Chapter 4 Section 3 Establishing Causation
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Relationships Can Be Deceiving Statistics lecture 5.
Chapter 4 More on Two-Variable Data YMS 4.1 Transforming Relationships.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Cautions About Correlation and Regression Section 4.2.
The Question of Causation
Scatterplots and Correlation Section 3.1 Part 2 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Section Causation AP Statistics ww.toddfadoir.com/apstats.
Chapter 8 (3-4), 9 More about Correlation. Today’s Lecture l SD Line l Calculating r l correlation vs causation.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
Two-Variable Data Analysis
The Question of Causation 4.2:Establishing Causation AP Statistics.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Sit in your permanent seat
Sit in your permanent seat
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
Unit 5: Hypothesis Testing
Unit 1: Science of Psychology
Day 46 & 47 – Cause and Effect.
Cautions About Correlation and Regression
Examining Relationships Least-Squares Regression & Cautions about Correlation and Regression PSBE Chapters 2.3 and 2.4 © 2011 W. H. Freeman and Company.
Association vs. Causation
Proving Causation Why do you think it was me?!.
Collecting Data: Observational Studies
Establishing Causation
Cautions about Correlation and Regression
7.3 Best-Fit Lines and Prediction
Section 4.3 Types of Association
Chapter 2 Looking at Data— Relationships
Register for AP Exams --- now there’s a $10 late fee per exam
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
a= b= WARM - UP Variable Coef StDev T P
DRILL Put these correlations in order from strongest to weakest.
7.3 Best-Fit Lines and Prediction
Cautions about Correlation and Regression
Day 47 – Cause and Effect.
Scatterplots and Correlation
The Language of Studies
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Least-Squares Regression
EQ: What gets in the way of a good model?
4.2 Cautions about Correlation and Regression
Correlation/regression using averages
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Section 6.2 Establishing Causation
Correlation and Causality
Chapter 4: More on Two-Variable Data
Correlation/regression using averages
4.2: Experiments.
Presentation transcript:

The Question of Causation 4.2:Establishing Causation AP Statistics

Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming that an observed correlation is due to causation, you must put any statement of relationship through sharp inspection. Causation can not be established “after the fact.” It can only be established through well-designed experiments. {see Ch 5}

Explaining Association Strong Associations can generally be explained by one of three relationships. Causation: x causes y Common Response: x and y are reacting to a lurking variable z Confounding: x may cause y, but y may instead be caused by a confounding variable z

Causation Causation is not easily established. The best evidence for causation comes from experiements that change x while holding all other factors fixed. Even when direct causation is present, it is rarely a complete explanation of an association between two variables. Even well established causal relations may not generalize to other settings.

Common Response “Beware the Lurking Variable” The observed association between two variables may be due to a third variable. Both x and y may be changing in response to changes in z. Consider the “Presbyterian Minister and Barrels of Rum” example...do ministers actually cause people to drink more rum?

Confounding Two variables are confounded when their effects on a response variable cannot be distinguished from each other. Confounding prevents us from drawing conclusions about causation. We can help reduce the chances of confounding by designing a well-controlled experiment.

Exercises For Exercises 4.41 – 4.47, carry out the instructions. Then state whether the relationship between the two variables involves causation, common response, or confounding. Identify possible lurking variable(s). Draw a diagram of the relationship in which each circle represents a variable. Write a brief description of the variable by each circle.

Exercises 4.41: There is a high positive correlation: nations with many TV sets have higher life expectancies. Could we lengthen the life of people in Rwanda by shipping them TVs? 4.42: People who use artificial sweeteners in place of sugar tend to be heavier than people who use sugar. Does artificial sweetener use cause weight gain? 4.43: Women who work in the production of computer chips have abnormally high numbers of miscarriages. The union claimed chemicals cause the miscarriages. Another explanation may be the fact these workers spend a lot of time on their feet.

Exercises-con’t 4.44: People with two cars tend to live longer than people who own only one car. Owning three cars is even better, and so on. What might explain the association? 4.45: Children who watch many hours of TV get lower grades on average than those who watch less TV. Why does this fact not show that watching TV causes low grades?

Exercises-con’t 4.46: Data show that married men (and men who are divorced or widowed) earn more than men who have never been married. If you want to make more money, should you get married? 4.47: High school students who take the SAT, enroll in an SAT coaching course, and take the SAT again raise their mathematics score from an average of 521 to 561. Can this increase be attributed entirely to taking the course?

Half Sheet Quiz! A study of elementary school children, ages 6-11, finds a high positive correlation between shoe size x and score y on a test of reading comprehension. Then, in your own words, explain the meaning of causation, common response, and confounding

Other Concerns Extrapolation: The use of a regression line or curve for a prediction outside the domain of the explanatory variable. These predictions cannot be trusted Averaged Data: Many regression or correlation studies work with averages or other measures that combine information from many individuals These results cannot be applied to individuals - they can only be applied to the averaged units One of the advantages of averaging data is that it takes care of some lurking variables Correlations based on averages are typically too high when applied to individuals

Correlation does NOT imply causation! REMEMBER! Correlation does NOT imply causation!