The Question of Causation 4.2:Establishing Causation AP Statistics.

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

The Question of Causation YMS3e 4.3:Establishing Causation AP Statistics Mr. Molesky.
Chapter 4 Review: More About Relationship Between Two Variables
O BSERVATIONAL S TUDY VS. E XPERIMENT. O BSERVATIONAL S TUDY The researcher observes individuals and measures variables of interest without influencing.
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Aim: How do we establish causation?
AP Statistics Section 4.3 Establishing Causation
MA-250 Probability and Statistics
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
More on Two-Variable Data. Chapter Objectives Identify settings in which a transformation might be necessary in order to achieve linearity. Use transformations.
Lecture 18 Simple Linear Regression (Chapters )
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Chapter 4 Section 3 Establishing Causation
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Warm-Up Aug. 28th Grab a bias worksheet off the stool and begin reading the scenarios. Also make sure to grab other materials (workbook, spiral.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Relationships Can Be Deceiving Statistics lecture 5.
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
 Get out your homework and materials for notes!  Take-home quiz due!
CHAPTER 9: Producing Data: Experiments. Chapter 9 Concepts 2  Observation vs. Experiment  Subjects, Factors, Treatments  How to Experiment Badly 
Chapter 4 More on Two-Variable Data YMS 4.1 Transforming Relationships.
Alternatively, dependent variable and independent variable. Alternatively, endogenous variable and exogenous variable.
Get out your Residuals Worksheet! You will be able to distinguish between correlation and causation. Today’s Objectives:
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Cautions About Correlation and Regression Section 4.2.
The Question of Causation
Scatterplots and Correlation Section 3.1 Part 2 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Section Causation AP Statistics ww.toddfadoir.com/apstats.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
Two-Variable Data Analysis
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
 Understand how to determine a data point is influential  Understand the difference between Extrapolation and Interpolation  Understand that lurking.
Sit in your permanent seat
Sit in your permanent seat
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
Unit 5: Hypothesis Testing
Day 46 & 47 – Cause and Effect.
Cautions About Correlation and Regression
Examining Relationships Least-Squares Regression & Cautions about Correlation and Regression PSBE Chapters 2.3 and 2.4 © 2011 W. H. Freeman and Company.
Proving Causation Why do you think it was me?!.
Establishing Causation
Cautions about Correlation and Regression
Section 4.3 Types of Association
Which of the following would be necessary to establish a cause-and- effect relationship between two variables? Strong association between the variables.
Chapter 2 Looking at Data— Relationships
Register for AP Exams --- now there’s a $10 late fee per exam
Cautions about Correlation and Regression
The Question of Causation
Day 47 – Cause and Effect.
Scatterplots and Correlation
Least-Squares Regression
EQ: What gets in the way of a good model?
4.2 Cautions about Correlation and Regression
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Section 6.2 Establishing Causation
Chapter 4: More on Two-Variable Data
Correlation/regression using averages
Presentation transcript:

The Question of Causation 4.2:Establishing Causation AP Statistics

Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming that an observed correlation is due to causation, you must put any statement of relationship through sharp inspection. Causation can not be established “after the fact.” It can only be established through well-designed experiments. {see Ch 5}

Explaining Association Strong Associations can generally be explained by one of three relationships. Confounding Confounding: x may cause y, but y may instead be caused by a confounding variable z CommonResponse Common Response: x and y are reacting to a lurking variable z Causation Causation: x causes y

Causation Causation is not easily established. The best evidence for causation comes from experiements that change x while holding all other factors fixed. Even when direct causation is present, it is rarely a complete explanation of an association between two variables. Even well established causal relations may not generalize to other settings.

Common Response “Beware the Lurking Variable” The observed association between two variables may be due to a third variable. Both x and y may be changing in response to changes in z. Consider the “Presbyterian Minister and Barrels of Rum” example...do ministers actually cause people to drink more rum?

Confounding Two variables are confounded when their effects on a response variable cannot be distinguished from each other. Confounding prevents us from drawing conclusions about causation. We can help reduce the chances of confounding by designing a well-controlled experiment.

Exercises For Exercises 4.41 – 4.47, carry out the instructions. Then state whether the relationship between the two variables involves causation, common response, or confounding. Identify possible lurking variable(s). Draw a diagram of the relationship in which each circle represents a variable. Write a brief description of the variable by each circle.

Exercises 4.41: There is a high positive correlation: nations with many TV sets have higher life expectancies. Could we lengthen the life of people in Rwanda by shipping them TVs? 4.42: People who use artificial sweeteners in place of sugar tend to be heavier than people who use sugar. Does artificial sweetener use cause weight gain? 4.43: Women who work in the production of computer chips have abnormally high numbers of miscarriages. The union claimed chemicals cause the miscarriages. Another explanation may be the fact these workers spend a lot of time on their feet.

Exercises-con’t 4.44: People with two cars tend to live longer than people who own only one car. Owning three cars is even better, and so on. What might explain the association? 4.45: Children who watch many hours of TV get lower grades on average than those who watch less TV. Why does this fact not show that watching TV causes low grades?

Exercises-con’t 4.46: Data show that married men (and men who are divorced or widowed) earn more than men who have never been married. If you want to make more money, should you get married? 4.47: High school students who take the SAT, enroll in an SAT coaching course, and take the SAT again raise their mathematics score from an average of 521 to 561. Can this increase be attributed entirely to taking the course?

Half Sheet Quiz! Answer your exercise Then, in your own words, explain the meaning of causation, common response, and confounding

Other Concerns Extrapolation: The use of a regression line or curve for a prediction outside the domain of the explanatory variable. These predictions cannot be trusted Averaged Data: Many regression or correlation studies work with averages or other measures that combine information from many individuals These results cannot be applied to individuals - they can only be applied to the averaged units One of the advantages of averaging data is that it takes care of some lurking variables Correlations based on averages are typically too high when applied to individuals

REMEMBER! Correlation does NOT imply causation!

Circle Share Make two large circles, one inside of the other Turn to face one another Each person takes a turn to share one thing learned today in class. Stop sharing when I call stop!