Cautions About Correlation and Regression Section 4.2.

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Chapter 4: Designing Studies
Aim: How do we establish causation?
AP Statistics Section 4.3 Establishing Causation
Correlation AND EXPERIMENTAL DESIGN
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
MA 102 Statistical Controversies Friday, March 22, 2002 Today: Chapter 15 More on Correlation Regression Causation Reading : None new Exercises: 15.1,
Lesson Establishing Causation. Knowledge Objectives Identify the three ways in which the association between two variables can be explained. Define.
Basic Practice of Statistics - 3rd Edition
Causation. Learning Objectives By the end of this lecture, you should be able to: – Describe causation and the ways in which it differs from correlation.
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
 Pg : 3b, 6b (form and strength)  Page : 10b, 12a, 16c, 16e.
Chapter 4 Section 3 Establishing Causation
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
 Correlation and regression are closely connected; however correlation does not require you to choose an explanatory variable and regression does. 
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Stat 13, Thur 5/24/ Scatterplot. 2. Correlation, r. 3. Residuals 4. Def. of least squares regression line. 5. Example. 6. Extrapolation. 7. Interpreting.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Does Association Imply Causation? Sometimes, but not always! What about: –x=mother's BMI, y=daughter's BMI –x=amt. of saccharin in a rat's diet, y=# of.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Get out your Residuals Worksheet! You will be able to distinguish between correlation and causation. Today’s Objectives:
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Lecture 5 Chapter 4. Relationships: Regression Student version.
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
Describing Relationships
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Section Causation AP Statistics ww.toddfadoir.com/apstats.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
The Question of Causation 4.2:Establishing Causation AP Statistics.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
4. Relationships: Regression
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
4. Relationships: Regression
Chapter 4.2 Notes LSRL.
Cautions About Correlation and Regression
Proving Causation Why do you think it was me?!.
Establishing Causation
Cautions about Correlation and Regression
Section 4.3 Types of Association
Chapter 2: Looking at Data — Relationships
Chapter 2 Looking at Data— Relationships
Register for AP Exams --- now there’s a $10 late fee per exam
DRILL Put these correlations in order from strongest to weakest.
Cautions about Correlation and Regression
7 Minutes of Silence Determine if the data is linear or exponential.
The Question of Causation
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Lesson Using Studies Wisely.
Least-Squares Regression
EQ: What gets in the way of a good model?
Chapter 4: Designing Studies
Does Association Imply Causation?
4.2 Cautions about Correlation and Regression
Section 6.2 Establishing Causation
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Chapter 4: More on Two-Variable Data
Presentation transcript:

Cautions About Correlation and Regression Section 4.2

CAUTIONS … to keep in mind … Extrapolation – A prediction made based on a regression line for a value of x that is outside of the domain of values for the explanatory variable. Such predictions are often inaccurate. (Example … Mile Run far in the future) A prediction made based on a regression line for a value of x that is outside of the domain of values for the explanatory variable. Such predictions are often inaccurate. (Example … Mile Run far in the future) Lurking Variables – A variable that is NOT among the explanatory or response variables, that may influence the interpretation of the relationship among those variables. (Example …Men, Women, Heart Disease Treatment) A variable that is NOT among the explanatory or response variables, that may influence the interpretation of the relationship among those variables. (Example …Men, Women, Heart Disease Treatment)

More Cautions … Using Averaged Data – When studies use averages from large numbers of people, resist the urge to apply the findings to the individuals. Averages will smooth out the deviations from the LSRL. CAUSATION – A correlation does not imply a causation. Other explanations exist regarding the Association – Common Response & Confounding

Explaining Association Causation: A strong association may in fact be a result of a true causation. Sometimes there are more factors as well. (Ex: BMI Mom, BMI daughter – genetic IS the cause, but Diet, Exercise are also relevant) Sometimes there are more factors as well. (Ex: BMI Mom, BMI daughter – genetic IS the cause, but Diet, Exercise are also relevant) EXPERIMENTS are what we use to hold as many factors constant as possible. EXPERIMENTS are what we use to hold as many factors constant as possible. Yet, the finding might not generalize to other settings. (Ex: Rats, Saccharin, Bladder Tumors) Yet, the finding might not generalize to other settings. (Ex: Rats, Saccharin, Bladder Tumors)

Explaining Association Common Response – “Beware the Lurking Variable” The strong association between x and y might be a common response to some other variable z. Ex: High SATs and High College Grades – z = the students ability and knowledge. Ex: Amount of Money individuals invest, and how well the market does – z = underlying investor sentiment.

Confounding – Two variables are confounded when their effects cannot be distinguished from each other. Mixing in many different causes together at the same time (Ex: Heredity, Diet, Exercise, Modeled Behavior, Couch Potato). EX: Religious people live longer. It might not be the religion, it might be that hey also take better care of themselves – less likely to smoke, drink, live excessively. EX: More education and higher income. It might be the initial affluence that drives the ability to get the education.

CAUSATION Carefully Designed Experiments Control the Lurking Variables Does Gun Control Reduce Violent Crime? Do Power Lines Cause Cancer? Ethical and Practical Constraints!

Smoking & Lung Cancer In the absence of and experiment, what is needed to establish “Causation”: Strong Association (How strong is the association to start with – for smoking and lung cancer, it is very strong); Strong Association (How strong is the association to start with – for smoking and lung cancer, it is very strong); Consistent Association (Many studies, many countries, many different kinds of people); Consistent Association (Many studies, many countries, many different kinds of people); Higher Doses have Stronger Responses (People who smoke more, have greater incidents of cancer); Higher Doses have Stronger Responses (People who smoke more, have greater incidents of cancer); Alleged Cause is Chronologically before the Effect (Deaths today are related to smoking from 30 years ago); Alleged Cause is Chronologically before the Effect (Deaths today are related to smoking from 30 years ago); The Alleged Cause is Plausible (Animal Research) The Alleged Cause is Plausible (Animal Research) The evidence that Smoking Causes Lung cancer is OVERWHELMING … but nothing “beats” a well- designed Experiment.