Cautions About Correlation and Regression Section 4.2

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Aim: How do we establish causation?
AP Statistics Section 4.3 Establishing Causation
MA 102 Statistical Controversies Friday, March 22, 2002 Today: Chapter 15 More on Correlation Regression Causation Reading : None new Exercises: 15.1,
Lesson Establishing Causation. Knowledge Objectives Identify the three ways in which the association between two variables can be explained. Define.
Basic Practice of Statistics - 3rd Edition
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
 Pg : 3b, 6b (form and strength)  Page : 10b, 12a, 16c, 16e.
Chapter 4 Section 3 Establishing Causation
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
 Correlation and regression are closely connected; however correlation does not require you to choose an explanatory variable and regression does. 
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
1 Chapter 4: More on Two-Variable Data 4.1Transforming Relationships 4.2Cautions 4.3Relations in Categorical Data.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Does Association Imply Causation? Sometimes, but not always! What about: –x=mother's BMI, y=daughter's BMI –x=amt. of saccharin in a rat's diet, y=# of.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Lecture 5 Chapter 4. Relationships: Regression Student version.
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Cautions About Correlation and Regression Section 4.2.
Section Causation AP Statistics ww.toddfadoir.com/apstats.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
The Question of Causation 4.2:Establishing Causation AP Statistics.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
2.7 The Question of Causation
Chapter 4.2 Notes LSRL.
Cautions About Correlation and Regression
Proving Causation Why do you think it was me?!.
Establishing Causation
Cautions about Correlation and Regression
Section 4.3 Types of Association
Chapter 2: Looking at Data — Relationships
Chapter 2 Looking at Data— Relationships
Register for AP Exams --- now there’s a $10 late fee per exam
DRILL Put these correlations in order from strongest to weakest.
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Cautions about Correlation and Regression
7 Minutes of Silence Determine if the data is linear or exponential.
The Question of Causation
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Lesson Using Studies Wisely.
Least-Squares Regression
EQ: What gets in the way of a good model?
Chapter 4: Designing Studies
Does Association Imply Causation?
Chapter 4: Designing Studies
4.2 Cautions about Correlation and Regression
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Section 6.2 Establishing Causation
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Chapter 4: More on Two-Variable Data
Correlation/regression using averages
Presentation transcript:

Cautions About Correlation and Regression Section 4.2

CAUTIONS … to keep in mind … Extrapolation – A prediction made based on a regression line for a value of x that is outside of the domain of values for the explanatory variable. Such predictions are often inaccurate. (Example … Mile Run far in the future) Lurking Variables – A variable that is NOT among the explanatory or response variables, that may influence the interpretation of the relationship among those variables. (Example …Men, Women, Heart Disease Treatment)

More Cautions … Using Averaged Data – CAUSATION – When studies use averages from large numbers of people, resist the urge to apply the findings to the individuals. Averages will smooth out the deviations from the LSRL. CAUSATION – A correlation does not imply a causation. Other explanations exist regarding the Association – Common Response & Confounding

Explaining Association Causation: A strong association may in fact be a result of a true causation. Sometimes there are more factors as well. (Ex: BMI Mom, BMI daughter – genetic IS the cause, but Diet, Exercise are also relevant) EXPERIMENTS are what we use to hold as many factors constant as possible. Yet, the finding might not generalize to other settings. (Ex: Rats, Saccharin, Bladder Tumors)

Explaining Association Common Response – “Beware the Lurking Variable” The strong association between x and y might be a common response to some other variable z. Ex: High SATs and High College Grades – z = the students ability and knowledge. Ex: Amount of Money individuals invest, and how well the market does – z = underlying investor sentiment.

Explaining Association Confounding – Two variables are confounded when their effects cannot be distinguished from each other. Mixing in many different causes together at the same time (Ex: Heredity, Diet, Exercise, Modeled Behavior, Couch Potato). EX: Religious people live longer. It might not be the religion, it might be that hey also take better care of themselves – less likely to smoke, drink, live excessively. EX: More education and higher income. It might be the initial affluence that drives the ability to get the education.

CAUSATION Carefully Designed Experiments Control the Lurking Variables Does Gun Control Reduce Violent Crime? Do Power Lines Cause Cancer? Ethical and Practical Constraints!

Smoking & Lung Cancer In the absence of and experiment, what is needed to establish “Causation”: Strong Association (How strong is the association to start with – for smoking and lung cancer, it is very strong); Consistent Association (Many studies, many countries, many different kinds of people); Higher Doses have Stronger Responses (People who smoke more, have greater incidents of cancer); Alleged Cause is Chronologically before the Effect (Deaths today are related to smoking from 30 years ago); The Alleged Cause is Plausible (Animal Research) The evidence that Smoking Causes Lung cancer is OVERWHELMING … but nothing “beats” a well-designed Experiment.