Cautions About Correlation and Regression

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Looking at Data-Relationships 2.1 –Scatter plots.
Lesson Establishing Causation. Knowledge Objectives Identify the three ways in which the association between two variables can be explained. Define.
BPS - 5th Ed. Chapter 51 Regression. BPS - 5th Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
Basic Practice of Statistics - 3rd Edition
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
1 10. Causality and Correlation ECON 251 Research Methods.
 Correlation and regression are closely connected; however correlation does not require you to choose an explanatory variable and regression does. 
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Chapter 2 Looking at Data - Relationships. Relations Among Variables Response variable - Outcome measurement (or characteristic) of a study. Also called:
BPS - 3rd Ed. Chapter 51 Regression. BPS - 3rd Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
Chapter 5 Regression BPS - 5th Ed. Chapter 51. Linear Regression  Objective: To quantify the linear relationship between an explanatory variable (x)
BPS - 5th Ed. Chapter 51 Regression. BPS - 5th Ed. Chapter 52 u Objective: To quantify the linear relationship between an explanatory variable (x) and.
Chapter 4 More on Two-Variable Data YMS 4.1 Transforming Relationships.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
Describing Relationships
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
Cautions About Correlation and Regression Section 4.2.
Stat 1510: Statistical Thinking and Concepts REGRESSION.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Prediction and Causation How do we predict a response? Explanatory Variables can be used to predict a response: 1. Prediction is based on fitting a line.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 5: 02/17/ Chapter 5 Regression. 2 Chapter 5: 02/17/2004 Objective: To quantify the linear relationship between an explanatory variable (x)
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
CHAPTER 3 Describing Relationships
Chapter 4.2 Notes LSRL.
Essential Statistics Regression
Proving Causation Why do you think it was me?!.
Cautions about Correlation and Regression
Section 4.3 Types of Association
Chapter 2: Looking at Data — Relationships
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Chapter 2 Looking at Data— Relationships
Cautions about Correlation and Regression
Least-Squares Regression
The Question of Causation
Basic Practice of Statistics - 5th Edition Regression
HS 67 (Intro Health Stat) Regression
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Basic Practice of Statistics - 3rd Edition Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
Does Association Imply Causation?
Chapter 3: Describing Relationships
4.2 Cautions about Correlation and Regression
Correlation/regression using averages
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Section 6.2 Establishing Causation
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Experiments Observational Study – observes individuals and measures variables of interest but does not attempt to influence the responses. Experiment.
Chapter 4: More on Two-Variable Data
Correlation/regression using averages
Presentation transcript:

Cautions About Correlation and Regression AP STATISTICS Cautions About Correlation and Regression

ESSENTIAL QUESTION: What is causation and how can it be determined? OBJECTIVES: To examine data for causation, and to examine data. To be careful when using models for extrapolation. To understand that causation may be hard to determine do to effects of lurking variables, common response, and confounding variables.

Correlation and Regression Correlation and regression describe only linear relationships. The correlation r and the least-squares regression line are not resistant. One influential or incorrectly entered data point can greatly change these measures. Always plot data before interpreting regression or correlation.

Extrapolation Extrapolation is the use of a regression line for prediction far outside the domain of values of the explanatory variable x that you used to obtain the line or curve. Such predictions are often not accurate.

Lurking Variables A lurking variable is a variable that is not among the explanatory or response variables in study and yet may influence the interpretation of relationships among those variables.

Lurking Variables The relationship between two variables can be strongly influenced by lurking variables. Many lurking variables change systematically over time. One method of detecting if time has an influence is to plot residuals and response variables over time if available.

Using averaged data Many regression or correlation studies work with averages or other measures that combine information from many individuals. Note this carefully and resist the temptation to apply the results of such studies to individuals. Correlations based on averages are usually too high when applied to individuals. This is another reminder that it is important to note exactly what variables were measured in a statistical study.

The question of causation In many studies of the relationship between two variables, the goal is to establish that changes in the explanatory variable cause changes in the response variable. Even when a strong association is present, the conclusion that this association is due to a causal linking in the variables is often elusive.

Explaining association: causation Variable x and y show a strong association (dashed line). This association may be the result of any of several causal relationships ( solid arrow).

Causal associations Causation: changes in x cause changes in y. Common response: Changes in both x and y are caused by changes in a lurking variable z. Confounding: The effect ( if any ) of x and y is confounded with the effect of a lurking variable. Even when direct causation is present. It is exactly a complete explanation of an association between two variables.

Explaining association: Common Response Beware of lurking variables when thinking about an association between two variables. The observed association between the variables x and y is explained by a lurking variable z. Both x and y change to changes in z. This common response creates an association even though there may be no direct causal link between x and y.

Confounding Two variables are confounded when their effects on a response variable cannot be distinguished from each other. The confounded variables may be either explanatory variables or lurking variables. Even a very strong association between two variables is not by itself good evidence that there is a cause-and-effect link between the variables.

What are the criteria for establishing causation when we can’t do an experiment? The association is strong. The association is consistent. Higher doses are associated with stronger responses. The alleged cause precedes the effect in time. The alleged cause is plausible.