Correlation and Causality

Slides:



Advertisements
Similar presentations
Copyright © 2011 Pearson Education, Inc. Statistical Reasoning.
Advertisements

7.1 Seeking Correlation LEARNING GOAL
Section 7.2 ~ Interpreting Correlations Introduction to Probability and Statistics Ms. Young ~ room 113.
Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.
Copyright © 2015, 2011, 2008 Pearson Education, Inc. Chapter 5, Unit E, Slide 1 Statistical Reasoning 5.
CORRELATON & REGRESSION
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlation and Regression Analysis
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Regression and Correlation
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section 7.3 ~ Best-Fit Lines and Prediction Introduction to Probability and Statistics Ms. Young.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 10 Correlation and Regression
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113.
Section 7.4 ~ The Search for Causality Introduction to Probability and Statistics Ms. Young.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Check roster below the chat area for your name to be sure you get credit! Audio will start at class time. Previously requested topics will be gone over.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 5 Statistical Reasoning.
Chapter 7 found in Unit 5 Correlation & Causality Section 1: Seeking Correlation Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or Re-Enter.
Found StatCrunch Resources
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 7 found in Unit 5 Correlation & Causality Section 1: Seeking Correlation Page 286 Can't Type? press F11 Can’t Hear? Check: Speakers, Volume or.
Copyright © 2011 Pearson Education, Inc. Statistical Reasoning 1 web 39. Weather Maps 40. Cancer Cure 1 world 41. News Graphics 42. Geographical Data.
Unit 5E Correlation and Causality. CORRELATION Heights and weights Study Time and Test Score Available Gasoline and Price of Gasoline A correlation exists.
Some Reminders: Check the Roster below the chat area to make sure you are listed, especially if it says you left! Audio starts on the hour. Active on-topic.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © Cengage Learning. All rights reserved.
7.1 Seeking Correlation LEARNING GOAL
Welcome to the Unit 5 Seminar Kristin Webster
Topics
Regression and Correlation
Regression Analysis AGEC 784.
Essential Statistics Regression
What is Correlation Analysis?
Correlation and Regression
7.2 Interpreting Correlations
SIMPLE LINEAR REGRESSION MODEL
Cautions about Correlation and Regression
Correlation and Regression
7.3 Best-Fit Lines and Prediction
Chapter 25 Comparing Counts.
Elementary Statistics
7.2 Interpreting Correlations
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Chapter 8 Part 2 Linear Regression
7.2 Interpreting Correlations
Prepared by Lee Revere and John Large
Lecture Notes The Relation between Two Variables Q Q
7.3 Best-Fit Lines and Prediction
Correlation and Causality
Least-Squares Regression
Statistical Reasoning
Looking at data: relationships - Caution about correlation and regression - The question of causation IPS chapters 2.4 and 2.5 © 2006 W. H. Freeman and.
Chapter 26 Comparing Counts.
Least-Squares Regression
LG: I can assess the reliability of a linear model
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Correlation/regression using averages
Inferences for Regression
Basic Practice of Statistics - 3rd Edition Lecture Powerpoint
Chapter 26 Comparing Counts.
MGS 3100 Business Analysis Regression Feb 18, 2016
Correlation/regression using averages
Correlation and Prediction
Presentation transcript:

Correlation and Causality Chapter 7 Correlation and Causality

GED111/CDS111 Statistics in Modern Society Correlation It exists between 2 variables when higher values of one variable consistently go with higher values of another variable or when higher values of one variable consistently go with lower values of another variable Height vs weight Demand for apples vs price of apples Practice time vs skill Inflation vs unemployment GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Types of correlation Positive correlation Negative correlation No correlation Nonlinear correlation Fig 7.3 p289 GED111/CDS111 Statistics in Modern Society

Measuring the strength of a correlation Correlation coefficient, r Rising straight line, r>0, positive correlation Descending straight line, r<0, negative correlation r ≈ 0, no correlation GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Exercise Q5-8 p295 Q9-13 p295 GED111/CDS111 Statistics in Modern Society

Interpreting Correlations Once a correlation between 2 variables has been established, we can then proceed to identify the nature of the relationship so that useful predictions can be made. Beware of outliners Correlation can be very sensitive to outliners Should NOT remove outliners unless we confirm our suspicion that they were invalid data points GED111/CDS111 Statistics in Modern Society

Correlation does not imply causality Possible explanation for a correlation Coincidence Both variables might be directly influenced by some common underlying cause. One of the correlated variables may actually be a cause of the other. But note that, even in this case, it may be just one of several causes. GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Exercise Q9-13 p305 GED111/CDS111 Statistics in Modern Society

Best-fit lines and Prediction The best-fit line (or regression line) on a scatter diagram is a line that lies closer to the data point than any other possible line (according to a standard statistical measure of closeness). Using this Regression line, one variable could be used to predict the other variable GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Caution Don’t expect a best-fit line to give a good prediction unless the correlation is significant. Don’t use the best-fit line to make predictions beyond the bounds of the data points to which the line was fit. A best-fit line based on past data is not necessarily valid now and might not result in valid predictions of the future. GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Caution (cont.) Don’t make predictions about a population that is different from the population from which the sample data were drawn. Remember that a best-fit line should not be used for predictions when there is no significant correlation or when the relationship is nonlinear. GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Multiple Regression Multiple Regression allows the calculation of a best-fit equation that represents the best fit between one variable (such as price) and a combination of 2 or more other variables (such as weight and colour). The coefficient of determination R2, tells us the proportion of the scatter in the data accounted for by the best-fit equation. GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Causality The fact that one factor causes another Correlation may suggest causality but by itself never establishes causality. Correlation can be explained by Coincidence Common underlying cause (for the 2 variables) One variable is actually having a direct influence on the other Causality is to eliminate the 1st 2 reasons for correlation GED111/CDS111 Statistics in Modern Society

Guidelines for Causality Look for situations in which the effect is correlated with the suspected cause even while other factors vary. Among groups that differ only in the presence or absence of the suspected cause, check that the effect is similarly present or absent. GED111/CDS111 Statistics in Modern Society

Guidelines for Causality (cont.) Look for evidence that larger amounts of the suspected cause produce larger amounts of the effect. If the effect might be produced by other potential causes (besides your suspected cause), make sure that the effect still remains after accounting for these other potential causes. GED111/CDS111 Statistics in Modern Society

Guidelines for Causality (cont.) If possible, test suspected cause with an experiment. If the experiment cannot be performed with humans for ethical reasons, consider doing the experiment with animals, cell cultures, or computer models. Try to determine the physical mechanism by which the suspected cause produces the effect. GED111/CDS111 Statistics in Modern Society

Broad Levels of Confidence in Causality Possible cause We have discovered a correlation, but cannot yet determine whether the correlation implies causality. In the legal system, possible cause is often the reason for starting an investigation GED111/CDS111 Statistics in Modern Society

Broad Levels of Confidence in Causality (cont.) Probable cause We have good reason to suspect that the correlation involves cause, perhaps because some of the guidelines for establishing causality are satisfied. In the legal system, probable cause is the general standard for getting a judge to grant a warrant for a search or wiretap GED111/CDS111 Statistics in Modern Society

Broad Levels of Confidence in Causality (cont.) Cause beyond reasonable doubt We have found a physical model that is so successful in explaining how one thing causes another that it seems unreasonable doubt is the usual standard for convictions and generally demands that the prosecution have shown how and why the suspect committed the crime. Note that beyond reasonable doubt does not mean beyond all doubt. GED111/CDS111 Statistics in Modern Society

GED111/CDS111 Statistics in Modern Society Focus on Education What Helps Children Learn to Read? pp325-327 GED111/CDS111 Statistics in Modern Society