Correlation vs. Causation What is the difference?.

Slides:



Advertisements
Similar presentations
Spearman’s Rank Correlation Coefficient
Advertisements

A recent newspaper article reported that the number of personal computers being sold is increasing. In addition, the number of athletic shoes being sold.
Linear regression and correlation
Describing the Relation Between Two Variables
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Scatter Diagrams and Correlation
Calculating and Interpreting the Correlation Coefficient ~adapted from walch education.
Correlation & Regression Math 137 Fresno State Burger.
Simple Linear Regression
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Overview 4.2 Introduction to Correlation 4.3 Introduction to Regression.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Scatterplots October 14, Warm-Up Given the following domain and range in set notation, write the equivalent domain and range in algebraic notation.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1.6 Linear Regression & the Correlation Coefficient.
Run the colour experiment where kids write red, green, yellow …first.
2 Variable Stats Does the amount of time spent on homework improve your grade? If you eat more junk food will you gain weight? Does the amount of rainfall.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Scatter Diagrams and Correlation Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Click to edit Master title style Midterm 3 Wednesday, June 10, 1:10pm.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Warm-Up Write the equation of each line. A B (1,2) and (-3, 7)
Section 5.1: Correlation. Correlation Coefficient A quantitative assessment of the strength of a relationship between the x and y values in a set of (x,y)
Correlation.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
Discovering Mathematics Week 9 – Unit 6 Graphs MU123 Dr. Hassan Sharafuddin.
Scatter Plot A scatter plot is a graph of a collection of ordered pairs (x,y). The ordered pairs are not connected The graph looks like a bunch of dots,
CORRELATION ANALYSIS.
Unit 3 Section : Regression  Regression – statistical method used to describe the nature of the relationship between variables.  Positive.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
4.5 – Analyzing Lines of Best Fit Today’s learning goal is that students will be able to Use residuals to determine how well lines of fit model data. Distinguish.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Scatter Plots and Correlation Coefficients
Pearson’s Correlation Coefficient
Lesson 4.5 Topic/ Objective: To use residuals to determine how well lines of fit model data. To use linear regression to find lines of best fit. To distinguish.
Correlation & Regression
Correlation vs. Causation
Correlation vs. Causation
SIMPLE LINEAR REGRESSION MODEL
Scatterplots A way of displaying numeric data
Linear Trends and Correlations
Correlation vs. Causation
CHAPTER 10 Correlation and Regression (Objectives)
2. Find the equation of line of regression
Correlation vs. Causation
Correlation vs. Causation
Correlation vs. Causation
CORRELATION ANALYSIS.
Section 1.4 Curve Fitting with Linear Models
Correlation vs. Causation
y = mx + b Linear Regression line of best fit REMEMBER:
Correlation vs. Causation
Ch 9.
Warsaw Summer School 2017, OSU Study Abroad Program
Correlation vs. Causation
Presentation transcript:

Correlation vs. Causation What is the difference?

Causation Causation means that one thing causes another – You hit your finger with a hammer so your finger swells. Your finger swelling is CAUSED by the hammer hitting it.

Correlation Correlation is a concept from statistics that measures the relationship between two things. – A positive correlation means that when one thing goes up, the other goes up too. – A negative correlation is the opposite, when one goes up, the other goes down

Can you tell the two apart? The temperature outside and ice cream sales? – Correlation (they are related but hot weather doesn’t CAUSE ice cream sales to increase) The number of firemen fighting a fire and the size of the fire? – Correlation (The bigger the fire, the more firemen; however, this doesn’t mean that bringing more firemen will cause the size of the fire to increase). Not studying for a test and getting a low grade? – Correlation (they are related but you COULD make a good grade and not study, right?) Exercising and burning calories? – Causation (burning calories is the RESULT of the exercise; in other words, exercising CAUSED you to burn calories)

Correlation Coefficients Correlation coefficients are used in statistics to measure how strong a relationship is between two variables. The value of r is such that -1 < r < +1. The + and – signs are used for positive linear correlations and negative linear correlations, respectively.

Positive, Negative or No Positive correlation: If x and y have a strong positive linear correlation, r is close to +1. An r value of exactly +1 indicates a perfect positive fit. Positive values indicate a relationship between x and y variables such that as values for x increases, values for y also increase. Negative correlation: If x and y have a strong negative linear correlation, r is close to -1. An r value of exactly -1 indicates a perfect negative fit. Negative values indicate a relationship between x and y such that as values for x increase, values for y decrease. No correlation: If there is no linear correlation or a weak linear correlation, r is close to 0. A value near zero means that there is a random, nonlinear relationship between the two variables

Way More Info Than You Want: Correlation Coefficient, r : The quantity r, called the linear correlation coefficient, measures the strength and the direction of a linear relationship between two variables. The linear correlation coefficient is sometimes referred to as the Pearson product moment correlation coefficient in honor of its developer Karl Pearson. The mathematical formula for computing r is: where n is the number of pairs of data. (Aren't you glad you have a graphing calculator that computes this formula?)

Where do you find “r” on the calculator? The correlation coefficient, r, will appear on the screen that shows the regression equation information In addition to appearing with the regression information, the values r and r 2 can be found under VARS, #5 Statistics → EQ #7 r and #8 r 2.

Meaning of r values A correlation greater than 0.8 is generally described as strong, whereas a correlation less than 0.5 is generally described as weak.

Describe as weak or strong and positive or negative -0.4? – Weak negative (negative sign shows negative correlation and 0.4 is less than 0.5 so weak) 0.9? – Strong positive (greater than 0.8 so strong) -0.87? – Strong negative

Graphs for Correlation -

Summary The closer your correlation coefficient is to 1 or -1 the more linear your data will look when graphed. When r = 1, the data would be perfectly linear with a positive slope When r = -1, the data would be perfectly linear with a negative slope The closer the r value gets to zero (from either direction, negative or positive), the weaker the correlation is. When r= 0, there would be no correlation.