Correlation.  It should come as no great surprise that there is an association between height and weight  Yes, as you would expect, taller students.

Presentation transcript:

Correlation

 It should come as no great surprise that there is an association between height and weight  Yes, as you would expect, taller students tend to weigh more (or, conversely, heavier students tend to be taller)

Standardizing  We want to put a number on the strength of the association between the two variables of a scatterplot.  We want that number to be unaffected by our choice of units (e.g., kg vs. lbs), because changing units doesn't change the direction, form, or strength of the relationship.
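As a minimal sketch of why standardizing removes the unit choice, the snippet below converts the same weights to z-scores twice, once from pounds and once from kilograms. The weights are made-up illustration values and `standardize` is a hypothetical helper name, neither comes from the slides.

```python
from statistics import mean, stdev

def standardize(values):
    """Return z-scores: (value - mean) / sample standard deviation."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

# Hypothetical weights for five students, in pounds (made-up numbers).
weights_lb = [120.0, 135.0, 150.0, 165.0, 190.0]
# The same weights converted to kilograms.
weights_kg = [w * 0.45359237 for w in weights_lb]

z_lb = standardize(weights_lb)
z_kg = standardize(weights_kg)

# Print the two sets of z-scores side by side for comparison.
for a, b in zip(z_lb, z_kg):
    print(f"{a:+.4f}  {b:+.4f}")
```

The two columns match up to floating-point rounding, because rescaling a variable rescales its mean and standard deviation by the same factor.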

Standardizing

Effects of Standardizing  Underlying linear patterns often appear steeper in the standardized plot.  Why?  Hint: Look at the units being used.

Reading Standardized Plots

Correlation Reviewed and Expanded

A Measure of Correlation

Quick Check  What does a correlation coefficient of r = 0.8 look like?  What does a correlation coefficient of r = 0.3 look like?  Note: r will always be used for correlation

Correlation Conditions  Correlation measures the strength of the linear association between two quantitative variables. To use correlation, you must check several conditions:  Quantitative Variables Condition: Correlation applies only to quantitative variables; it cannot be used for categorical variables. Check that you know the variables' units and what they measure.

Correlation Conditions  Correlation measures the strength of the linear association between two quantitative variables. To use correlation, you must check several conditions:  Straight Enough Condition: Is the form of the scatterplot straight enough so that a linear relationship makes sense?

Correlation Conditions  Correlation measures the strength of the linear association between two quantitative variables. To use correlation, you must check several conditions:  Outlier Condition: Outliers can distort the correlation dramatically. An outlier can make an otherwise weak correlation look big or hide a strong correlation. It can even give an otherwise positive association a negative correlation coefficient, and vice versa. When you see an outlier, it is often a good idea to report the correlation with and without that point.

Just Checking  Let's say I gave two exams, both worth 50 points, and reported that the correlation between the two scores was 0.75.  1) Before answering any questions about the correlation, what would you like to see, and why?  Answer: We know the scores are quantitative, so we should check whether the Straight Enough Condition and the Outlier Condition are satisfied by looking at the scatterplot of the two scores.

Just Checking  Let's say I gave two exams, both worth 50 points, and reported that the correlation between the two scores was 0.75.  2) If I add 10 points to each Exam 1 score, how will this change the correlation?  Answer: It will not change.

Just Checking  Let's say I gave two exams, both worth 50 points, and reported that the correlation between the two scores was 0.75.  3) If I standardize the scores on each exam, how will this affect the correlation?  Answer: It will not change.

Just Checking  Let's say I gave two exams, both worth 50 points, and reported that the correlation between the two scores was 0.75.  4) In general, if someone did poorly on Exam 1, are they likely to have done poorly on Exam 2? Explain.  Answer: They are likely to have done poorly. The positive correlation means low scores on Exam 1 are associated with low scores on Exam 2.

Just Checking  Let's say I gave two exams, both worth 50 points, and reported that the correlation between the two scores was 0.75.  5) If someone did poorly on Exam 1, can you be sure they did poorly on Exam 2? Explain.  Answer: No. The general association is positive, but individual performances may vary.

Correlation Properties  The sign of a correlation coefficient gives the direction of the association.  Correlation is always between -1 and +1. The values -1 and +1 themselves are unusual in real data because they mean that all the data points fall exactly on a single straight line.  Correlation treats x and y symmetrically: the correlation of x with y is the same as the correlation of y with x.

Correlation Properties  Correlation has no units. Correlation is sometimes given as a percentage, but you probably shouldn’t do that because it suggests a percentage of something, and correlation, lacking units, has no “something” of which to be a percent.  Correlation is not affected by changes in the center or scale of either variable. Changing the units or baseline of either variable has no effect on the correlation coefficient. Correlation depends only on the z-scores, and they are unaffected by changes in center or scale.

Correlation Properties  Correlation measures the strength of the linear association between the two variables. Variables can be strongly associated but still have a small correlation if the association is not linear.  Correlation is sensitive to outliers. A single outlying value can make a small correlation large or a large correlation small.

 The more firemen fighting a fire, the bigger the fire is observed to be.  Therefore firemen cause an increase in the size of a fire.

 As ice cream sales increase, the rate of drowning deaths increases sharply.  Therefore, ice cream consumption causes drowning.

 Therefore, global warming is causing obesity.

 A hidden variable that stands behind a relationship and determines it by simultaneously affecting the other two variables is called a lurking variable.  Ice cream sales and drowning are both caused by an increased number of beach-goers during the summer.  Obesity and global warming are both caused by increased wealth and energy consumption.

Correlation Tables  It is common in some fields to compute the correlations between every pair of variables in a collection and arrange these correlations in a table.  Why is this dangerous?
[Table: pairwise correlations among Assets, Sales, Market Value, Profits, Cash Flow, and Employees; entries other than the diagonal 1 for Assets are missing.]

Straightening Scatterplots  An Example With Cameras  Some camera lenses have an adjustable aperture, the hole that lets the light in.  The size of this aperture is expressed as a mysterious number called the f/stop.  Each increase of one f/stop number corresponds to halving the light that is allowed to come through.  When we halve the shutter speed, we cut down the light that gets let in, so we have to open the aperture one notch.

Straightening Scatterplots  We can experiment to find the best f/stop value for each shutter speed.
Shutter Speed: 1/1000, 1/500, 1/250, 1/125, 1/60, 1/30, 1/15, 1/8
f/stop:

Straightening Scatterplots  The correlation of these shutter speeds and f/stops is 0.979. That sounds pretty high, and you might assume a strong linear relationship. But when we check the scatterplot, it shows something is not quite right.
Shutter Speed: 1/1000, 1/500, 1/250, 1/125, 1/60, 1/30, 1/15, 1/8
f/stop:

Straightening Scatterplots

 We can see that f/stop is not linearly related to shutter speed. Can we find a transformation of f/stop that straightens out the line?  What if we look at the square of the f/stop against the shutter speed?

Straightening Scatterplots

 The correlation is now 0.998, but the increase in correlation is not important. What is important is that the form of the plot is now straight, so the correlation is now an appropriate measure of the association.

Homework Pg 165, # 12, 17, 23, 27, 33