Correlation: How Strong Is the Linear Relationship?

Slides:



Advertisements
Similar presentations
Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
Advertisements

AP Statistics Section 3.2 C Coefficient of Determination
A recent newspaper article reported that the number of personal computers being sold is increasing. In addition, the number of athletic shoes being sold.
EMERGENCY MANAGEMENT PLANNING STAFF RICHMOND REGIONAL PLANNING DISTRICT COMMISSION NOVEMBER 2010 Central Virginia Urban Areas Security Initiative.
Bivariate Data – Scatter Plots and Correlation Coefficient…… Section 3.1 and 3.2.
Describing Relationships Using Correlation and Regression
Lecture 17: Correlations – Describing Relationships Between Two Variables 2011, 11, 22.
Calculating and Interpreting the Correlation Coefficient ~adapted from walch education.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
Correlation: How Strong Is the Linear Relationship? Lecture 50 Sec Mon, May 1, 2006.
Correlation Correlation is used to measure strength of the relationship between two variables.
Section 4.1 Scatter Diagrams and Correlation. Definitions The Response Variable is the variable whose value can be explained by the value of the explanatory.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
3.3 Correlation: The Strength of a Linear Trend Estimating the Correlation Measure strength of a linear trend using: r (between -1 to 1) Positive, Negative.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Correlation The apparent relation between two variables.
9.1B – Computing the Correlation Coefficient by Hand
Lesson Scatter Diagrams and Correlation. Objectives Draw and interpret scatter diagrams Understand the properties of the linear correlation coefficient.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
CORRELATION ANALYSIS.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Applied Quantitative Analysis and Practices LECTURE#10 By Dr. Osman Sadiq Paracha.
Correlation: How Strong Is the Linear Relationship? Lecture 50 Sec Fri, Dec 9, 2005.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Solutions to Healey 2/3e #13.1 (1e #15.1) Turnout by Unemployment and by Negative Campaigning.
Correlation: How Strong Is the Linear Relationship? Lecture 46 Sec Mon, Apr 30, 2007.
Simple Linear Correlation
Pearson’s Correlation Coefficient
Warm Up Scatter Plot Activity.
Examining Relationships
Regression and Correlation
AND.
Linear Regression and Correlation Analysis
The Research Experience for Teachers Program
Scatterplots A way of displaying numeric data
CHAPTER 10 Correlation and Regression (Objectives)
Simple Linear Regression
Correlation and Regression
2. Find the equation of line of regression
Correlation and Regression
A P STATISTICS LESSON 3 – 2 CORRELATION.
Regression.
The Weather Turbulence
CORRELATION ANALYSIS.
Calculating the Least Squares Regression Line
STEM Fair Graphs.
Correlation Coefficient Using Technology
Section 1.4 Curve Fitting with Linear Models
Calculating the Least Squares Regression Line
AP Statistics Section 3.2 Part 1 AP Statistics
Correlation and Regression
Chapter 4 - Scatterplots and Correlation
A P STATISTICS LESSON 3 – 2 CORRELATION.
Calculating the Least Squares Regression Line
Linear Correlation and Regression
y = mx + b Linear Regression line of best fit REMEMBER:
AP Statistics October 1st, 2010 Mr. Calise
Objective: Interpret Scatterplots & Use them to Make Predictions.
The Five-Number Summary
Correlation & Trend Lines
Applying linear and median regression
Section 11.1 Correlation.
Scatter Plots That was easy Year # of Applications
The Squared Correlation r2 – What Does It Tell Us?
Correlation and Prediction
Presentation transcript:

Correlation: How Strong Is the Linear Relationship? Lecture 46 Sec. 13.7 Mon, Dec 3, 2007

The Correlation Coefficient The correlation coefficient r is a number between –1 and +1. It measures the direction and strength of the linear relationship. If r > 0, then the relationship is positive. If r < 0, then the relationship is negative. The closer r is to +1 or –1, the stronger the relationship. The closer r is to 0, the weaker the relationship.

Strong Positive Linear Association In this display, r is close to +1. x y

Strong Positive Linear Association In this display, r is close to +1. x y

Strong Negative Linear Association In this display, r is close to –1. x y

Strong Negative Linear Association In this display, r is close to –1. x y

Almost No Linear Association In this display, r is close to 0. x y

Almost No Linear Association In this display, r is close to 0. x y

Interpretation of r -1 -0.8 -0.2 0.2 0.8 1

Interpretation of r Strong Negative Strong Positive -1 -0.8 -0.2 0.2 0.2 0.8 1

Interpretation of r Weak Negative Weak Positive -1 -0.8 -0.2 0.2 0.8 1

Interpretation of r No Significant Correlation -1 -0.8 -0.2 0.2 0.8 1

Correlation vs. Cause and Effect If the value of r is close to +1 or -1, that indicates that x is a good predictor of y. It does not indicate that x causes y (or that y causes x). The correlation coefficient alone cannot be used to determine cause and effect.

Calculating the Correlation Coefficient There are many formulas for r. The most basic formula is Another formula is

Example Consider again the data x y 1 8 3 12 4 9 5 14 16 20 11 17 15 24

Example We found earlier that SSX = 150 SSY = 206 SSXY = 165

Example Then compute r.

TI-83 – Calculating r To calculate r on the TI-83, First, be sure that Diagnostic is turned on. Press CATALOG and select DiagnosticsOn. Then, follow the procedure that produces the regression line. In the same window, the TI-83 reports r2 and r.

TI-83 – Calculating r Use the TI-83 to calculate r in the preceding example. Find r for the S/T Ratio vs. Graduation Rate. Find r for SOL-Eng Passing Rate vs. Graduation Rate.

Another Formula for r It turns out that where

Another Formula for r Free-lunch participation vs. graduation rate data, SSR = 1896.7, SST = 2598.2. So we get

The Coefficient of Determination r2 is called the coefficient of determination. It is interpreted as telling us how much of the variation in y is determined by the variation in x. So, 73% of the variation is graduation rates is determined by the variation in participation in the free-lunch program.

The Coefficient of Determination What percentage of the variation in graduation rate is determined by the variation in S/T ratio? What percentage of variation in graduation rate is determined by the variation in teachers’ average salary?

How Does r Work? How does r indicate the direction of the relationship? Consider the numerator of the formula.

How Does r Work? Consider the lunch vs. graduation data: District Free Lunch Grad. Rate Amelia 41.2 68.9 King and Queen 59.9 64.1 Caroline 40.2 62.9 King William 27.9 67.0 Charles City 45.8 67.7 Louisa 44.9 80.1 Chesterfield 22.5 80.5 New Kent 13.9 77.0 Colonial Hgts 25.7 73.0 Petersburg 61.6 54.6 Cumberland 55.3 63.9 Powhatan 12.2 89.3 Dinwiddie 45.2 71.4 Prince George 30.9 85.0 Goochland 23.3 76.3 Richmond 74.0 46.9 Hanover 13.7 90.1 Sussex 74.8 59.0 Henrico 30.2 81.1 West Point 19.1 82.0 Hopewell 63.1 63.4

How Does r Work? Consider the lunch vs. graduation data: (first half) x y x –x y –y (x –x)(y –y) 41.2 68.9 40.2 62.9 45.8 67.7 22.5 80.5 25.7 73.0 55.3 63.9 45.2 71.4 23.3 76.3 13.7 90.1 30.2 81.1 63.1 63.4 (first half)

How Does r Work? Consider the lunch vs. graduation data: (first half) x y x –x y –y (x –x)(y –y) 41.2 68.9 1.9 -2.7 40.2 62.9 0.9 -8.7 45.8 67.7 6.5 -3.9 22.5 80.5 -16.8 8.9 25.7 73.0 -13.6 1.4 55.3 63.9 16.0 -7.7 45.2 71.4 5.9 -0.2 23.3 76.3 -16.0 4.7 13.7 90.1 -25.6 18.5 30.2 81.1 -9.1 9.5 63.1 63.4 23.8 -8.2 (first half)

How Does r Work? Consider the lunch vs. graduation data: (first half) x y x –x y –y (x –x)(y –y) 41.2 68.9 1.9 -2.7 -5.13 40.2 62.9 0.9 -8.7 -7.83 45.8 67.7 6.5 -3.9 -25.35 22.5 80.5 -16.8 8.9 -149.52 25.7 73.0 -13.6 1.4 -19.04 55.3 63.9 16.0 -7.7 -123.20 45.2 71.4 5.9 -0.2 -1.18 23.3 76.3 -16.0 4.7 -75.2 13.7 90.1 -25.6 18.5 -473.6 30.2 81.1 -9.1 9.5 -86.45 63.1 63.4 23.8 -8.2 -195.16 (first half)

How Does r Work? Consider the lunch vs. graduation data: (second half) x y x –x y –y (x –x)(y –y) 59.9 64.1 27.9 67.0 44.9 80.1 13.9 77.0 61.6 54.6 12.2 89.3 30.9 85.0 74.0 46.9 74.8 59.0 19.1 82.0 (second half)

How Does r Work? Consider the lunch vs. graduation data: (second half) x y x –x y –y (x –x)(y –y) 59.9 64.1 20.6 -7.5 27.9 67.0 -11.4 -4.6 44.9 80.1 5.6 8.5 13.9 77.0 -25.4 5.4 61.6 54.6 22.3 -17.0 12.2 89.3 -27.1 17.7 30.9 85.0 -8.4 13.4 74.0 46.9 34.7 -24.7 74.8 59.0 35.5 -12.6 19.1 82.0 -20.2 10.4 (second half)

How Does r Work? Consider the lunch vs. graduation data: (second half) x y x –x y –y (x –x)(y –y) 59.9 64.1 20.6 -7.5 -154.50 27.9 67.0 -11.4 -4.6 52.44 44.9 80.1 5.6 8.5 47.60 13.9 77.0 -25.4 5.4 -137.16 61.6 54.6 22.3 -17.0 -379.10 12.2 89.3 -27.1 17.7 -479.67 30.9 85.0 -8.4 13.4 -112.56 74.0 46.9 34.7 -24.7 -857.09 74.8 59.0 35.5 -12.6 -447.30 19.1 82.0 -20.2 10.4 -210.08 (second half)

Scatter Plot 90 80 Graduation Rate 70 60 50 Free Lunch Rate 20 30 40

Scatter Plot 90 80 Graduation Rate 70 60 50 Free Lunch Rate 20 30 40

Scatter Plot 90 80 Graduation Rate 70 60 50 Free Lunch Rate 20 30 40

Scatter Plot 90 The two oddballs 80 Graduation Rate 70 60 50 Free Lunch Rate 20 30 40 50 60 70 80