Correlation.

Slides:



Advertisements
Similar presentations
Correlation and regression Dr. Ghada Abo-Zaid
Advertisements

Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
SIMPLE LINEAR REGRESSION
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
SIMPLE LINEAR REGRESSION
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
STATISTICS ELEMENTARY C.M. Pascual
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Chapter 10 Correlation and Regression
SIMPLE LINEAR REGRESSION
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Is there a relationship between the lengths of body parts ?
Chapter 15 Correlation and Regression
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Hypothesis of Association: Correlation
Chapter 10 Correlation and Regression
Examining Relationships in Quantitative Research
Statistics Class 7 2/11/2013. It’s all relative. Create a box and whisker diagram for the following data(hint: you need to find the 5 number summary):
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Principles of Biostatistics Chapter 17 Correlation 宇传华 网上免费统计资源(八)
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Regression and Correlation
Review and Preview and Correlation
Spearman’s Rho Correlation
Is there a relationship between the lengths of body parts?
CHS 221 Biostatistics Dr. wajed Hatamleh
Elementary Statistics
Lecture Slides Elementary Statistics Twelfth Edition
Correlation and Simple Linear Regression
Correlation and Regression
Elementary Statistics
Chapter 10 Correlation and Regression
Lecture Slides Elementary Statistics Tenth Edition
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Lecture Slides Elementary Statistics Eleventh Edition
Presentation transcript:

Correlation

Definition The linear correlation coefficient r measures the strength of the linear relationship between paired x- and y- quantitative values in a sample. We can often see a relationship between two variables by constructing a scatterplot. page 518 of Elementary Statistics, 10th Edition

Scatterplots of Paired Data Page 519 of Elementary Statistics, 10th Edition

Scatterplots of Paired Data Page 519 of Elementary Statistics, 10th Edition

Requirements 1. The sample of paired (x, y) data is a random sample. 2. Visual examination of the scatter plot must confirm that the points approximate a certain pattern. 3. The outliers must be removed if they are known to be errors. page 520 of Elementary Statistics, 10th Edition Explain to students the difference between the ‘paired’ data of this chapter and the investigation of two groups of data in Chapter 9.

Notation for the Linear Correlation Coefficient n represents the number of pairs of data present.  denotes the addition of the items indicated. x denotes the sum of all x-values. x2 indicates that each x-value should be squared and then those squares added. (x)2 indicates that the x-values should be added and the total then squared. xy indicates that each x-value should be first multiplied by its corresponding y-value. After obtaining all such products, find their sum. r represents linear correlation coefficient for a sample.  represents linear correlation coefficient for a population.

Formula r = nxy – (x)(y) n(x2) – (x)2 n(y2) – (y)2 The linear correlation coefficient r measures the strength of a linear relationship between the paired values in a sample. nxy – (x)(y) n(x2) – (x)2 n(y2) – (y)2 r =

Example: Calculating r Using the simple random sample of data below, find the value of r. 3 5 1 8 6 4 Data x y page 521 of Elementary Statistics, 10th Edition.

Example: Calculating r - cont page 522 of Elementary Statistics, 10th Edition.

Example: Calculating r - cont 3 5 1 8 6 4 Data x y nxy – (x)(y) n(x2) – (x)2 n(y2) – (y)2 r = 4(61) – (12)(23) 4(44) – (12)2 4(141) – (23)2 -32 33.466 = -0.956 Page 522 of Elementary Statistics, 10th Edition

Properties of the Linear Correlation Coefficient r 2. The value of r does not change if all values of either variable are converted to a different scale. 3. The value of r is not affected by the choice of x and y. Interchange all x- and y-values and the value of r will not change. 4. r measures strength of a linear relationship. page 524 of Elementary Statistics, 10th Edition If using a graphics calculator for demonstration, it will be an easy exercise to switch the x and y values to show that the value of r will not change.

Interpreting r : Explained Variation The value of r2 is the proportion of the variation in y that is explained by the linear relationship between x and y. For Example if r = 0.926, we get r2 = 0.857. We conclude that 0.857 (or about 86%) of the variation in Y can be explained by the linear relationship between X and Y. This implies that 14% of the variation in Y cannot be explained by X page 524 of Elementary Statistics, 10th Edition If using a graphics calculator for demonstration, it will be an easy exercise to switch the x and y values to show that the value of r will not change.

Formal Hypothesis Test We wish to determine whether there is a significant linear correlation between two variables. H0: = (no significant linear correlation) H1:  (significant linear correlation) page 527 of Elementary Statistics, 10th Edition

Test Statistic is t t = Use Tables with degrees of freedom = n – 2 Critical values: Use Tables with degrees of freedom = n – 2 This is the first example in the text where the degrees of freedom for Table A-3 is different from n - 1. Special note should be made of this.

P-value: Conclusion: Use Tables with degrees of freedom = n – 2 If the absolute value of t is > critical value reject H0 and conclude that there is a linear correlation. If the absolute value of t ≤ critical value, fail to reject H0; there is not sufficient evidence to conclude that there is a linear correlation.

(follows format of earlier chapters) Test Statistic is t (follows format of earlier chapters) This is the drawing used to verify the position of the sample data t value in regard to the critical t values for the example which begins on page 527 of Elementary Statistics, 10th Edition. Drawing is at the top of page 528.

Covariance Measure of linear relationship between variables If the relationship between the random variables is nonlinear, the covariance might not be sensitive to the relationship

Pearson’s Correlation Coeff. Pearson's correlation coefficient between two variables is defined as the covariance of the two variables divided by the product of their standard deviations: The above formula defines the population correlation coefficient, commonly represented by the Greek letter ρ (rho). Substituting estimates of the covariances and variances based on a sample gives the sample correlation coefficient, commonly denoted r :                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          

Pearson correlation coefficient The Spearman correlation coefficient is often thought of as being the Pearson correlation coefficient between the ranked variables. In practice, however, a simpler procedure is normally used to calculate ρ. The n raw scores Xi, Yi are converted to ranks xi, yi, and the differences di = xi − yi between the ranks of each observation on the two variables are calculated Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

A Spearman correlation of 1 results when the two variables being compared are monotonically related, even if their relationship is not linear. In contrast, this does not give a perfect Pearson correlation Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

When the data are roughly elliptically distributed and there are no prominent outliers, the Spearman correlation and Pearson correlation give similar values Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.

The Spearman correlation is less sensitive than the Pearson correlation to strong outliers that are in the tails of both samples Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley.