Presentation is loading. Please wait.

Presentation is loading. Please wait.

 What is correlation?  How to compute?  How to interpret? 2.

Similar presentations


Presentation on theme: " What is correlation?  How to compute?  How to interpret? 2."— Presentation transcript:

1

2  What is correlation?  How to compute?  How to interpret? 2

3  The relations between two variables  How the value of one variable changes when the value of another variable changes  A correlation coefficient is a numerical index to reflect the relationship between two variables.  Range: -1 ~ +1  Bivariate correlation (for two variables) 3

4  Parametric  Pearson product-moment correlation (named for inventor Karl Pearson)  Non-parametric  Spearman’s rank correlation  Kendall tau rank correlation coefficient 4

5  For two variables which are continuous in nature  Height, age, test score, income  But not for discrete or categorical variables  Race, political affiliation, social class, rank R xy is the correlation between variable X and variable Y 5

6  Direct correlation (positive correlation):  If both variables change in the same direction  Indirect correlation (negative correlation):  If both variables change in opposite directions 6

7  Below is Correlation Report of different Currency Exchange Rate on November 13 – 2014 (source: Bloomberg Terminal)  -0.8 and 0.5, which is stronger? 7

8 the correlation coefficient between X and Y n the size of the sample X the individual’s score on the X variable Ythe individual’s score on the Y variable XYthe product of each X score times its corresponding Y score X 2 the individual X score, squared Y 2 the individual Y score, squared 8

9  Calculate Pearson correlation coefficient for US school enrollment (unit: k) in some time points of previous 50 years. (Source: United States Census Bureau) 1. Select two columns of data – are they correlated? 2. What does this correlated mean? 9 Year G9-12 Public G9-12 Private College- Public College- Private 196511610140039701951 197013336131164282153 197514304130088362350 198013231133994572640 198512388136294792768 1990113411136108452974 1995125021163110923169 2000135171264117533560 2005149091349130224466

10  CORREL function  Or PEARSON function 10

11  Scatterplot or scattergram X Y 11 XY 23 42 56 65 43 76 85 54 64 75

12 12

13  r =1, a perfect direct (or positive) correlation  In real life case, 0.7 and 0.8 could be the highest you will see 13

14  Strength and direction are important 14

15 Four sets of data with the same correlation of 0.816 15

16  Linear correlation means that X and Y are in one straight line  Curvlilinear correlation  Age and memory 16

17 incomeeducationattitudevote 741901311 809311232 813141142 730891152 620231132 612171042 845261151 872511141 626591252 764501062 705121272 78858961 786281371 862121482 74962982 588281194 614711085 786211275 60071984 How to calculate the correlation coefficient? 1.CORREL() 2.Correlation in data analysis toolset 17

18  Correlation matrix IncomeEducationAttitudeVote Income1.000.35-0.19-0.51 Education1.00-0.21-0.20 Attitude1.000.55 Vote1.00 18

19  Data Analysis tool - correlation 19

20  Correlation value:  - finite number ~ + finite number  Correlation coefficient value:  -1.00 ~ +1.00 r xy valueInterpretation 0.8 ~ 1.0Very strong relationship (share most of the things in common) 0.6 ~0.8Strong relationship (share many things in common) 0.4 ~ 0.6Moderate relationship (share something in common) 0.2 ~ 0.4Weak relationship (share a little in common) 0.0 ~ 0.2Weak or no relationship (share very little or nothing in common) 20

21  Coefficient of determination:  The percentage of variance in one variable that is accounted for by the variance in the other variable.  = square of coefficient 49% of the variance in GPA can be explained by the variance in studying time 21

22  The amount of unexplained variance is called the coefficient of undetermination (coefficient of alienation) correlationdeterminationinterpretation 00 0.50.25 0.90.81 22

23  In a small town in Greece,  The local police found the direct correlation between ice cream and crime 23

24  The correlation represents the association between two or more variables  It has nothing to do with causality (there is no cause relation between two correlated variables)  Ices cream and crime are correlated, but  Ices cream does not cause crime 24

25 Summer Summer is when people get together. More specifically, casual drinkers and drug users are more likely to go to bars or parties on weekends and evenings, as opposed to a Tuesday morning. These people in the social mix, flooding the city’s streets and neighborhood bars, feed the peak times for murder, experts say. 25


Download ppt " What is correlation?  How to compute?  How to interpret? 2."

Similar presentations


Ads by Google