Presentation is loading. Please wait.

Presentation is loading. Please wait.

CORRELATION. Bivariate Distribution Observations are taken on two variables Two characteristics are measured on n individuals e.g : The height (x) and.

Similar presentations


Presentation on theme: "CORRELATION. Bivariate Distribution Observations are taken on two variables Two characteristics are measured on n individuals e.g : The height (x) and."— Presentation transcript:

1 CORRELATION

2 Bivariate Distribution Observations are taken on two variables Two characteristics are measured on n individuals e.g : The height (x) and weight (y) of 10 students A single characteristic is measured on two groups of individuals e.g : The height of 10 males (x) and 10 females (y)

3 HeightSelf-esteem 684.1 714.6 623.8 754.4 583.2 603.1 673.8 684.1 714.3 693.7 683.5 673.2 633.7 623.3 603.4 634 654.1 673.8 633.4 613.6

4 Definition Correlation is used to measure and describe a relationship/association between two variables A single number which describes the relationship between X and Y is the correlation coefficient. Denoted by ‘r’ or ‘ρ ’.

5 Scatter Diagram

6 What is the relationship between level of education and lifetime earnings?

7 Direction of Relationship A scatter plot shows at a glance the direction of the relationship. A positive correlation indicates a directly proportional relationship.

8 Direction of Relationship A negative correlation indicates an inversely proportional relationship

9 No Correlation In cases where there is no correlation between two variables, the dots are scattered about the plot in an irregular pattern.

10 Correlation Coefficient The correlation coefficient measures three characteristics of the relationship between X and Y: The direction of the relationship. The form of the relationship. The degree of the relationship

11 Karl Pearson Correlation

12 Calculation Calculate the KP Correlation for data in slide 3. Ans: 0.73 Interpretation: The data exhibits a strong positive correlation indicating that self-esteem increases with height.

13

14 The data shows a high positive correlation between income and education.

15 Drawbacks Presence of outliers Nonlinear scatter plot of x and y values. In the next slide scatter plots are shown for 7 different datasets that have the same correlation r=0.70. Is the use of r justified in each case?

16

17 Rank Correlation Age (mths) Stopping distance Age rankStopping rank dd2d2 928.41100 1529.32200 2437.637416 3036.244.50.50.25 3836.55611 4635.363-39 5336.274.5-2.56.25 6044.18800 6444.89900 7647.210 00 32.5

18 Scatter Plot

19 Calculations Number in sample (n) = 10 r = 1 - (195 / 10 x 99) r = 1 - 0.197 r = 0.803

20 Probable Error If r>6P.E, then correlation is highly significant in the population, otherwise it is insignificant.

21 Caution Correlation does not imply causation. Example : Average temperature (x) in a month and number of ice cream vendors (y). r=0.92 (Highly positive)


Download ppt "CORRELATION. Bivariate Distribution Observations are taken on two variables Two characteristics are measured on n individuals e.g : The height (x) and."

Similar presentations


Ads by Google