Presentation is loading. Please wait.

Presentation is loading. Please wait.

Correlation This Chapter is on Correlation We will look at patterns in data on a scatter graph We will be looking at how to calculate the variance and.

Similar presentations


Presentation on theme: "Correlation This Chapter is on Correlation We will look at patterns in data on a scatter graph We will be looking at how to calculate the variance and."— Presentation transcript:

1

2 Correlation This Chapter is on Correlation We will look at patterns in data on a scatter graph We will be looking at how to calculate the variance and co-variance of variables We will see how to numerically measure the strength of correlation between two variables

3

4 Correlation Scatter Graphs Scatter Graphs are a way of representing 2 sets of data. It is then possible to see whether they are related. Positive Correlation  As one variable increases, so does the other Negative Correlation  As one variable increases, the other decreases No Correlation  There seems to be no pattern linking the two variables Positive Negative None 6A

5 Correlation Scatter Graphs In the study of a city, the population density, in people/hectare, and the distance from the city centre, in km, was investigated by choosing sample areas. The results are as follows: Plot a scatter graph and describe the correlation. Interpret what the correlation means. AreaABCDE Distance0.63.82.43.02.0 Pop. Density 5022142033 AreaFGHIJ Distance1.51.83.44.00.9 Pop. Density 472581638 01423 10 20 30 50 40 0 Distance from centre (km) Pop. Density (people/hectare) The correlation is negative, which means that as we get further from the city centre, the population density decreases.

6

7 Correlation Variability of Bivariate Data We learnt in chapter 3 that: In Correlation: Similarly for y: And you can also calculate the Co-variance of both variables 6B/C (Although remember that this formula changed to make it easier to use) ‘How x varies’ ‘How y varies’ ‘How x and y vary together’

8 Correlation Variability of Bivariate Data Like in chapter 3, we can use a formula which will make calculations easier BUT: 6B/C

9 Correlation Variability of Bivariate Data Multiply both sides by ‘n’ The easier formula for variance from chapter 3 For the second fraction, square the top and bottom separately Variability of Bivariate Data Multiplying both fractions by ‘n’ will cancel a ‘divide by n’ from each of them 6B/C

10 Correlation Variability of Bivariate Data These are the formulae for S xx, S yy and S xy. You are given these in the formula booklet. You do not need to know how to derive them (like we just did!) 6B/C

11 Correlation Variability of Bivariate Data Calculate Sxx, Syy and Sxy, based on the following information. 6B/C

12 Correlation Variability of Bivariate Data The following table shows babies heads’ circumferences (cm) and the gestation period (weeks) for 6 new born babies. Calculate S xx, S yy and S xy. We need 120014001178114012211116 xy 1600 1444 13691296 y2y2 90012259619001089961 x2x2 40 38 3736 Gestation period (y) 303531303331 Head size (x) FEDCBABaby 6B/C

13 Correlation Variability of Bivariate Data The following table shows babies heads’ circumferences (cm) and the gestation period (weeks) for 6 new born babies. Calculate S xx, S yy and S xy. We need 6B/C

14 Correlation Product Moment Correlation Coefficient We can test the correlation of data by calculating the Product Moment Correlation Coefficient. This uses S xx, S yy and S xy. The value of this number tells you what the correlation is and how strong it is. The closer to 1, the stronger the positive correlation. The same applies for -1 and negative correlation. A value close to 0 implies no linear correlation. Positive Correlation Negative Correlation 10 No Linear Correlation 6B/C

15 Correlation Product Moment Correlation Coefficient Given the following data, calculate the Product Moment Correlation Coefficient. There is positive correlation, as x increases, y does as well. 6B/C

16 Correlation Limitations of the Product Moment Correlation Coefficient Sometimes it may indicate Correlation between unrelated variables  Cars on a particular street have increased, as have the sales of DVDs in town  The PMCC would indicate positive correlation where the two are most likely not linked  The speed of computers has increased, as has life expectancy amongst people  These are not directly linked, but are both due to scientific developments 6B/C

17

18 Correlation Using Coding with the PMCC Calculating the PMCC from this table. 6D 391403744036565351903450532640xy 144400129600126025119025112225102400y2y2 106091081610609104041060910404x2x2 380360355345335320y 103104103102103102x

19 Correlation Using Coding with the PMCC Calculating the PMCC from this table. 391403744036565351903450532640xy 144400129600126025119025112225102400y2y2 106091081610609104041060910404x2x2 380360355345335320y 103104103102103102x 6D

20 Correlation Using Coding with the PMCC Calculating the PMCC from this table, using coding. 6D 48 3318218pq 256144121814916q2q2 9 9494p2p2 1211974q 343232p 380360355345335320y 103104103102103102x

21 Correlation Using Coding with the PMCC Calculating the PMCC from this table. So coding will not affect the PMCC! 48 3318218pq 256144121814916q2q2 9 9494p2p2 1211974q 343232p 380360355345335320y 103104103102103102x 6D

22 Summary We have looked at plotting scatter graphs We have looked at calculating measures of variance, S xx, S yy and S xy We have also seen types of correlation and how to recognise them on a graph We have calculated the Product Moment Correlation Coefficient, and interpreted it. It is a numerical measure of correlation.


Download ppt "Correlation This Chapter is on Correlation We will look at patterns in data on a scatter graph We will be looking at how to calculate the variance and."

Similar presentations


Ads by Google