Download presentation
Presentation is loading. Please wait.
Published byElizabeth Hunt Modified over 9 years ago
1
Correlation
2
Descriptive technique: Describes the relationship between two variables Variables are observed or measured but rarely manipulated thus we CANNOT infer causation
3
You have a pair of observations -one for each variable Eg height and weight Anxiety and introversion You measure all people on both variables N = number of pairs
4
Characteristics of the relationship measured 1)Direction is it positive or negative- Indicated by the sign of the correlation 2) The strength of the relationship Indicated by the number of the correlation +/- 1 is a perfect correlation 0 indicates no linear relationship 3) The form – is it linear?
5
Mean heights of a group of children in Kalama, an Egyptian village that is the site of a study of nutrition in developing countries. The data were obtained by measuring the heights of all 161 children in the village each month over several years. Age in months Height in cm
6
Scatterplot of age vs height
8
Correlation does not mean causation
9
Before Jonas Salk found the polio vaccine researchers looked for relationship with polio and anything – any protection Found a correlation between soft-drink sales and polio. The more soft-drinks sold the more cases of polio…… Or factoid... The more churches per square mile the higher the crime.....
10
So why use correlation if we can’t infer causation ??? 1) prediction – if know the relationship then can predict the value of one variable if know the other. 2) validity – if design a new test and you want to see if it tests the same thing as the old one. … 3) reliability – will the test change over time or over different observers 4) test a theory
11
r = amount that X and Y vary together amount that X and Y vary separately r = covariability of X and Y variability of X and Y separately
13
r is unchanged if 1) interchange the two variables call height X and age Y or height Y and age X 2) Add a constant to all the values of one variable 3) Multiply all the values of one variable by a constant. (changes mean and sd but not the relative positions)
14
18.00 76.10 78.10 152.20. 19.00 77.00 79.00 154.00. 20.00 78.10 80.10 156.20. 21.00 78.20 80.20 156.40. 22.00 78.80 80.80 157.60. 23.00 79.70 81.70 159.40. 24.00 79.90 81.90 159.80. 25.00 81.10 83.10 162.20. 26.00 81.20 83.20 162.40. 27.00 81.80 83.80 163.60. 28.00 82.80 84.80 165.60. 29.00 83.50 85.50 167.00. age ht Ht+2Ht X 2
15
+2X2
16
Coefficient of Determination r is correlation r 2 is coefficient of determination Tells you how much of the variability is explained by the correlation
17
Interpretation 1)No causation 2) Value of r is influenced by range 3) A zero correlation tells you that there is no linear relationship. 4) Outliers can have a huge impact web
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.