Presentation is loading. Please wait.

Presentation is loading. Please wait.

R xy. When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate.

Similar presentations


Presentation on theme: "R xy. When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate."— Presentation transcript:

1 r xy

2 When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate our prediction will be

3 r xy We need a measure of the “strength” of a correlation

4 r xy We need a number that gets bigger when big numbers are paired with big numbers and small numbers are paired with small numbers We need a number that gets smaller when big numbers are paired with small numbers and small numbers are paired with big numbers

5 r xy Remember the height/weight example: Big number indicates this (strong positive correlation) 5’5’25’45’65’85’10 100110120130140150 a a b b, e c c d d ef f

6 r xy Remember the height/weight example: Small number indicates this (strong negative correlation) 5’5’25’45’65’85’10 100110120130140150 a a b b, e c c d d ef f

7 r xy Two sets of scores, x i and y i What could we do?

8 r xy What could we do?

9 r xy What could we do? When pairs are multiplied and the products are summed up: – Greatest when big numbers paired with big numbers and small numbers with small numbers –Least when small numbers are paired with big numbers and big numbers are paired with small numbers

10 r xy analogy: This gets you most money Pennies Quarters Loonies

11 r xy analogy:this gets you the least… Pennies Quarters Loonies

12 r xy analogy: Because: 3 x $1 plus 2 x $0.25 plus 1 x $0.01 is more than 1 x $1 plus 2 x $0.25 plus 3 x $0.01

13 r xy But there’s a problem Not a good measure because the value ultimately depends on n AND the size of the numbers

14 r xy Try this

15 r xy Try this Still not so good - doesn’t depend on n anymore, but does depend on size of x’s and y’s

16 r xy How about multiply deviation scores –comparing each variable relative to its respective mean

17 r xy Multiply deviation scores Now value depends on the spread of the data

18 r xy So standardize the scores

19 r xy This measures strength of correlation: = = r xy

20 r xy r xy ranges from -1.0 indicating a perfect negative correlation to +1.0 indicating a perfect positive correlation an r xy of zero indicates no correlation whatsoever. Scores are random with respect to each other.

21 r xy r xy also has a geometric meaning

22 r xy r xy also has a geometric meaning Recall that the mean of the z x and z y distributions is zero and each z-score is a deviation from the mean

23 r xy Each point lands in one of four quadrants point z x, z y zxzx zyzy

24 r xy notice that: both z x and z y are positive r xy =

25 r xy notice that: z x is negative and z y is positive r xy =

26 r xy notice that: z x is negative and z y is negative r xy =

27 r xy notice that: z x is positive and z y is negative r xy =

28 r xy So Thus if most points tend to fall around a line with a positive (45 degree) slope (I and III), the cross-products will tend to be positive I II III IV

29 r xy So If most points tend to fall around a line with a negative slope (II and IV), the cross products will tend to be negative Thus if most points tend to fall around a line with a positive (45 degree) slope (I and III), the cross-products will tend to be positive I II III IV

30 r xy So If the points were randomly scattered about, the negative and positive cross-products cancel

31 Covariance a related measure of the relationship between scores on two different variables is the covariance

32 Covariance notice that the variance (S 2 x ) is the covariance between a variable and itself !

33 Regression If two variables are perfectly correlated (r = + or - 1.0) then one can exactly predict a score on one variable given a score on another

34 Regression For example: a university charges $250 registration fee plus $100 / credit

35 Regression tuition = $100(X) + $250 –where X is the number of credits Notice this is a linear relationship (an equation of the form y = ax + b –a = $100/credit –b = $250 –x = number of credits

36 Regression Tuition as a function of credit hours is a straight line There is a perfect correlation between credit hours and tuition You could predict perfectly the tuition required given the number of credit hours

37 Next Time Regression - read chapter 8


Download ppt "R xy. When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate."

Similar presentations


Ads by Google