Presentation is loading. Please wait.

Presentation is loading. Please wait.

Correlation. Correlation refers to a relationship that exists between pairs of measures. Knowledge of the strength and direction of the relationship allows.

Similar presentations


Presentation on theme: "Correlation. Correlation refers to a relationship that exists between pairs of measures. Knowledge of the strength and direction of the relationship allows."— Presentation transcript:

1 Correlation

2 Correlation refers to a relationship that exists between pairs of measures. Knowledge of the strength and direction of the relationship allows us to predict one variable from the other with an accuracy greater than chance.

3 Correlation For example, you can guess someone’s weight more accurately if you know how tall they are because height and weight are positively correlated.

4 Correlation For example, you can guess someone’s weight more accurately if you know how tall they are because height and weight are positively correlated. When two variables are positively correlated that means they tend to both move higher or both move lower at the same time.

5 Correlation For example, you can guess someone’s weight more accurately if you know how tall they are because height and weight are positively correlated. When two variables are positively correlated that means they tend to both move higher or both move lower at the same time. Generally, taller people weigh more than shorted people.

6 Correlation When two variable are negatively correlated that means that they change in value inversely. Higher scores on one generally go with lower scores on the other.

7 Correlation When two variable are negatively correlated that means that they change in value inversely. Higher scores on one generally go with lower scores on the other. For example, the outdoor temperature and the weight of one’s clothing are negatively correlated. The higher the temperature is, the less clothing we wear. The lower the outdoor temperature, the more clothing we wear.

8 Correlation When two variable are negatively correlated that means that they change in value inversely. Higher scores on one generally go with lower scores on the other. For example, the outdoor temperature and the weight of one’s clothing are negatively correlated. The higher the temperature is, the less clothing we wear. The lower the outdoor temperature, the more clothing we wear. Another example: How old you are and how much longer you will live are negatively correlated.

9 Correlation When two variable are negatively correlated that means that they change in value inversely. Higher scores on one generally go with lower scores on the other. For example, the outdoor temperature and the weight of one’s clothing are negatively correlated. The higher the temperature is, the less clothing we wear. The lower the outdoor temperature, the more clothing we wear. Another example: How old you are and how much longer you will live are negatively correlated. The existence of a negative correlation does not mean the absence of a relationship. It means that the variables tend to move in opposite directions, not that the variables are unrelated.

10 Correlation A near zero correlation is said to exist when scores on the two variables are unrelated. Higher scores on one variable are just as likely to be accompanied by higher scores as by lower scores on the other variable.

11 Correlation A near zero correlation is said to exist when scores on the two variables are unrelated. Higher scores on one variable are just as likely to be accompanied by higher scores as by lower scores on the other variable. An example: The street number on your house and the odometer reading of your car.

12 Correlation For Each of the Following Examples, State From Your General Knowledge Whether the Correlation Between the Two Variables is Likely to be Positive, Negative, or Near Zero, and Explain Why

13 Average Number of Calories Eaten Per Day and Body Weight

14 Positive – Caloric Intake is One of the Major Determinants of Body Weight

15 Golf Scores and the Number of Years of Golfing Experience

16 Negative – Golfers Improve with Experience and Hence Would Be Expected to Get Better (Lower) Scores

17 Length of Hair and Shoe Size in Adult Males

18 Near Zero – It’s Doubtful That These Two Measures Could Be Influenced By Common Factors

19 Amount of Formal Education One Has Received and the Time Spent Collecting Public Assistance (Welfare)

20 Negative – Educated Individuals Are More Likely to Be Employable and Hence Less Likely to Need Welfare

21 Per Capita Consumption of Alcohol in a Group of Cites and Suicide Rates in Those Cities

22 Positive – A Common Set of Stresses and Other Factors Are Likely to Influence Rates of Both Alcoholism and Suicide in a Given Community

23 Number Correct on a Current Events Test and Time Spent Reading the Newspaper

24 Positive – The Newspapers Are Full of Stories Concerning Current Events Around the World

25 Strength of Traditional Religious Beliefs and Favorableness of Attitude Toward Abortion on Demand

26 Negative – Members of Traditional Religious Groups Are More Likely to Regard Abortion as Immoral Than Others

27 Height and Political Conservatism

28 Near Zero

29 Correlation Correlations can be represented either graphically by the construction of a special type of graph called a scatter-plot diagram or through the computation of an index called a coefficient of correlation.

30 Correlation Correlations can be represented either graphically by the construction of a special type of graph called a scatter-plot diagram or through the computation of an index called a coefficient of correlation. Most of the seminal work that went into the development of these representation was done by the early statistician Karl Pearson, employed by the Guinness Brewery.

31 Correlation Construction of a scatter-plot diagram requires acquiring pairs of scores from each subject. For example, suppose we wanted to look at the relationship between height and self esteem in men. Perhaps we have a hypothesis that how tall you are effects your level of self esteem. So we collect pairs of scores from twenty male individuals. Height, measured in inches, and Self Esteem based on a self- rating scale (where higher scores mean higher self esteem).

32 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6

33 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6

34 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6

35 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6

36 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6

37 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6 Linear Regression

38 ManHeight Self Esteem 1684.1 2714.6 3623.8 4754.4 5583.2 6603.1 7673.8 8684.1 9714.3 10693.7 11683.5 12673.2 13633.7 14623.3 15603.4 16634.0 17654.1 18673.8 19633.4 20613.6 Linear Regression Best Fitting Line Line of Prediction

39 Coefficient of Correlation

40 A statistical computation that indicates the strength and direction of an underlying correlation

41 Coefficient of Correlation A statistical computation that indicates the strength and direction of an underlying correlation

42 Coefficient of Correlation A statistical computation that indicates the strength and direction of an underlying correlation Always results in a signed number in the range from -1.00 to +1.00 If the sign is positive, that indicates the underlying relationship is a positive correlation. If the sign is negative, it indicates an underlying negative correlation. The closer the value is to “1” (either positive or negative) the stronger is the indicated underlying relationship.

43 Coefficient of Correlation A statistical computation that indicates the strength and direction of an underlying correlation Always results in a signed number in the range from -1.00 to +1.00 If the sign is positive, that indicates the underlying relationship is a positive correlation. If the sign is negative, it indicates an underlying negative correlation. The closer the value is to “1” (either positive or negative) the stronger is the indicated underlying relationship.

44 Coefficient of Correlation A statistical computation that indicates the strength and direction of an underlying correlation Always results in a signed number in the range from -1.00 to +1.00 If the sign is positive, that indicates the underlying relationship is a positive correlation. If the sign is negative, it indicates an underlying negative correlation. The closer the value is to “1” (either positive or negative) the stronger is the indicated underlying relationship.

45 Coefficient of Correlation r = +.86

46 Coefficient of Correlation r = +.86 Sign

47 Coefficient of Correlation r = +.86 Sign Magnitude

48 Coefficient of Correlation r = +.86 Sign Magnitude Direction

49 Coefficient of Correlation r = +.86 Sign Magnitude DirectionStrength

50 Coefficient of Correlation r = +.86 r = +.31 r = -.96 r = +.04 r = +1.02

51 Coefficient of Correlation r = +.86 r = +.31 r = -.96 r = +.04 r = +1.02 Strongest Weakest Computational Error

52

53 Direction? Strength? Prediction?

54

55

56

57

58

59 Correlation Does Not Imply Causation

60 The Third Variable Problem The Problem of Directionality

61 Correlation Does Not Imply Causation The Third Variable Problem Refers to the possibility that two variables are correlated with each other, not because one causes the other, but because both are effects of some third unidentified cause.

62 Correlation Does Not Imply Causation The Problem of Directionality Even when two variables are correlated because of a causal relationship between them, from the correlational data alone, we can not tell which is the cause and which is the effect.

63 Why Do Correlation Research?

64 When we are dealing with variables that we have not yet learned to directly control

65 Why Do Correlation Research? When we are dealing with variables that we have not yet learned to directly control When we are dealing with variables that it would not be ethical to directly control

66 Why Do Correlation Research? When we are dealing with variables that we have not yet learned to directly control When we are dealing with variables that it would not be ethical to directly control Reasons of economy (cheaper, faster easier) because we are analyzing data that already exists rather than creating data through our experimentation.

67 Why Do Correlation Research? When we are dealing with variables that we have not yet learned to directly control When we are dealing with variables that it would not be ethical to directly control Reasons of economy (cheaper, faster easier) because we are analyzing data that already exists rather than creating data through our experimentation. Prelude to Experimentation


Download ppt "Correlation. Correlation refers to a relationship that exists between pairs of measures. Knowledge of the strength and direction of the relationship allows."

Similar presentations


Ads by Google