Correlation and Covariance

Slides:



Advertisements
Similar presentations
Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)
Advertisements

Covariance and Correlation
Chapter 16: Correlation.
Lab 3: Correlation and Retest Reliability. Today’s Activities Review the correlation coefficient Descriptive statistics for Big Five at Time 1 and Time.
The Laws of Linear Combination James H. Steiger. Goals for this Module In this module, we cover What is a linear combination? Basic definitions and terminology.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
The Laws of Linear Combination James H. Steiger. Goals for this Module In this module, we cover What is a linear combination? Basic definitions and terminology.
Basic Statistical Concepts
Correlation-Regression The correlation coefficient measures how well one can predict X from Y or Y from X.
Statistics Psych 231: Research Methods in Psychology.
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Conditional Distributions and the Bivariate Normal Distribution James H. Steiger.
Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Correlation and regression 1: Correlation Coefficient
CORRELATION & REGRESSION
Covariance and correlation
Correlation.
Correlation1.  The variance of a variable X provides information on the variability of X.  The covariance of two variables X and Y provides information.
Correlation. Definition: the degree of relationship between two or more variables. For example, smoking and lung cancer are correlated: – if we look at.
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Psychology 820 Correlation Regression & Prediction.
Today: Standard Deviations & Z-Scores Any questions from last time?
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Correlation and Covariance
CHAPTER 3 Describing Relationships
Covariance and Correlation
Chapter 3: Describing Relationships
Statistics for the Social Sciences
Chapter 3: Describing Relationships
Introduction to bivariate data
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
The Pearson Correlation
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Correlation and Covariance
Chapter 3 Scatterplots and Correlation.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Ch 4.1 & 4.2 Two dimensions concept
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Correlation and Covariance
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Correlation and Covariance James H. Steiger

Goals for Today Introduce the statistical concepts of Covariance Correlation Investigate invariance properties Develop computational formulas

Covariance So far, we have been analyzing summary statistics that describe aspects of a single list of numbers Frequently, however, we are interested in how variables behave together

Smoking and Lung Capacity Suppose, for example, we wanted to investigate the relationship between cigarette smoking and lung capacity We might ask a group of people about their smoking habits, and measure their lung capacities

Smoking and Lung Capacity Cigarettes (X) Lung Capacity (Y) 45 5 42 10 33 15 31 20 29

Smoking and Lung Capacity With SPSS, we can easily enter these data and produce a scatterplot.

Smoking and Lung Capacity We can see easily from the graph that as smoking goes up, lung capacity tends to go down. The two variables covary in opposite directions. We now examine two statistics, covariance and correlation, for quantifying how variables covary.

Covariance When two variables covary in opposite directions, as smoking and lung capacity do, values tend to be on opposite sides of the group mean. That is, when smoking is above its group mean, lung capacity tends to be below its group mean. Consequently, by averaging the product of deviation scores, we can obtain a measure of how the variables vary together.

The Sample Covariance Instead of averaging by dividing by N, we divide by . The resulting formula is

Calculating Covariance Cigarettes (X) dX dXdY dY Lung Capacity (Y) -10 -90 +9 45 5 -5 -30 +6 42 10 -3 33 15 +5 -25 31 20 +10 -70 -7 29

Calculating Covariance So we obtain

Invariance Properties of Covariance The covariance is invariant under listwise addition, but not under listwise multiplication. Hence, it is vulnerable to changes in standard deviation of the variables, and is not scale-invariant.

Invariance Properties of Covariance

Invariance Properties of Covariance Multiplicative constants come straight through in the covariance, so covariance is difficult to interpret – it incorporates information about the scale of the variables.

The (Pearson) Correlation Coefficient Like covariance, but uses Z-scores instead of deviations scores. Hence, it is invariant under linear transformation of the raw scores.

Alternative Formula for the Correlation Coefficient

Computational Formulas -- Covariance There is a computational formula for covariance similar to the one for variance. Indeed, the latter is a special case of the former, since variance of a variable is “its covariance with itself.”

Computational Formula for Correlation By substituting and rearranging, you obtain a substantial (and not very transparent) formula for

Computing a correlation Cigarettes (X) XY Lung Capacity (Y) 2025 45 5 25 210 1764 42 10 100 330 1089 33 15 225 465 961 31 20 400 580 841 29 50 750 1585 6680 180

Computing a Correlation