Correlation and Covariance

Slides:



Advertisements
Similar presentations
Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)
Advertisements

Chapter 16: Correlation.
When Simultaneous observations on hydrological variables are available then one may be interested in the linear association between the variables. This.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Descriptive Statistics: Numerical Measures
17 Correlation. 17 Correlation Chapter17 p399.
Correlation and Covariance
Five-Number Summary 1 Smallest Value 2 First Quartile 3 Median 4
Major Points Scatterplots The correlation coefficient –Correlations on ranks Factors affecting correlations Testing for significance Intercorrelation matrices.
BIVARIATE DATA: CORRELATION AND REGRESSION Two variables of interest: X, Y. GOAL: Quantify association between X and Y: correlation. Predict value of Y.
Analysis of Research Data
Cal State Northridge  320 Andrew Ainsworth PhD Correlation.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Terms, Scatterplots and Correlation.
Cal State Northridge 427 Ainsworth
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Examples of continuous probability distributions: The normal and standard normal.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)
Equations in Simple Regression Analysis. The Variance.
CORRELATION & REGRESSION
Portfolio Management-Learning Objective
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Correlation.
Correlation1.  The variance of a variable X provides information on the variability of X.  The covariance of two variables X and Y provides information.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Anthony Greene1 Correlation The Association Between Variables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Measures of Variability James H. Steiger. Overview Discuss Common Measures of Variability Range Semi-Interquartile Range Variance Standard Deviation Derive.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Wednesday, October 12 Correlation and Linear Regression.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
MEASURES of CORRELATION. CORRELATION basically the test of measurement. Means that two variables tend to vary together The presence of one indicates the.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
Variables and Random Variables àA variable is a quantity (such as height, income, the inflation rate, GDP, etc.) that takes on different values across.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
1 Inferences About The Pearson Correlation Coefficient.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
9.1B – Computing the Correlation Coefficient by Hand
Measures of Variability James H. Steiger. Overview Discuss Common Measures of Variability Range Semi-Interquartile Range Variance Standard Deviation Derive.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
Summarizing Data Graphical Methods. Histogram Stem-Leaf Diagram Grouped Freq Table Box-whisker Plot.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Understand.
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
Tests of hypothesis Contents: Tests of significance for small samples
Simple Linear Correlation
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Correlation and Covariance
Design and Data Analysis in Psychology II
Covariance and Correlation
Correlation – Regression
Basic Statistics Overview
Elementary Statistics
Examining Relationships in Data
The Pearson Correlation
Correlation and Covariance
Karl’s Pearson Correlation
Inferential Statistics
Hypothesis Testing: The Difference Between Two Population Means
Ch 4.1 & 4.2 Two dimensions concept
Geology Geomath Chapter 7 - Statistics tom.h.wilson
Z Scores & Correlation.
Principal Component Analysis (PCA)
Interpreting & Summarizing
Presentation transcript:

Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)

Goals Introduce concepts of Develop computational formulas Covariance Correlation Develop computational formulas R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance Variables may change in relation to each other Covariance measures how much the movement in one variable predicts the movement in a corresponding variable R F Riesenfeld Sp 2010 CS5961 Comp Stat

Smoking and Lung Capacity Example: investigate relationship between cigarette smoking and lung capacity Data: sample group response data on smoking habits, and measured lung capacities, respectively R F Riesenfeld Sp 2010 CS5961 Comp Stat

Smoking v Lung Capacity Data Cigarettes (X ) Lung Capacity (Y ) 1 45 2 5 42 3 10 33 4 15 31 20 29 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Smoking and Lung Capacity

Smoking v Lung Capacity Observe that as smoking exposure goes up, corresponding lung capacity goes down Variables covary inversely Covariance and Correlation quantify relationship R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance Variables that covary inversely, like smoking and lung capacity, tend to appear on opposite sides of the group means When smoking is above its group mean, lung capacity tends to be below its group mean. Average product of deviation measures extent to which variables covary, the degree of linkage between them R F Riesenfeld Sp 2010 CS5961 Comp Stat

The Sample Covariance Similar to variance, for theoretical reasons, average is typically computed using (N -1), not N . Thus, R F Riesenfeld Sp 2010 CS5961 Comp Stat

Calculating Covariance Cigs (X ) Lung Cap (Y )   45 5 42 10 33 15 31 20 29 36 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Calculating Covariance Cigs (X )   Cap (Y ) -10 -90 9 45 5 -5 -30 6 42 10 -3 33 15 -25 31 20 -70 -7 29 ∑= -215 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance Calculation (2) Evaluation yields, R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance under Affine Transformation R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance under Affine Transf (2) R F Riesenfeld Sp 2010 CS5961 Comp Stat

(Pearson) Correlation Coefficient rxy Like covariance, but uses Z-values instead of deviations. Hence, invariant under linear transformation of the raw data. R F Riesenfeld Sp 2010 CS5961 Comp Stat

Alternative (common) Expression R F Riesenfeld Sp 2010 CS5961 Comp Stat

Computational Formula 1 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Computational Formula 2 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Table for Calculating rxy Cigs (X ) X 2 XY Y 2 Cap (Y ) 2025 45 5 25 210 1764 42 10 100 330 1089 33 15 225 465 961 31 20 400 580 841 29 ∑= 50 750 1585 6680 180 R F Riesenfeld Sp 2010 CS5961 Comp Stat

Computing rxy from Table R F Riesenfeld Sp 2010 CS5961 Comp Stat

Computing Correlation R F Riesenfeld Sp 2010 CS5961 Comp Stat

Conclusion rxy = -0.96 implies almost certainty smoker will have diminish lung capacity Greater smoking exposure implies greater likelihood of lung damage R F Riesenfeld Sp 2010 CS5961 Comp Stat

Covariance & Correlation End Covariance & Correlation Notes R F Riesenfeld Sp 2010 CS5961 Comp Stat