Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)

Slides:



Advertisements
Similar presentations
Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)
Advertisements

Covariance and Correlation
Chapter 16: Correlation.
When Simultaneous observations on hydrological variables are available then one may be interested in the linear association between the variables. This.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
Descriptive Statistics: Numerical Measures
9. SIMPLE LINEAR REGESSION AND CORRELATION
17 Correlation. 17 Correlation Chapter17 p399.
Correlation and Covariance
Five-Number Summary 1 Smallest Value 2 First Quartile 3 Median 4
Major Points Scatterplots The correlation coefficient –Correlations on ranks Factors affecting correlations Testing for significance Intercorrelation matrices.
BIVARIATE DATA: CORRELATION AND REGRESSION Two variables of interest: X, Y. GOAL: Quantify association between X and Y: correlation. Predict value of Y.
Correlation-Regression The correlation coefficient measures how well one can predict X from Y or Y from X.
Analysis of Research Data
Cal State Northridge  320 Andrew Ainsworth PhD Correlation.
Statistics and Quantitative Analysis U4320 Segment 4: Statistics and Quantitative Analysis Prof. Sharyn O’Halloran.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Summarizing Bivariate Data Terms, Scatterplots and Correlation.
Cal State Northridge 427 Ainsworth
Z Scores & Correlation Greg C Elvers.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Examples of continuous probability distributions: The normal and standard normal.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Equations in Simple Regression Analysis. The Variance.
CORRELATION & REGRESSION
Portfolio Management-Learning Objective
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Correlation.
Correlation1.  The variance of a variable X provides information on the variability of X.  The covariance of two variables X and Y provides information.
Anthony Greene1 Correlation The Association Between Variables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Measures of Variability James H. Steiger. Overview Discuss Common Measures of Variability Range Semi-Interquartile Range Variance Standard Deviation Derive.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Wednesday, October 12 Correlation and Linear Regression.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1 Examining Relationships in Data William P. Wattles, Ph.D. Francis Marion University.
MEASURES of CORRELATION. CORRELATION basically the test of measurement. Means that two variables tend to vary together The presence of one indicates the.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
Variables and Random Variables àA variable is a quantity (such as height, income, the inflation rate, GDP, etc.) that takes on different values across.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
1 Inferences About The Pearson Correlation Coefficient.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Section 5.1: Correlation. Correlation Coefficient A quantitative assessment of the strength of a relationship between the x and y values in a set of (x,y)
9.1B – Computing the Correlation Coefficient by Hand
Measures of Variability James H. Steiger. Overview Discuss Common Measures of Variability Range Semi-Interquartile Range Variance Standard Deviation Derive.
1 ESTIMATORS OF VARIANCE, COVARIANCE, AND CORRELATION We have seen that the variance of a random variable X is given by the expression above. Variance.
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
Summarizing Data Graphical Methods. Histogram Stem-Leaf Diagram Grouped Freq Table Box-whisker Plot.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Understand.
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
Simple Linear Correlation
Correlation and Covariance
Design and Data Analysis in Psychology II
Elementary Statistics
Examining Relationships in Data
The Pearson Correlation
Correlation and Covariance
Karl’s Pearson Correlation
Ch 4.1 & 4.2 Two dimensions concept
Geology Geomath Chapter 7 - Statistics tom.h.wilson
Correlation and Covariance
Principal Component Analysis (PCA)
Interpreting & Summarizing
Presentation transcript:

Correlation and Covariance R. F. Riesenfeld (Based on web slides by James H. Steiger)

Goals ⇨ Introduce concepts of  Covariance  Correlation ⇨ Develop computational formulas 2R F Riesenfeld Sp 2010CS5961 Comp Stat

Covariance ⇨ Variables may change in relation to each other ⇨ measures how much the movement in one variable predicts the movement in a corresponding variable ⇨ Covariance measures how much the movement in one variable predicts the movement in a corresponding variable 3R F Riesenfeld Sp 2010CS5961 Comp Stat

Smoking and Lung Capacity ⇨ Example: investigate relationship between and ⇨ Example: investigate relationship between cigarette smoking and lung capacity ⇨ Data: sample group response data on smoking habits, measured lung capacities, respectively ⇨ Data: sample group response data on smoking habits, and measured lung capacities, respectively R F Riesenfeld Sp 2010CS5961 Comp Stat4

Smoking v Lung Capacity Data 5 N Cigarettes ( X )Lung Capacity ( Y ) R F Riesenfeld Sp 2010CS5961 Comp Stat

Smoking and Lung Capacity 6

Smoking v Lung Capacity ⇨ Observe that as smoking exposure goes up, corresponding lung capacity goes down ⇨ Variables inversely ⇨ Variables covary inversely ⇨ and quantify relationship ⇨ Covariance and Correlation quantify relationship 7R F Riesenfeld Sp 2010CS5961 Comp Stat

Covariance ⇨ Variables that inversely, like smoking and lung capacity, tend to appear on opposite sides of the group means ⇨ Variables that covary inversely, like smoking and lung capacity, tend to appear on opposite sides of the group means  When smoking is above its group mean, lung capacity tends to be below its group mean. ⇨ Average measures extent to which variables covary, the degree of linkage between them ⇨ Average product of deviation measures extent to which variables covary, the degree of linkage between them 8R F Riesenfeld Sp 2010CS5961 Comp Stat

The Sample Covariance ⇨ Similar to variance, for theoretical reasons, average is typically computed using ( N -1 ), not N. Thus, 9R F Riesenfeld Sp 2010CS5961 Comp Stat

Cigs ( X )Lung Cap ( Y ) Calculating Covariance R F Riesenfeld Sp 2010CS5961 Comp Stat10

Calculating Covariance Cigs (X ) Cap (Y ) ∑= R F Riesenfeld Sp 2010CS5961 Comp Stat11

Evaluation yields, Evaluation yields, 12 Covariance Calculation (2) R F Riesenfeld Sp 2010CS5961 Comp Stat

Covariance under Affine Transformation 13R F Riesenfeld Sp 2010CS5961 Comp Stat

14 CovarianceunderAffineTransf Covariance under Affine Transf (2) R F Riesenfeld Sp 2010CS5961 Comp Stat

(Pearson) Correlation Coefficient r xy ⇨ Like covariance, but uses Z-values instead of deviations. Hence, invariant under linear transformation of the raw data. 15R F Riesenfeld Sp 2010CS5961 Comp Stat

Alternative (common) Expression 16R F Riesenfeld Sp 2010CS5961 Comp Stat

Computational Formula 1 17R F Riesenfeld Sp 2010CS5961 Comp Stat

Computational Formula 2 18R F Riesenfeld Sp 2010CS5961 Comp Stat

Table for Calculating r xy 19 Cigs ( X ) X 2 XYY 2 Cap (Y ) ∑= R F Riesenfeld Sp 2010CS5961 Comp Stat

Computing from Table Computing r xy from Table 20R F Riesenfeld Sp 2010CS5961 Comp Stat

Computing Correlation 21R F Riesenfeld Sp 2010CS5961 Comp Stat

Conclusion ⇨ r xy = implies almost certainty smoker will have diminish lung capacity ⇨ Greater smoking exposure implies greater likelihood of lung damage 22R F Riesenfeld Sp 2010CS5961 Comp Stat

End Covariance & Correlation 23 Notes R F Riesenfeld Sp 2010CS5961 Comp Stat