Covariance and Correlation

Slides:



Advertisements
Similar presentations
Covariance and Correlation
Advertisements

Table of Contents Exit Appendix Behavioral Statistics.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.
Correlation and Covariance
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
Perfect Negative Correlation Perfect Positive Correlation Non-Existent Correlation Imperfect Negative Correlation Imperfect Positive Correlation.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Correlation and Regression 1. Bivariate data When measurements on two characteristics are to be studied simultaneously because of their interdependence,
11. Multivariate Analysis CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science, IUPUI.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Correlation and regression 1: Correlation Coefficient
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Covariance and correlation
Basic linear regression and multiple regression Psych Fraley.
Lecture 3 A Brief Review of Some Important Statistical Concepts.
Wednesday, October 12 Correlation and Linear Regression.
Examining Relationships in Quantitative Research
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
ContentDetail  Two variable statistics involves discovering if two variables are related or linked to each other in some way. e.g. - Does IQ determine.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
1 INVESTMENT ANALYSIS & PORTFOLIO MANAGEMENT Lecture # 35 Shahid A. Zia Dr. Shahid A. Zia.
Descriptive Statistics ( )
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
Chapter 12 Understanding Research Results: Description and Correlation
Simple Linear Correlation
Correlation and Covariance
CORRELATION.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 20th February 2014  
Chapter 13 Linear Regression and Correlation Basic Statistics
LECTURE 13 Thursday, 8th October
Ch 4 實習.
Chapter 4 Fundamental statistical characteristics II: Dispersion and form measurements.
Research methods Lesson 2.
S1 :: Chapter 6 Correlation
Numerical Descriptive Measures
Descriptive Analysis and Presentation of Bivariate Data
Keller: Stats for Mgmt & Econ, 7th Ed
Regression.
Introduction to bivariate data
Using Statistical techniques in Geography
Since When is it Standard to Be Deviant?
CORRELATION ANALYSIS.
Descriptive Statistics:
Chapter 3D Chapter 3, part D Fall 2000.
Day 42 – Understanding Correlation Coefficient
The Pearson Correlation
Functions and Their Graphs
Correlation and Covariance
Product moment correlation
EXPERIMENT VS. CORRELATIONAL STUDY
CORRELATION AND MULTIPLE REGRESSION ANALYSIS
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Correlation and Covariance
Introduction to Regression
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Chapter 3 Correlation and Prediction
Week 11.
Correlation & Trend Lines
Business and Economics 7th Edition
MGS 3100 Business Analysis Regression Feb 18, 2016
Forecasting Plays an important role in many industries
REGRESSION ANALYSIS 11/28/2019.
Presentation transcript:

Covariance and Correlation Questions: What does it mean to say that two variables are associated with one another? How can we mathematically formalize the concept of association?

Limitation of covariance One limitation of the covariance is that the size of the covariance depends on the variability of the variables. As a consequence, it can be difficult to evaluate the magnitude of the covariation between two variables. If the amount of variability is small, then the highest possible value of the covariance will also be small. If there is a large amount of variability, the maximum covariance can be large.

Limitations of covariance Ideally, we would like to evaluate the magnitude of the covariance relative to maximum possible covariance How can we determine the maximum possible covariance?

Go vary with yourself Let’s first note that, of all the variables a variable may covary with, it will covary with itself most strongly In fact, the “covariance of a variable with itself” is an alternative way to define variance:

Go vary with yourself Thus, if we were to divide the covariance of a variable with itself by the variance of the variable, we would obtain a value of 1. This will give us a standard for evaluating the magnitude of the covariance. Note: I’ve written the variance of X as sX  sX because the variance is the SD squared

Go vary with yourself However, we are interested in evaluating the covariance of a variable with another variable (not with itself), so we must derive a maximum possible covariance for these situations too. By extension, the covariance between two variables cannot be any greater than the product of the SD’s for the two variables. Thus, if we divide by sxsy, we can evaluate the magnitude of the covariance relative to 1.

Spine-tingling moment Important: What we’ve done is taken the covariance and “standardized” it. It will never be greater than 1 (or smaller than –1). The larger the absolute value of this index, the stronger the association between two variables.

Spine-tingling moment When expressed this way, the covariance is called a correlation The correlation is defined as a standardized covariance.

Correlation It can also be defined as the average product of z-scores because the two equations are identical. The correlation, r, is a quantitative index of the association between two variables. It is the average of the products of the z-scores. When this average is positive, there is a positive correlation; when negative, a negative correlation

Mean of each variable is zero A, D, & B are above the mean on both variables E & C are below the mean on both variables F is above the mean on x, but below the mean on y

+  + = +   + =  +   =     = +

Correlation

Correlation The value of r can range between -1 and + 1. If r = 0, then there is no correlation between the two variables. If r = 1 (or -1), then there is a perfect positive (or negative) relationship between the two variables.

r = + 1 r = 0 r = - 1

Correlation The absolute size of the correlation corresponds to the magnitude or strength of the relationship When a correlation is strong (e.g., r = .90), then people above the mean on x are substantially more likely to be above the mean on y than they would be if the correlation was weak (e.g., r = .10).

r = + .70 r = + .30 r = + 1

Correlation Advantages and uses of the correlation coefficient Provides an easy way to quantify the association between two variables Employs z-scores, so the variances of each variable are standardized & = 1 Foundation for many statistical applications