CORRELATION LECTURE 1 EPSY 640 Texas A&M University.

Slides:



Advertisements
Similar presentations
Correlation and Linear Regression.
Advertisements

Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Overview Correlation Regression -Definition
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
Statistics for the Social Sciences
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
CORRELATION AND SIMPLE LINEAR REGRESSION - Revisited Ref: Cohen, Cohen, West, & Aiken (2003), ch. 2.
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Regression and Correlation
Correlation-Regression The correlation coefficient measures how well one can predict X from Y or Y from X.
Chapter Eighteen MEASURES OF ASSOCIATION
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
EPSY 651: Structural Equation Modeling I. Where does SEM fit in Quantitative Methodology? Draws on three traditions in mathematics and science: Psychology.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Lecture 16 Correlation and Coefficient of Correlation
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis (2)
Copyright © 2012 Pearson Education, Inc. All rights reserved. Chapter 3 Simple Linear Regression.
Covariance and correlation
Correlation.
Chapter 15 Correlation and Regression
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
Investigating the Relationship between Scores
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 12, 2013 Correlation and Regression.
Educ 200C Wed. Oct 3, Variation What is it? What does it look like in a data set?
1 Inferences About The Pearson Correlation Coefficient.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
Reasoning in Psychology Using Statistics Psychology
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Chapter 14 Correlation and Regression
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
CORRELATION ANALYSIS.
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Correlations: Linear Relationships Data What kind of measures are used? interval, ratio nominal Correlation Analysis: Pearson’s r (ordinal scales use Spearman’s.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Multiple Regression.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Lecture 2- Alternate Correlation Procedures
Understanding Research Results: Description and Correlation
Theme 7 Correlation.
CORRELATION ANALYSIS.
Statistics for the Social Sciences
The Pearson Correlation
Linear Regression and Correlation
Linear Regression and Correlation
Presentation transcript:

CORRELATION LECTURE 1 EPSY 640 Texas A&M University

ALTITUDE HEIGHT OF COLUMN Figure 3.1: Graph of Torricelli and Viviani 1643/44 data on Altitude and Height of a column of mercury

TABULAR DATA HEIGHTALT CHANGEMN CHNG HT ALTHT ALT Predicted:

SYMBOLIC REPRESENTATION mathematical representation: height  1/altitude where  means “proportional to.” orH = b 1 A + b 0 H =height of the column of mercury, b 1 is a multiplier or coefficient, b 0 is a constant value that makes the data points line up correctly, also the value H takes when A is zero.

MATH REPRESENTATION For the data above the following numbers are produced from the best fit: H = A Thus, for any altitude in feet, we multiply it by and add Our approximation was  H = =  A(+900)  =change = 900 x ( ) close enough

MATH REPRESENTATION Error - the difference between prediction and observation. Note: error in our estimate for going from 3000 to 3900 feet should have dropped the mercury from to 26.37, but it only dropped to 26.65, error = +.28 inches Prediction -the outcome of computing an equation such as that for H above.

Karl Pearson ( (exerpted from E S Pearson, Karl Pearson: An Appreciation of some aspects of his life and works, Cambridge University Press, 1938).

Pearson Correlation standard deviation (SD)- measure of spread of scores SD of the three data points s A = 900 coefficient , the amount of change in height per foot of altitude. s H = m A = , m A = 3900 re-represent the data in standard score units, or z-scores as z H = z A.

Pearson Correlation z H = z A Thus, a 1 standard deviation change in altitude produces a standard deviation change in height Thus, SD A = = x = inches per 900 feet of altitude

Pearson Correlation n  (x i – x x )(y i – y y )/(n-1) r xy = i=1_____________________________ = s xy /s x s y s x s y =  z x i z y i /(n-1) / s x s y = COVARIANCE / SD(x)SD(y)

COVARIANCE DEFINED AS CO-VARIATION “UNSTANDARDIZED CORRELATION”

Squared correlation “r-squared” Most squared things are: –area measures –variance-related –Often have a chi-square distribution (looks somewhat like a Poisson)

Variance of X=1 Variance of Y=1 r 2 = percent overlap in the two squares Fig. 3.6: Geometric representation of r 2 as the overlap of two squares a. Nonzero correlation Variance of X=1 Variance of Y=1 B. Zero correlation

SSy SSx S xy Sums of Squares and Cross Product (Covariance) Circles are easier to show than rectangles, still area concept:

StudentX (SAT Math)  X=X-Mean Y (Calc grade)  Y=Y-Mean  X  Y Contributor Discrepant D = * C = * B = * A = C = * B = * Sum Mean (n-1 divisor) SD Correlation = 40/ =.364 b 1 = b 0 = *550 y =.00364SAT +.5 means:2.5 = Note: prediction always includes the means Pred(Ymean)= b1Xmean + b0 Table 3.1: Calculation of Pearson correlation coefficient for hypothetical data on SAT Math and Calculus Grades

Plot of data of Calc grade by SAT Math

SAT Math Calc Grade.364 (40) error. 932(.955) Figure 3.4: Path model representation of correlation between SAT Math scores and Calculus Grades  1 – r 2 s e = standard deviation of errors correlation covariance

Path Models path coefficient -standardized coefficient next to arrow, covariance in parentheses error coefficient- the correlation between the errors, or discrepancies between observed and predicted Calc Grade scores, and the observed Calc Grade scores. Predicted(Calc Grade) = SAT-Math +.5 errors are sometimes called disturbances

X Y a XY b X Y e c Figure 3.2: Path model representations of correlation

BIVARIATE DATA 2 VARIABLES QUESTION: DO THEY COVARY? IF SO, HOW DO WE INTERPRET? IF NOT, IS THERE A THIRD INTERVENING (MEDIATING) VARIABLE OR EXOGENOUS VARIABLE THAT SUPPRESSES THE RELATIONSHIP? OR MODERATES THE RELATIONSHIP

IDEALIZED SCATTERPLOT POSITIVE RELATIONSHIP X Y Prediction line

IDEALIZED SCATTERPLOT NEGATIVE RELATIONSHIP X Y Prediction line 95% confidence interval around prediction X. Y.

IDEALIZED SCATTERPLOT NO RELATIONSHIP X Y Prediction line

SUPPRESSED SCATTERPLOT NO APPARENT RELATIONSHIP X Y Prediction lines MALES FEMALES

MODEERATION AND SUPPRESSION IN A SCATTERPLOT NO APPARENT RELATIONSHIP X Y Prediction lines MALES FEMALES

IDEALIZED SCATTERPLOT POSITIVE CURVILINEAR RELATIONSHIP X Y Linear prediction line Quadratic prediction line

INFLUENCE OF POINTS SOME POINTS CHANGE RELATIONSHIP (outliers, influence points), OTHERS DO LITTLE ACTIVITY: –1. CONSTRUCT 10 POINT SCATTERPLOT, TRY TO APPROXIMATE.6 CORRELATION –DETERMINE LOCATIONS FOR POINTS THAT CHANGE THE CORRELATION TO.4 OR LESS

Computing Correlation with SPSS SPSS data files are organized by ROWS: people or unitsCOLUMNS: variables Select “Analyze/Correlate/Bivariate” Highlight a variable, move it to the text box, repeat for all variables to be correlated Select “Pearson” or “Spearman (ordinal only) Select “One” or “Two” tailed for significance testing: do you have theory that says a correlation should be positive (or negative)? Test one-tailed, which tests if the correlation is zero or not

Computing Correlation with SPSS continued Select “Options”, check “Means and Standard Deviations” if you want summary statistics correlation signficance Sample size

5%