

Correlation

Definition: the degree of relationship between two or more variables. For example, smoking and lung cancer are correlated: if we look at the number of people who smoke and the number of people who die of lung cancer over the course of 80 years, we see a correlation.

Smoking and Lung Cancer Deaths

But… Does this mean that smoking causes lung cancer? How do we make sense of data when we want to know if two variables are correlated? Can things be more or less correlated?

Scatterplots The most common graphical representation of data for the purpose of correlation is the scatterplot. Points for each individual are plotted based on the individual’s X score and Y score.
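As a rough illustration of how each individual becomes one point, here is a minimal sketch that draws a scatterplot as plain text. The function name `ascii_scatter`, the grid size, and the scores are hypothetical and do not come from the slides.

```python
def ascii_scatter(xs, ys, width=20, height=10):
    """Return a text scatterplot: one '*' per (x, y) pair, with Y increasing upward."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("xs and ys must be non-empty and the same length")
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)
    grid = [[" "] * width for _ in range(height)]
    for x, y in zip(xs, ys):
        # Scale each score to a column/row index; a zero range maps to index 0.
        col = 0 if x_max == x_min else round((x - x_min) / (x_max - x_min) * (width - 1))
        row = 0 if y_max == y_min else round((y - y_min) / (y_max - y_min) * (height - 1))
        grid[height - 1 - row][col] = "*"  # flip rows so larger Y is drawn higher
    body = "\n".join("|" + "".join(line) for line in grid)
    return body + "\n+" + "-" * width


# Hypothetical scores (not the activity data): X = hours, Y = grade.
hours = [1, 2, 3, 4, 5]
grades = [60, 70, 65, 80, 90]
print(ascii_scatter(hours, grades))
```

Two individuals whose scores land in the same cell share a single `*`, so a finer grid may be needed for larger data sets.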

Activity #1 Create a scatterplot for the following data:

Name        Average Hours of Exercise per Week   Average Grade (%) for Semester
Fina        .34                                  86
Sam         .9                                   73
Bronson     1.4                                  82
Slona       .2                                   97
Mell        3.3                                  91
Monique     .69                                  94
Kat Belle   4.1                                  88
Henry       2.1                                  96
Gill        .3                                   67

Scatterplot

Positive Correlation A high score on one variable is associated with a high score on the other and vice versa.

Negative Correlation A high score on one variable is associated with a low score on the other and vice versa.

Zero Correlation There is no relationship between the two variables.

How About This One? Zero, positive, or negative?

The Pearson Product-Moment Correlation Coefficient Pearson r A summary measure of the direction and degree of linear relationship between two variables. An estimate of ρ (rho), the population correlation coefficient.

Conceptual Formula Multiply the z-score for X and Y for each individual, and then add them all up and divide the sum by N (the number of individuals). To compute using this formula, we would need to determine the standard deviation for both X and Y, then determine the z-score for each raw score (X and Y).
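Written out, the verbal description above corresponds to the following formula. The notation is an assumption: M and s stand for the mean and the standard deviation of each variable, with s computed by dividing by N so that it matches the "divide the sum by N" step.

```latex
r = \frac{\sum z_X z_Y}{N},
\qquad
z_X = \frac{X - M_X}{s_X},
\qquad
z_Y = \frac{Y - M_Y}{s_Y}
```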

Conceptual Formula How does this formula work? If your two variables are positively correlated, many of the X scores with positive z-scores will be paired with Y scores with positive z-scores. And many X scores with negative z-scores will be paired with Y scores with negative z-scores. If you multiply two positive numbers or two negative numbers, you will get a positive number. If you have a whole bunch of positive products and add them up, you will get a large positive number.

Conceptual Formula If your two variables are negatively correlated, many X scores with positive z-scores will be paired with many Y scores with negative z- scores and vice versa. If you multiply a negative and a positive number, you get a negative number. If you add up a whole bunch of negative products, you would get a large negative number.
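The sign argument above can be checked numerically. This is a minimal sketch assuming population z-scores (standard deviation computed by dividing by N); the function names and the scores are hypothetical, not the activity data.

```python
from math import sqrt


def z_scores(scores):
    """Population z-scores: (score - mean) / SD, with SD computed by dividing by N."""
    n = len(scores)
    mean = sum(scores) / n
    sd = sqrt(sum((s - mean) ** 2 for s in scores) / n)
    # Note: sd is 0 if every score is identical, in which case r is undefined.
    return [(s - mean) / sd for s in scores]


def pearson_r_conceptual(xs, ys):
    """Multiply paired z-scores, add the products, and divide by N."""
    products = [zx * zy for zx, zy in zip(z_scores(xs), z_scores(ys))]
    return sum(products) / len(products)


# Hypothetical scores chosen so the direction of the relationship is obvious.
x = [1, 2, 3, 4, 5]
y_up = [2, 4, 5, 4, 5]    # tends to rise as x rises
y_down = [5, 4, 5, 4, 2]  # tends to fall as x rises

r_up = pearson_r_conceptual(x, y_up)
r_down = pearson_r_conceptual(x, y_down)
print(r_up, r_down)
```

With these scores the mostly positive products give a positive r and the mostly negative products give a negative r of the same size.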

Covariance You can also understand r as a measure of covariance, or how much X and Y vary together. We are already familiar with variance, and this will help us understand covariance.

Covariance If we rewrite this formula: …and think of it as a measure of how much X scores vary with other X scores, we can write a new formula that determines how much X scores vary with Y scores:
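Assuming the same divide-by-N convention as the conceptual formula, the two formulas referred to here can plausibly be written as follows. The symbols M_X, M_Y, s_X^2, and cov_XY are assumed notation.

```latex
s_X^2 = \frac{\sum (X - M_X)(X - M_X)}{N}
\qquad\longrightarrow\qquad
\mathrm{cov}_{XY} = \frac{\sum (X - M_X)(Y - M_Y)}{N}
```

The only change is that the second deviation is taken from Y rather than from X.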

Covariance To find r we need to “standardize” the covariance, which means that we will have to use the standard deviation. Think of it like this: To do this with covariance, we have to consider the standard deviations of both X and Y. We do this by multiplying them, and putting them in the denominator:
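In symbols, and assuming the notation M for a mean, s for a standard deviation, and cov_XY for the covariance, standardizing a single score and standardizing the covariance look like this:

```latex
z_X = \frac{X - M_X}{s_X}
\qquad\qquad
r = \frac{\mathrm{cov}_{XY}}{s_X \, s_Y}
```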

Pearson r Regardless of the formula you use, you will get the same result. r will never be larger than 1 or smaller than -1. The larger the absolute value of the obtained r, the more closely the two variables are linearly related. If r is close to zero, there is probably little or no linear relationship between the variables. If r is large and negative, you have a negative correlation. If r is large and positive, you have a positive correlation.
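A short sketch can confirm that the z-score route and the covariance route agree. It assumes the divide-by-N convention throughout; the function names and the scores are hypothetical.

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def covariance(xs, ys):
    """Average product of paired deviations from the means (divide by N)."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def sd(values):
    """Population SD: the square root of a variable's covariance with itself."""
    return sqrt(covariance(values, values))


def r_from_covariance(xs, ys):
    """Standardize the covariance by the product of the two SDs."""
    return covariance(xs, ys) / (sd(xs) * sd(ys))


def r_from_z_scores(xs, ys):
    """Average the products of paired z-scores."""
    mx, my, sx, sy = mean(xs), mean(ys), sd(xs), sd(ys)
    return sum(((x - mx) / sx) * ((y - my) / sy) for x, y in zip(xs, ys)) / len(xs)


# Hypothetical scores, not the activity data.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
r_cov = r_from_covariance(x, y)
r_z = r_from_z_scores(x, y)
print(r_cov, r_z)
```

Both routes divide the same sum of deviation products by the same quantities, which is why they match up to floating-point rounding.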

Activity #2 Determine r for the following data set:

Name        Average Hours of Exercise per Week   Average Grade (%) for Semester
Fina        .34                                  86
Sam         .9                                   73
Bronson     1.4                                  82
Slona       .2                                   97
Mell        3.3                                  91
Monique     .69                                  94
Kat Belle   4.1                                  88
Henry       2.1                                  96
Gill        .3                                   67

Activity #2 For the same data set:
Step 1: Calculate the standard deviation of X
Step 2: Calculate the standard deviation of Y
Step 3: Calculate the covariance
Step 4: Calculate r

Activity #2
Standard Deviation of X = 1.36
Standard Deviation of Y =
Covariance =
r = .1997 ≈ .2

Significance Test for r H0: ρ = 0. Use df = N - 2 (N is the number of individuals, or rows; we subtract two because we have two variables). Look at the table of critical values for r. Just as with t and F, if the absolute value of the obtained r exceeds the critical r, your result is statistically significant at the specified α level. Interpretation is the same: the probability of getting results this extreme or more extreme if the null hypothesis is true is less than .05 (or .01).
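The comparison against a critical value can be sketched in code. This is not the table lookup the slide describes: it assumes the standard relationship between r and t with df = N - 2, and it expects the caller to supply the critical t from a t table. The function names and the sample values of r and N are hypothetical.

```python
from math import sqrt


def t_from_r(r, n):
    """Convert an obtained r to a t statistic with df = n - 2 (undefined for r = +/-1)."""
    df = n - 2
    return r * sqrt(df) / sqrt(1 - r ** 2)


def critical_r(t_critical, df):
    """Critical r that corresponds to a critical t with the same df."""
    return t_critical / sqrt(t_critical ** 2 + df)


def is_significant(r, n, t_critical):
    """True when |r| exceeds the critical r for df = n - 2."""
    return abs(r) > critical_r(t_critical, n - 2)


# Hypothetical case: N = 10, so df = 8. The two-tailed critical t at
# alpha = .05 for df = 8 is 2.306 (taken from a standard t table).
r_crit = critical_r(2.306, 8)
print(round(r_crit, 3))                  # 0.632
print(is_significant(0.20, 10, 2.306))   # False
print(is_significant(0.70, 10, 2.306))   # True
```

Comparing |r| with the critical r gives the same decision as comparing |t| with the critical t, because one is a monotonic transformation of the other.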

Interpreting Correlation Despite what you have probably heard over and over again (ad nauseam), correlation CAN imply causation. Remember, design trumps analysis. If the data we just analyzed turned out to be “significantly” correlated AND I had randomly assigned students to high- and low-exercise groups, I could make causal statements.

Interpreting Correlation The reason you hear “correlation does not imply causation” is that most correlation analyses are conducted on data that are not from a true experiment (i.e., no random assignment). For example, the smoking and lung cancer data were not the result of an experiment. So, even if the two variables are causally related, we cannot make causal statements about them.

What are the alternatives to causation?
The reverse could be true: the more firefighters fighting a fire, the bigger the fire is, so do firefighters cause fires to be larger?
They might both cause each other: exercising causes someone’s health to improve, which is influential in their exercising more (because they “feel like it”).
They might both be caused by another variable: falling asleep with one’s shoes on is correlated with waking up with a headache, but these are both caused by getting completely blitzed the night before.
Complete coincidence: for example, the number of churches in the U.S. has been rising steadily and so has the average global temperature (but there may actually be a better explanation for this example).

Homework
Study for Chapter 13 Quiz
Read Chapter 14
Do Chapter 13 HW