Correlation and Prediction

Slides:



Advertisements
Similar presentations
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Advertisements

Bivariate Analyses.
Describing Relationships Using Correlation and Regression
Correlation Chapter 9.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Correlation “A statistician is someone who loves to work with numbers but doesn't have the personality to be an accountant.”
Correlational Designs
Chapter 7 Forecasting with Simple Regression
Chapter 9: Correlational Research. Chapter 9. Correlational Research Chapter Objectives  Distinguish between positive and negative bivariate correlations,
Relationships Among Variables
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 3 Correlation and Prediction.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
CHAPTER NINE Correlational Research Designs. Copyright © Houghton Mifflin Company. All rights reserved.Chapter 9 | 2 Study Questions What are correlational.
Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 12.
L 1 Chapter 12 Correlational Designs EDUC 640 Dr. William M. Bauer.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Chapter 11 Correlation Pt 1: Nov. 12, Correlation Association between scores on two variables –e.g., age and coordination skills in children, price.
Chapter 11 Correlation Pt 1: Nov. 6, Correlation Association between scores on two variables –Use scatterplots to see the relationship –Rule of.
Basic Statistics Correlation Var Relationships Associations.
Investigating the Relationship between Scores
Examining Relationships in Quantitative Research
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Chapter 4 Prediction. Predictor and Criterion Variables  Predictor variable (X)  Criterion variable (Y)
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Chapter 3 Correlation.  Association between scores on two variables –e.g., age and coordination skills in children, price and quality.
Statistics for Psychology CHAPTER SIXTH EDITION Statistics for Psychology, Sixth Edition Arthur Aron | Elliot J. Coups | Elaine N. Aron Copyright © 2013.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Outline of Today’s Discussion 1.Introduction to Correlation 2.An Alternative Formula for the Correlation Coefficient 3.Coefficient of Determination.
©2005, Pearson Education/Prentice Hall CHAPTER 6 Nonexperimental Strategies.
LESSON 5 - STATISTICS & RESEARCH STATISTICS – USE OF MATH TO ORGANIZE, SUMMARIZE, AND INTERPRET DATA.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
AP Statistics Review Day 1 Chapters 1-4. AP Exam Exploring Data accounts for 20%-30% of the material covered on the AP Exam. “Exploratory analysis of.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.
Correlation.
Chapter 12 Understanding Research Results: Description and Correlation
Statistical analysis.
Design and Data Analysis in Psychology II
Chapter 9: Correlational Research
Statistics for Managers using Microsoft Excel 3rd Edition
Statistical analysis.
Correlation 10/27.
Correlation and Regression
Correlation 10/27.
Elementary Statistics
EDRS6208 Fundamentals of Education Research 1
Statistics for the Social Sciences
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Theme 7 Correlation.
Correlation and Regression
The Pearson Correlation
Product moment correlation
An Introduction to Correlational Research
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Introduction to Regression
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Chapter 3 Correlation and Prediction
Z Scores & Correlation.
Warsaw Summer School 2017, OSU Study Abroad Program
Correlation and Prediction
Presentation transcript:

Correlation and Prediction Chapter 3

Chapter Outline Graphing Correlations: The Scatter Diagram Patterns of Correlation The Correlation Coefficient Issues in Interpreting the Correlation Coefficient Prediction The Correlation Coefficient and Proportion of Variance Accounted for Correlation and Prediction in Research Articles Advanced Topic: Multiple Regression Advanced Topic: Multiple Regression in Research Articles

Correlations Can be thought of as a descriptive statistic for the relationship between two variables Describes the relationship between two equal-interval numeric variables e.g., correlation between amount of time studying and amount learned e.g., correlation between number of years of education and salary

Correlation instruct.uwo.ca/geog/500/correlation_by_6.pdf

Scatter Diagram or Scatter Plot Graph showing the pattern o f the relationship between two variables

Patterns of Correlation A linear correlation relationship between two variables on a scatter diagram roughly approximating a straight line Curvilinear correlation any association between two variables other than a linear correlation relationship between two variables that shows up on a scatter diagram as dots following a systematic pattern that is not a straight line No correlation no systematic relationship between two variables

Positive and Negative Linear Correlation Positive Correlation High scores go with high scores. Low scores go with low scores. Medium scores go with medium scores. e.g., level of education achieved and income Negative Correlation High scores go with low scores. e.g., the relationship between fewer hours of sleep and higher levels of stress Strength of the Correlation how close the dots on a scatter diagram fall to a simple straight line

Positive Linear correlation

Negative correlation

Zero Correlation ludwig-sun2.unil.ch/~darlene/Rmini/lec/20021031.ppt

Curvilinear Relationship ludwig-sun2.unil.ch/~darlene/Rmini/lec/20021031.ppt

Curvilinear

How Are You doing? What does it mean when two variables have a curvilinear relationship? True or False: When two variables are negatively correlated, high scores go with high scores, low scores go with low scores, and medium scores go with medium scores.

The Correlation Coefficient Number that gives exact correlation between 2 variables can tell you direction and strength uses Z scores to compare scores on different variables Z scores allow you to calculate a cross-product that tells you the direction of the correlation. A cross-product is the result of multiplying a score on one variable by a score on the other variable. If you multiply a high Z score by a high Z score, you will always get a positive cross-product. If you multiply a low Z score by a low Z score, you will always get a positive cross-product. If you multiply a high Z score with a low Z score or a low Z score with a high Z score, you will get a negative number.

The Correlation Coefficient ( r ) The sign of r (Pearson correlation coefficient) tells the general trend of a relationship between two variables. A + sign means the correlation is positive. A - sign means the correlation is negative. The value of r ranges from 0 to 1. 1 is the highest value a correlation can have. A correlation of 1 or -1 means that the variables are perfectly correlated. 0 = no correlation The value of a correlation defines the strength of the correlation regardless of the sign. e.g., -.99 is a stronger correlation than .75

Formula for a Correlation Coefficient r = ∑ZxZy N Zx = Z score for each person on the X variable Zy = Z score for each person on the Y variable ZxZy = cross-product of Zx and Zy ∑ZxZy = sum of the cross-products of the Z scores over all participants in the study

Pearson Correlation Coefficient Pearson correlation coefficient “r” is the average value of the cross-product of ZX and Zy r is a measure of LINEAR ASSOCIATION (Direction: + vs. – and Strength: How much

Definitional Formula

Computational Formula

Bivariate Correlation

Issues in Interpreting the Correlation Coefficient Direction of causality path of causal effect (e.g., X causes Y) You cannot determine the direction of causality just because two variables are correlated.

Three Possible Directions of Causality Variable X causes variable Y. e.g., less sleep causes more stress Variable Y causes variable X. e.g., more stress causes people to sleep less There is a third variable that causes both variable X and variable Y. e.g., working longer hours causes both stress and fewer hours of sleep

Ruling Out Some Possible Directions of Causality Longitudinal Study a study where people are measured at two or more points in time e.g., evaluating number of hours of sleep at one time point and then evaluating their levels of stress at a later time point True Experiment a study in which participants are randomly assigned to a particular level of a variable and then measured on another variable e.g., exposing individuals to varying amounts of sleep in a laboratory environment and then evaluating their stress levels

The Statistical Significance of r A correlation is statistically significant if it is unlikely that you could have gotten a correlation as big as you did if in fact there was no relationship between variables. If the probability (p) is less than some small degree of probability (e.g., 5% or 1%), the correlation is considered statistically significant.

Malawi Med J. 2012 Sep; 24(3): 69–71.

Key Points Two variables are correlated when they are associated in a clear pattern. A scatter diagram displays the relationship between two variables. A linear correlation is seen when the dots in a scatter diagram generally follow a straight line. In a curvilinear correlation, the dots follow a pattern that does not approximate a straight line. When there is no correlation, the dots do not follow a pattern. In a positive correlation, the highs go with the highs, the lows with the lows, and the mediums go with the mediums. With a negative correlation, the lows go with the highs. r is the correlation coefficient and gives you the direction and strength of a correlation. r = (∑Zx Zy )/N The maximum positive value of r = 1 and the maximum negative value of r = -1. The closer the correlation is to -1 or 1, the stronger the correlation. Correlation does not tell you the direction of causation. Prediction model using Z scores = predicted Zy = ()(Zx). Prediction model with raw scores = predicted Y = (SDy)(predicted Zy) + My. r2 = proportion of variance accounted for and is used to compare linear correlations Correlation coefficients are reported both in the text and in tables of research articles.