Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Regression Greg C Elvers.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Correlation and Regression
Regression What is regression to the mean?
Describing Relationships Using Correlation and Regression
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Chapter 10 Relationships between variables
Statistics Psych 231: Research Methods in Psychology.
SIMPLE LINEAR REGRESSION
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Correlation and Regression Analysis
Leon-Guerrero and Frankfort-Nachmias,
Relationships Among Variables
Correlation & Regression Math 137 Fresno State Burger.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Chapter 8: Bivariate Regression and Correlation
Linear Regression.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
HAWKES LEARNING SYSTEMS math courseware specialists Discovering Relationships Chapter 5 Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc.
Overview 4.2 Introduction to Correlation 4.3 Introduction to Regression.
SIMPLE LINEAR REGRESSION
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 12.
Chapter 6 & 7 Linear Regression & Correlation
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Stats/Methods I JEOPARDY. Jeopardy CorrelationRegressionZ-ScoresProbabilitySurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
1.6 Linear Regression & the Correlation Coefficient.
Investigating the Relationship between Scores
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
You can calculate: Central tendency Variability You could graph the data.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Correlation & Regression
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
The Lease Squares Line Finite 1.3.
Statistics 200 Lecture #5 Tuesday, September 6, 2016
Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 5 STATISTICS (PART 4).
Correlation and Simple Linear Regression
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
You can calculate: Central tendency Variability You could graph the data.
Correlation and Simple Linear Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
Simple Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Warsaw Summer School 2017, OSU Study Abroad Program
Correlation & Regression
Presentation transcript:

Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution is required for correlation and regression technique. A bivariate distribution is a joint distribution of two variables, the individual scores of which are paired in some logical way. A bivariate distribution may show positive correlation, negative correlation, or zero correlation. EDUCATIONAL STATISTICS EDU5950 WEEK12

In a case of a positive correlation between two variables, high measurements on one variable tend to be associated with high measurements on the other variable, and low measurements on one variable with low measurements on the other. Table 1 shows a graph of relationship of tall fathers tend to have sons who grew up to be tall men. Short fathers tend to have sons who grow up to be short men. If such were the case (which, of course, is ridiculous), then it would be possible to predict without error the adult height of an unborn son simply by measuring his father. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

In Figure 1, each point represents a pair of scores, the height of a father and the height of his son. Such an array of points is called a scatterplot. The line that runs through the points is called a regression line. It is a line of best fit. When there is perfect correlation (r = 1.00), all points fall exactly on the regression line. EDUCATIONAL STATISTICS EDU5950 WEEK12

When a correlation is negative, increases in one variable are accompanied by decreases in the other variable (an inverse relationship). With negative correlation, the regression line goes from the upper left corner of the graph to the lower right corner. As you may recall, such lines have a negative slope. Although some correlation coefficients are positive and some are negative, one is not more valuable than the other. EDUCATIONAL STATISTICS EDU5950 WEEK12

The algebraic sign simply tells you the direction of the relationship (which is important when you are describing how the variables are related). The absolute size of r, however tells you the degree of the relationship. A strong relationship (either positive or negative) is usually more valuable than a weaker one. A zero correlation means there is no linear relationship between two variables. EDUCATIONAL STATISTICS EDU5950 WEEK12

High and low scores on the two variables are not associated in any predictable manner. Figure 2 shows a scatterplot that produces a zero correlation coefficient. When r = 0, the regression line is horizontal at a height of Y. This make sense; if r = 0, then your best estimate of Y for any value is Y. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

A correlation coefficient provides a quantitative way to express the degree of relationship that exists between two variables. The definition formula is where r = Pearson product-moment correlation coefficient zx = a z score for variable X zy = the corresponding z score for variable Y N = number of pairs of X and Y values EDUCATIONAL STATISTICS EDU5950 WEEK12

Because researchers often use means and standard deviations when telling the story of the data, this formula (Blanched formula) is used by many: Where; x & y = paired observations xy = product of each x value multiplied by its paired y value = mean of variable x = mean of variable y = standard deviation of variable x = standard deviation of variable y N= number of pairs of observations EDUCATIONAL STATISTICS EDU5950 WEEK12

With the raw score formula, you calculate r from the raw scores without computing means and standard deviations. The formula is Remember that N is the number of pairs of values. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12

The basic simple interpretation of r is probably familiar to you at this point. A correlation coefficient measures the degree of linear relationship between two variables of a bivariate distribution. What is qualifies as a large correlation coefficient? What is small? Jacob Cohen proposed that the question be answered by calculating an effect size index (d) and that d values of.20,.50, and.80 were designated as small, medium, and large, respectively. EDUCATIONAL STATISTICS EDU5950 WEEK12

In a similar way, Cohen addressed the question of calculating an effect size index for correlation coefficients. Small r =.10 Medium r =.30 Large r =.50 The correlation coefficient is also the basis of the coefficient of determination, which tells the proportion of variance that two variables in a bivariate distribution have in common. EDUCATIONAL STATISTICS EDU5950 WEEK12

The coefficient of determination is calculated by squaring r; it is always a positive value between 0 and 1. coefficient of determination = r 2 EDUCATIONAL STATISTICS EDU5950 WEEK12

Linear regression is a technique that uses the data to write an equation for a straight line – then to make predictions. Often predictions are based on an assumption that the relationship between two variables is linear. Formula for a straight line is Y = mX + b Where; - Y and X are variables representing scores on the Y and X axes - m = slope of the line - b = intercept of the line with the Y axis EDUCATIONAL STATISTICS EDU5950 WEEK12

Least square method is Consider this data: Father height 64 in First daughter height 66 in Second daughter height 64 in Draw a regression line using least square method EDUCATIONAL STATISTICS EDU5950 WEEK12

There is an error for each person on the scatterplot. The least square method creates a straight line such that the sum of the squares of the errors is a minimum. The least square method produces numerical values for the slope and the intercept – write the equation for a straight line; this line is the one that best fits the data. In statistics, the regression equation is EDUCATIONAL STATISTICS EDU5950 WEEK12

Where; Y = Y value predicted from a particular X value a = point at which the regression line intersects the s the axis b = slope of the regression line X = X value for which you wish to predict a Y value Note: In correlation problem, the symbol Y can be assigned to either variable, but in regression equation, Y is assigned to the variable you wish to predict. EDUCATIONAL STATISTICS EDU5950 WEEK12

You need the values for a and b, which are called regression coefficients – can be calculated from any bivariate set of data. To calculate b, To calculate a, EDUCATIONAL STATISTICS EDU5950 WEEK12

Let use our data from earlier correlation problem, predict math from oral test in SPM. EDUCATIONAL STATISTICS EDU5950 WEEK12

Entering these regression coefficient values into the regression equation produces a formula that predicts math from oral test: EDUCATIONAL STATISTICS EDU5950 WEEK12

. EDUCATIONAL STATISTICS EDU5950 WEEK12