R xy. When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Covariance and Correlation: Estimator/Sample Statistic: Population Parameter: Covariance and correlation measure linear association between two variables,
Regression Greg C Elvers.
Regression What is regression to the mean?
Describing Relationships Using Correlation and Regression
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Overview Correlation Regression -Definition
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Statistics for the Social Sciences
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Regression and Correlation
Correlation-Regression The correlation coefficient measures how well one can predict X from Y or Y from X.
SIMPLE LINEAR REGRESSION
Chapter 3 Summarizing Descriptive Relationships ©.
REGRESSION AND CORRELATION
SIMPLE LINEAR REGRESSION
Covariance and Correlation
Relationships Among Variables
Statistics for the Behavioral Sciences (5th ed.) Gravetter & Wallnau
Correlation & Regression Math 137 Fresno State Burger.
Correlation 10/30. Relationships Between Continuous Variables Some studies measure multiple variables – Any paired-sample experiment – Training & testing.
Lecture 5 Correlation and Regression
Chapter 4 Two-Variables Analysis 09/19-20/2013. Outline  Issue: How to identify the linear relationship between two variables?  Relationship: Scatter.
Correlation and Regression
Correlation and Linear Regression
Correlation and Linear Regression
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation
Correlation and regression 1: Correlation Coefficient
Copyright ©2011 Nelson Education Limited Describing Bivariate Data CHAPTER 3.
Correlation and Regression
Biostatistics Unit 9 – Regression and Correlation.
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Bivariate Data When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable.
1. Graph 4x – 5y = -20 What is the x-intercept? What is the y-intercept? 2. Graph y = -3x Graph x = -4.
Wednesday, October 12 Correlation and Linear Regression.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1.6 Linear Regression & the Correlation Coefficient.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Linear Regression Least Squares Method: an introduction.
Slide 8- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 Regression & Correlation (1) 1.A relationship between 2 variables X and Y 2.The relationship seen as a straight line 3.Two problems 4.How can we tell.
Regression and Least Squares The need for a mathematical construct… Insert fig 3.8.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
CORRELATION ANALYSIS.
Summarizing Data Graphical Methods. Histogram Stem-Leaf Diagram Grouped Freq Table Box-whisker Plot.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Part II Exploring Relationships Between Variables.
Correlation & Regression
Regression and Correlation
The Weather Turbulence
Introduction to Probability and Statistics Thirteenth Edition
Correlation and Regression
Presentation transcript:

r xy

When two variables are correlated, we can predict a score on one variable from a score on the other The stronger the correlation, the more accurate our prediction will be

r xy We need a measure of the “strength” of a correlation

r xy We need a number that gets bigger when big numbers are paired with big numbers and small numbers are paired with small numbers We need a number that gets smaller when big numbers are paired with small numbers and small numbers are paired with big numbers

r xy Remember the height/weight example: Big number indicates this (strong positive correlation) 5’5’25’45’65’85’ a a b b, e c c d d ef f

r xy Remember the height/weight example: Small number indicates this (strong negative correlation) 5’5’25’45’65’85’ a a b b, e c c d d ef f

r xy Two sets of scores, x i and y i What could we do?

r xy What could we do?

r xy What could we do? When pairs are multiplied and the products are summed up: – Greatest when big numbers paired with big numbers and small numbers with small numbers –Least when small numbers are paired with big numbers and big numbers are paired with small numbers

r xy analogy: This gets you most money Pennies Quarters Loonies

r xy analogy:this gets you the least… Pennies Quarters Loonies

r xy analogy: Because: 3 x $1 plus 2 x $0.25 plus 1 x $0.01 is more than 1 x $1 plus 2 x $0.25 plus 3 x $0.01

r xy But there’s a problem Not a good measure because the value ultimately depends on n AND the size of the numbers

r xy Try this

r xy Try this Still not so good - doesn’t depend on n anymore, but does depend on size of x’s and y’s

r xy How about multiply deviation scores –comparing each variable relative to its respective mean

r xy Multiply deviation scores Now value depends on the spread of the data

r xy So standardize the scores

r xy This measures strength of correlation: = = r xy

r xy r xy ranges from -1.0 indicating a perfect negative correlation to +1.0 indicating a perfect positive correlation an r xy of zero indicates no correlation whatsoever. Scores are random with respect to each other.

r xy r xy also has a geometric meaning

r xy r xy also has a geometric meaning Recall that the mean of the z x and z y distributions is zero and each z-score is a deviation from the mean

r xy Each point lands in one of four quadrants point z x, z y zxzx zyzy

r xy notice that: both z x and z y are positive r xy =

r xy notice that: z x is negative and z y is positive r xy =

r xy notice that: z x is negative and z y is negative r xy =

r xy notice that: z x is positive and z y is negative r xy =

r xy So Thus if most points tend to fall around a line with a positive (45 degree) slope (I and III), the cross-products will tend to be positive I II III IV

r xy So If most points tend to fall around a line with a negative slope (II and IV), the cross products will tend to be negative Thus if most points tend to fall around a line with a positive (45 degree) slope (I and III), the cross-products will tend to be positive I II III IV

r xy So If the points were randomly scattered about, the negative and positive cross-products cancel

Covariance a related measure of the relationship between scores on two different variables is the covariance

Covariance notice that the variance (S 2 x ) is the covariance between a variable and itself !

Regression If two variables are perfectly correlated (r = + or - 1.0) then one can exactly predict a score on one variable given a score on another

Regression For example: a university charges $250 registration fee plus $100 / credit

Regression tuition = $100(X) + $250 –where X is the number of credits Notice this is a linear relationship (an equation of the form y = ax + b –a = $100/credit –b = $250 –x = number of credits

Regression Tuition as a function of credit hours is a straight line There is a perfect correlation between credit hours and tuition You could predict perfectly the tuition required given the number of credit hours

Next Time Regression - read chapter 8