Correlation 10/27.

Slides:



Advertisements
Similar presentations
Kin 304 Regression Linear Regression Least Sum of Squares
Advertisements

Regression What is regression to the mean?
CJ 526 Statistical Analysis in Criminal Justice
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Introduction to Probability and Statistics Linear Regression and Correlation.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Correlation 10/30. Relationships Between Continuous Variables Some studies measure multiple variables – Any paired-sample experiment – Training & testing.
Correlation & Regression
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Descriptive Methods in Regression and Correlation
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Chapter 6 & 7 Linear Regression & Correlation
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1.6 Linear Regression & the Correlation Coefficient.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Regression. Population Covariance and Correlation.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
CORRELATION ANALYSIS.
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Correlation.
Scatterplots, Association, and Correlation
Regression and Correlation
Scatterplots Chapter 6.1 Notes.
Regression Analysis AGEC 784.
Correlation & Regression
Computations, and the best fitting line.
The Lease Squares Line Finite 1.3.
Two Quantitative Variables
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Chapter 11: Simple Linear Regression
Correlation 10/27.
Correlation and Regression
CHAPTER 10 Correlation and Regression (Objectives)
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
2. Find the equation of line of regression
Chapter 12 Regression.
Chapter 2 Looking at Data— Relationships
Least-Squares Regression
Lecture Notes The Relation between Two Variables Q Q
Regression.
Regression.
The Weather Turbulence
CORRELATION ANALYSIS.
Introduction to Probability and Statistics Thirteenth Edition
Regression.
The Pearson Correlation
Line of Best Fit.
Regression Chapter 8.
Regression.
Chapter 8 Part 1 Linear Regression
Correlation and Regression
JEOPARDY Functions Linear Models Exponential Models Quadratic Models
Regression.
Topic 8 Correlation and Regression Analysis
Introduction to Regression
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Correlation r Explained
Chapters Important Concepts and Terms
Warsaw Summer School 2017, OSU Study Abroad Program
Honors Statistics Review Chapters 7 & 8
Correlation and Simple Linear Regression
Presentation transcript:

Correlation 10/27

Relationships Between Continuous Variables Some studies measure multiple variables Any paired-sample experiment Training & testing performance; personality variables; neurological measures Continuous independent variables How are these variables related? Positive relationship: tend to be both large or both small Negative relationship: when one is large, other tends to be small Independent: value of one tells nothing about other

Scatterplots Graph of relationship between two variables, X and Y One point per subject Horizontal coordinate X Vertical coordinate Y Weight = 181.7 Height = 67.7

Correlation Measure of how closely two variables are related Direction Population correlation: r (rho) Sample correlation: r Direction r > 0: positive relationship; big X goes with big Y r < 0: negative relationship; big X goes with small Y Strength ±1 means perfect relationship Data lie exactly on a line If you know X, you know Y 0 means no relationship Independent: Knowing X tells nothing about Y

r = -1 r = -.75 r = -.5 r = -.25 r = 0 r = .25 r = .5 r = .75 r = 1

Computing Correlation Get z-scores for both samples Multiply all pairs Get average by dividing by n – 1 Positive relationship Positive zX tend to go with positive zY Negative zX tend to go with negative zY zX  zY tends to be positive Negative relationship Positive zX tend to go with negative zY Negative zX tend to go with positive zY zX  zY tends to be negative

Computing Correlation X Y X – MX Y – MY zX zY zX  zY 5 8 1 .00 .27 10 3 1.16 .80 .93 4 -2 -3 -.77 -.80 .62 9 13 6 1.55 1.60 2.48 2 -5 -1.16 -1.34 7 -1 -.39 -.53 .21 MX = 5 MY = 7 S = 5.80 sX = 2.6 sY = 3.7 r = .97 X Y

Correlation and Linear Relationships Correlation measures how well data fit on straight line Assumes linear relationship between X and Y Not useful for nonlinear relationships Arousal Performance r = 0

Predicting One Variable from Another Knowing one measure gives information about others from same subject Knowing a person’s weight tells about his height Goal: Come up with a rule or function that uses X to compute best estimate of Y Y (Y-hat) Predicted value of Y Function of X Just like X from before: best prediction of data

Linear Prediction Simplest way to predict one variable from another Straight line through data Y is linear function of X X= 71

How Good is the Prediction? Sometimes data fall nearly on a perfect line Strong relationship between variables r near ±1 Good prediction Sometimes data are more scattered Weak relationship r near 0 Can’t predict well X Y X Y X Y

How Good is the Prediction? Goal: Keep error close to zero Minimize mean squared error:

Correlation and Prediction Best prediction line minimizes MSE Closest to data; best “fit” Correlation determines best prediction line Slope = r when plotting z-scores: zY zX r = .75 slope = .75

r = -1 r = -.75 r = -.5 r = -.25 r = 0 r = .25 r = .5 r = .75 r = 1

Reduction from knowing X Explained Variance Without knowing X Knowing X Explained Variance Reduction from knowing X Original Variance Residual Variance

Take-away Points Understand scatterplots Correlation measures relationship between two variables How well data are fit by a straight line Sign of r shows direction of relationship Magnitude of r shows strength of relationship Strongest relationships have r = ±1; weak relationships have r ≈ 0 Best prediction line minimizes error of prediction (MSE) Correlation gives slope of line (when using z-scores): r2 equals proportion of variance in one variable explained by other Reduction from original variance (sY) to residual variance (MSE)