

Correlation 10/30

Relationships Between Continuous Variables
Some studies measure multiple variables
– Any paired-sample experiment
– Training & testing performance; personality variables; neurological measures
– Continuous independent variables
How are these variables related?
– Positive relationship: tend to be both large or both small
– Negative relationship: when one is large, other tends to be small
– Independent: value of one tells nothing about other

Scatterplots
Graph of relationship between two variables, X and Y
One point per subject
– Horizontal coordinate X
– Vertical coordinate Y
[Figure: scatterplot with one example point labeled Height = 67.7, Weight = 181.7]

Correlation
Measure of how closely two variables are related
– Population correlation: ρ (rho)
– Sample correlation: r
Direction
– r > 0: positive relationship; big X goes with big Y
– r < 0: negative relationship; big X goes with small Y
Strength
– ±1 means perfect relationship
  Data lie exactly on a line
  If you know X, you know Y
– 0 means no (linear) relationship
  Independent: Knowing X tells nothing about Y

[Figure: nine scatterplots with r = -1, -.75, -.5, -.25, 0, .25, .5, .75, 1]

Computing Correlation
1. Get z-scores for both samples
2. Multiply all pairs
3. Get average by dividing by n – 1
Positive relationship
– Positive z_X tend to go with positive z_Y
– Negative z_X tend to go with negative z_Y
– z_X × z_Y tends to be positive
Negative relationship
– Positive z_X tend to go with negative z_Y
– Negative z_X tend to go with positive z_Y
– z_X × z_Y tends to be negative
[Figure: two scatterplots split into quadrants at M_X and M_Y, with regions labeled z_X > 0, z_X < 0, z_Y > 0, z_Y < 0]
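The three steps above can be sketched in a few lines of Python. This is a minimal illustration, not code from the slides: the function name `correlation` and the height/weight values are hypothetical, and the standard deviation is assumed to use the same n - 1 denominator as step 3.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Sample correlation: z-score both samples, multiply pairs, divide the sum by n - 1."""
    n = len(xs)
    mx, my = mean(xs), mean(ys)
    sx, sy = stdev(xs), stdev(ys)          # stdev uses the n - 1 denominator
    zx = [(x - mx) / sx for x in xs]       # step 1: z-scores for X
    zy = [(y - my) / sy for y in ys]       # step 1: z-scores for Y
    products = [a * b for a, b in zip(zx, zy)]  # step 2: multiply all pairs
    return sum(products) / (n - 1)         # step 3: average using n - 1

# Hypothetical data, made up for illustration only
heights = [62, 65, 67, 70, 73]
weights = [120, 140, 155, 170, 200]
r = correlation(heights, weights)
print(round(r, 2))
```

Because taller people in this made-up sample are also heavier, most z-score products are positive and r comes out positive.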

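Computing Correlation
[Table: columns X, Y, X – M_X, Y – M_Y, z_X, z_Y, z_X × z_Y; the data rows are not preserved]
M_X = 5, M_Y = 7
s_X = 2.6, s_Y = 3.7
Σ z_X × z_Y = 5.80
r = .97
[Figure: scatterplot of Y against X]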

Correlation and Linear Relationships
Correlation measures how well data fit on straight line
– Assumes linear relationship between X and Y
Not useful for nonlinear relationships
[Figure: Performance plotted against Arousal, a nonlinear relationship with r = 0]
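A small sketch of why r can be 0 even when Y depends strongly on X. The arousal and performance numbers below are invented for illustration (an inverted U, symmetric around its peak); they are not the data behind the slide's figure.

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Sample correlation from z-scores, using the n - 1 denominator."""
    n = len(xs)
    mx, my = mean(xs), mean(ys)
    sx, sy = stdev(xs), stdev(ys)
    return sum(((x - mx) / sx) * ((y - my) / sy) for x, y in zip(xs, ys)) / (n - 1)

# Hypothetical inverted-U data: performance peaks at moderate arousal
arousal = [1, 2, 3, 4, 5, 6, 7]
performance = [9 - (a - 4) ** 2 for a in arousal]  # [0, 5, 8, 9, 8, 5, 0]

r = correlation(arousal, performance)
print(abs(r) < 1e-9)  # the rising and falling halves cancel, so r is essentially 0
```

Performance is completely determined by arousal here, yet no straight line captures the relationship, so the correlation vanishes.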

Predicting One Variable from Another
Knowing one measure gives information about others from same subject
– Knowing a person’s weight tells about his height
Goal: Come up with a rule or function that uses X to compute best estimate of Y
Ŷ (Y-hat)
– Predicted value of Y
– Function of X
– Best prediction of Y based on X

Linear Prediction
Simplest way to predict one variable from another
Straight line through data
Ŷ is linear function of X
[Figure: prediction line through the scatterplot, with an example at X = 71]

How Good is the Prediction?
Sometimes data fall nearly on a perfect line
– Strong relationship between variables
– r near ±1
– Good prediction
Sometimes data are more scattered
– Weak relationship
– r near 0
– Can’t predict well
[Figure: three scatterplots of Y against X with differing amounts of scatter]

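How Good is the Prediction?
Goal: Keep error close to zero
– Minimize mean squared error (MS_Error): the average of the squared differences between each observed Y and its predicted value Ŷ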

Correlation and Prediction
Best prediction line minimizes MS_Error
– Closest to data; best “fit”
Correlation determines best prediction line
– Slope = r when plotting z-scores: predicted z_Y = r × z_X
[Figure: z_Y plotted against z_X with r = .75; the best-fit line has slope = .75]
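The claim that the error-minimizing slope equals r on z-scores can be checked numerically. The sketch below is only an illustration under stated assumptions: the data are made up, the helper names `z_scores` and `ms_error` are hypothetical, MS_Error is assumed to divide by n - 1, and the best slope is found by a crude grid search rather than a formula.

```python
from statistics import mean, stdev

def z_scores(values):
    """Convert raw scores to z-scores (standard deviation uses n - 1)."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def ms_error(zx, zy, slope):
    """Mean squared error of the line z_Y-hat = slope * z_X (assumed n - 1 denominator)."""
    n = len(zx)
    return sum((y - slope * x) ** 2 for x, y in zip(zx, zy)) / (n - 1)

# Hypothetical data, for illustration only
xs = [1, 2, 3, 4, 5, 6]
ys = [2, 1, 4, 3, 6, 5]
zx, zy = z_scores(xs), z_scores(ys)

r = sum(a * b for a, b in zip(zx, zy)) / (len(xs) - 1)

# Try every slope from -1.00 to 1.00 in steps of .01 and keep the one with least error
candidates = [i / 100 for i in range(-100, 101)]
best_slope = min(candidates, key=lambda b: ms_error(zx, zy, b))

print(round(r, 3), best_slope)
```

The grid search lands on the candidate slope closest to r, which is what the slide's "slope = r" rule predicts.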

[Figure: nine scatterplots with r = -1, -.75, -.5, -.25, 0, .25, .5, .75, 1]

Explained Variance
Without knowing X: Original Variance
Knowing X: Residual Variance
Explained Variance: Reduction from knowing X
[Figure: original variance split into explained variance and residual variance]

Properties of Correlation
Measures relationship between two continuous variables
– How well data are fit by a straight line
Sign of r shows direction of relationship
Magnitude of r shows strength of relationship
– Strongest relationships have r = ±1; weak relationships have r ≈ 0
Best prediction line minimizes error of prediction (MS_Error)
– Correlation gives slope of line (when using z-scores): predicted z_Y = r × z_X
r² equals proportion of variance in one variable explained by other
– Reduction from original variance (s_Y²) to residual variance (MS_Error)
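The last property can be verified on a small example. This is a sketch with invented data, and it assumes MS_Error uses the same n - 1 denominator as the sample variance s_Y²; with that assumption, the proportional reduction in variance matches r² exactly.

```python
from statistics import mean, stdev, variance

# Hypothetical data, for illustration only
xs = [1, 2, 3, 4, 5, 6]
ys = [2, 1, 4, 3, 6, 5]
n = len(xs)

mx, my = mean(xs), mean(ys)
sx, sy = stdev(xs), stdev(ys)
zx = [(x - mx) / sx for x in xs]
zy = [(y - my) / sy for y in ys]
r = sum(a * b for a, b in zip(zx, zy)) / (n - 1)

# Best linear prediction in raw units: convert z_Y-hat = r * z_X back to Y's scale
y_hat = [my + r * sy * z for z in zx]

# Residual variance (assumed n - 1 denominator, matching the sample variance)
ms_err = sum((y - yh) ** 2 for y, yh in zip(ys, y_hat)) / (n - 1)

original_variance = variance(ys)            # s_Y squared
proportion_explained = 1 - ms_err / original_variance

print(round(proportion_explained, 4), round(r ** 2, 4))  # the two values agree
```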

Review
Find the correlation of r = .7.
[Figure: four scatterplots labeled A, B, C, D]

Review
Calculate the correlation between X and Y.
z_X = [ ]
z_Y = [ ]
A. -.94
B. -.70
C. -.02
D. -.001

Review
The correlation between IQ and number of bicycles owned is r = .6. Predict the IQ of someone who owns 4 bikes (z_bike = 2.5). Recall that µ_IQ = 100 and σ_IQ = 15.
A.
B.
C.
D. 137.5
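One way to work this kind of problem is to predict in z-scores first (predicted z_Y = r × z_X) and then convert back to raw units. The helper below is a hypothetical sketch that applies this to the numbers given in the question; the function name `predict_from_z` is not from the slides.

```python
def predict_from_z(r, z_x, mean_y, sd_y):
    """Predict Y from a z-score on X: scale z_X by r, then convert back to Y's units."""
    z_y_hat = r * z_x                 # predicted z-score on Y
    return mean_y + z_y_hat * sd_y    # back to raw units

# Values taken from the review question: r = .6, z_bike = 2.5, mean IQ 100, SD 15
predicted_iq = predict_from_z(0.6, 2.5, 100, 15)
print(round(predicted_iq, 1))
```

Because r is less than 1, the predicted z-score (1.5) is closer to the mean than the bicycle z-score (2.5), so the prediction is less extreme than 137.5, the value obtained by ignoring r.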