Correlation: Relationship between Variables

Slides:



Advertisements
Similar presentations
Correlation Data collected from students in Statistics classes included their heights (in inches) and weights (in pounds): Here we see a positive association.
Advertisements

Statistics lecture 4 Relationships Between Measurement Variables.
Describing Relationships Using Correlation and Regression
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
Overview Correlation Regression -Definition
Section 7.1 ~ Seeking Correlation
Relationships Between Quantitative Variables
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
BIVARIATE DATA: CORRELATION AND REGRESSION Two variables of interest: X, Y. GOAL: Quantify association between X and Y: correlation. Predict value of Y.
Correlation MARE 250 Dr. Jason Turner.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlation Relationship between Variables. Statistical Relationships What is the difference between correlation and regression? Correlation: measures.
Introduction to Probability and Statistics Linear Regression and Correlation.
Describing Relationships: Scatterplots and Correlation
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Correlational Research Strategy. Recall 5 basic Research Strategies Experimental Nonexperimental Quasi-experimental Correlational Descriptive.
Chapter 6 Prediction, Residuals, Influence Some remarks: Residual = Observed Y – Predicted Y Residuals are errors.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Measurements Variables Chapter 10.
Describing Relationships: Scatter Plots and Correlation ● The world is an indivisible whole (butterfly effect and chaos theory; quantum entanglement, etc.)
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
STAT 211 – 019 Dan Piett West Virginia University Lecture 2.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Copyright © 2011 Pearson Education, Inc. Slide 5-1 Unit 5E Correlation Coefficient.
9.1 Correlation Key Concepts: –Scatter Plots –Correlation –Sample Correlation Coefficient, r –Hypothesis Testing for the Population Correlation Coefficient,
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
Run the colour experiment where kids write red, green, yellow …first.
MEASURES of CORRELATION. CORRELATION basically the test of measurement. Means that two variables tend to vary together The presence of one indicates the.
Essential Statistics Chapter 41 Scatterplots and Correlation.
Correlation Correlation is used to measure strength of the relationship between two variables.
The Correlational Research Strategy
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
CHAPTER 4 SCATTERPLOTS AND CORRELATION BPS - 5th Ed. Chapter 4 1.
Chapter 4 Scatterplots and Correlation. Explanatory and Response Variables u Interested in studying the relationship between two variables by measuring.
Chapter 16 Data Analysis: Testing for Associations.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Correlations: Relationship, Strength, & Direction Scatterplots are used to plot correlational data – It displays the extent that two variables are related.
Bivariate Data AS (3 credits) Complete a statistical investigation involving bi-variate data.
Linear regression Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Creating a Residual Plot and Investigating the Correlation Coefficient.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Chapter 141 Describing Relationships: Scatterplots and Correlation.
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
BPS - 5th Ed. Chapter 41 Scatterplots and Correlation.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
CORRELATION ANALYSIS.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Correlation & Linear Regression Using a TI-Nspire.
Chapter 2 Bivariate Data Scatterplots.   A scatterplot, which gives a visual display of the relationship between two variables.   In analysing the.
Essential Statistics Chapter 41 Scatterplots and Correlation.
Statistics 200 Lecture #6 Thursday, September 8, 2016

Basic Practice of Statistics - 3rd Edition
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 5th Edition
2. Find the equation of line of regression
CORRELATION ANALYSIS.
Scatterplots and Correlation
Essential Statistics Scatterplots and Correlation
Warsaw Summer School 2017, OSU Study Abroad Program
Plan for Today: Chapter 14: Describing Relationships: Scatterplots and Correlation.
Presentation transcript:

Correlation: Relationship between Variables

Statistical Relationships versus Deterministic Relationships Deterministic: if we know the value of one variable, we can determine the value of the other exactly. e.g. relationship between volume and weight of water. Statistical: natural variability exists in both measurements. Useful for describing what happens to a population or aggregate.

Measuring Strength Through Correlation A Linear Relationship Correlation (or the Pearson product-moment correlation or the correlation coefficient) represented by the letter r: Indicator of how closely the values fall to a straight line. Measures linear relationships only; that is, it measures how close the individual points in a scatterplot are to a straight line.

Features of the Correlation Coefficient It indicates the strength of the relationship and whether there is a positive or negative relationship. The correlation coefficient is a number between -1 and 1. A positive correlation indicates that the variables increase together. A negative correlation indicates that as one variable increases, the other decreases. Correlation of +1/-1 indicates a perfect linear positive/negative relationship between the two variables; as one increases, the other increases/decreases at a constant rate (a deterministic linear relationship).

Features of the Correlation Coefficient Correlation of zero could indicate no linear relationship between the two variables, or that the best straight line through the data on a scatterplot is exactly horizontal. The closer the correlation is to 1 or -1, the stronger the relationship. The closer it is to 0, the weaker the relationship. Crude estimate: > |.5|? most likely a relationship < |.3|? correlation essentially non-existent |.3| < r < |.5|? gray area! Correlations are unaffected if the units of measurement are changed. For example, the correlation between weight and height remains the same regardless of whether height is expressed in inches, feet or millimeters (as long as it isn’t rounded off).

What to discuss? To discuss the relationships between two variables we look at: Statistical Significance Strength of Correlation

Statistical Significance A relationship is statistically significant if that relationship is stronger than 95% of the relationships we would expect to see just by chance.

Statistical Significance What r value is statistically significant? It depends on the size of the sample. The following table indicates the lowest r value that would indicate a statistically significant relationship for a given sample size. A more complete table is in the Excel file. Critical Values for the Correlation Coefficient  Number of Points 95% Confidence 3 0.997 4 0.950 5 0.878 6 0.811 7 0.754 8 0.707 9 0.666 10 0.632 11 0.602 12 0.576 13 0.553 14 0.532 15 0.514 16 0.497 17 0.482 18 0.468 19 0.456 20 0.444

Two Warnings about Statistical Significance Even a minor relationship will achieve “statistical significance” if the sample is very large. A very strong relationship won’t necessarily achieve “statistical significance” if the sample is very small.

Example - Verbal SAT and GPA Scatterplot of GPA and verbal SAT score. The correlation coefficient is .485, indicating a moderate positive relationship. Higher verbal SAT scores tend to indicate higher GPAs as well, but the relationship is nowhere close to being exact.

Example - Husbands’ and Wifes’ Ages and Heights Scatterplot of British husbands’ and wives’ heights (in millimeters); r = .36 Scatterplot of British wives’ and husbands’ and ages; r = .94 Discuss the strength of the relationships. Are they positively or negatively correlated?

Example - Occupational Prestige and Suicide Rates Plot of suicide rate versus occupational prestige for 36 occupations. Correlation of .109. There is a weak positive relationship If outlier removed r drops to .018.

Statistical Relationships What is the difference between correlation and regression? Correlation: measures the strength of a relationship between two measurement variables. Regression: uses the equation of the trendline to predict one measurement variable from another.