Unit 5 Correlation.

Slides:



Advertisements
Similar presentations
Correlation and Linear Regression.
Advertisements

Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Introduction to Excel 2007 Part 1: Basics and Descriptive Statistics Psych 209.
Linear Regression/Correlation
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Statistical Analysis to show Relationship Strength.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
1 GE5 Tutorial 4 rules of engagement no computer or no power → no lessonno computer or no power → no lesson no SPSS → no lessonno SPSS → no lesson no.
Examining Relationships in Quantitative Research
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 7 Scatterplots, Association, and Correlation.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Chapter 6 Correlation. Different types of correlation coefficient Pearson’s r is applicable to measure the association between two continuous-scaled variables.
Conduct Simple Correlations Section 7. Correlation –A Pearson correlation analyzes relationships between parametric, linear (interval or ratio which are.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Regression and Correlation
CHAPTER 3 Describing Relationships
Inference for Least Squares Lines
Introduction to Regression Analysis
Introduction to Excel 2007 January 29, 2008.
CHAPTER 7 LINEAR RELATIONSHIPS
Political Science 30: Political Inquiry
Module 11 Math 075. Module 11 Math 075 Bivariate Data Proceed similarly as univariate distributions … What is univariate data? Which graphical models.
Correlations and Scatterplots
SIMPLE LINEAR REGRESSION MODEL
Multiple Regression.
Scatterplots A way of displaying numeric data
CHAPTER fourteen Correlation and Regression Analysis
Statistics for the Social Sciences
Chapter 9 Correlational Research Designs
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Understanding Research Results: Description and Correlation
Scatter plots & Association
Theme 7 Correlation.
Chapter 7 Part 1 Scatterplots, Association, and Correlation
CHAPTER 26: Inference for Regression
Linear Regression/Correlation
Day 43 – Calculating Correlation Coefficient using Excel
STEM Fair Graphs.
Chapter 3 Describing Relationships Section 3.2
The Pearson Correlation
Chapter 7 – Correlation & Differential (Quasi)
Review of Chapter 3 Examining Relationships
Linear Regression and Correlation
CHAPTER 3 Describing Relationships
15.1 The Role of Statistics in the Research Process
Correlation A measure of the strength of the linear association between two numerical variables.
CHAPTER 3 Describing Relationships
Linear Regression and Correlation
Summarizing Bivariate Data
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Unit 6 Regression.
Warsaw Summer School 2017, OSU Study Abroad Program
Unit 5 Quiz: Review questions
Honors Statistics Review Chapters 7 & 8
Finding Correlation Coefficient & Line of Best Fit
Introduction to Excel 2007 Part 1: Basics and Descriptive Statistics Psych 209.
Review of Chapter 3 Examining Relationships
Presentation transcript:

Unit 5 Correlation

Relationship Just like it takes two persons to have a romantic relationship, you need at least two variables to form a statistical relationship. A relationship can be shown by a plot or by a number.

Scatterplot/Scattergram The plot for displaying a bivariate relationship is called “scatterplot” or “scattergram” (Cartesian coordinate system). The left-right (horizontal) direction is commonly called the X axis. The up-down (vertical) direction is commonly called the Y axis..

Scatterplot When y = 6, the vertical line goes up by six units. When x = 4, the horizontal line goes across 4 units. The data point is put into the intersection of two lines.

Scatter: Spread out Variations in the data  Scatter everywhere

Different types of correlation coefficient The number to show a correlation is called correlation coefficient. Pearson’s r is applicable to measure the association between two continuous-scaled variables (Our focus). Spearman is suitable to rank-order/ordinal data We will go through the calculation later.

Pearson’s correlation coefficient Range: From -1 to +1. Pearson’s r = 1  Perfect relationship (It may be too good to be true!) Pearson’s r = 0  No relationship Positive but less than 1: Positive relationship Negative but less than -1: Negative relationship Common mistake: When the coefficient is high, it must be a strong relationship!

Pearson’s r = .83

Pearson’s r =.83

Pearson’s r =.83

Pearson’s r =.83

Computation in SPSS Use visulization_data.sav in Unit 4 Analyze  Correlate  Bivariate

Computation in SPSS There is no distinction between DV and IV. Put both college test scores and GPA into variables. The order doesn’t matter. Press OK

Computation in SPSS The correlation of test scores and GPA = the correlation of GPA and test scores Don’t worry about what “significance” (sig) means.

Computation in JMP In Unit 4 folder: visualization_data.jmp I want to know the relationship between overall GPA and a particular college test score. Analyze  Multivariate Methods  Multivariate (It takes multiple variables [at least two])

In correlation there is no distinction between DV (Y) and IV (X), and thus both variables are put into Y, columns. What we care is whether X and Y are related; we don’t care about cause and effect.

Because there is no distinction between DV and IV, the correlation between GPA and test scores (0.5273) is the same as the correlation between the test scores and GPA (0.5273).

Pearson’s r = .5273. It is fair but it could be better! The eclipse (red line) covers the majority of the data. Any points outside the red line are considered outliers (extreme cases).

I can select these outliers (extreme cases) by holding down the shift key and pointing to them one by one.

When I go back to the table, I can see that the extreme cases are highlighted in blue. Mouse over to one of them, right-click and select hide and exclude. These cases will be hidden and excluded.

Back to the graph From the first red triangle, choose Redo  Redo Analysis

Without those outliers, the correlation is stronger. Pearson’s r = .6129

I want to infer from the sample to the population Red Triangle  Pairwise Correlations

The confidence interval shows that in the population the correlation may be as low as .4618 or as high as .7293.

Confidence interval Why do we need a bracket (Confidence interval) to estimate the population correlation coefficient? Because we cannot access to the entire population, we are not sure. If you ask me to guess how old you are and I am not sure, I would also put my estimates within a bracket (interval). Because you are in college, I guess your age is between 18 and 22.

Semi-hand calculation To make it easier and faster, use r_hand_calculation.xlsx

Semi-hand calculation The Greek symbol S = Sum Plug the numbers into the equation: 6(20,485) – (247 × 486) / [√[[6(11,409) – (2472)] × [6(40,022) – 4862]]] = 0.5298

Assignment 5.1 Mary’s highs school GPA is 3.5 and her college GPA is 3.7. Tom’s HS GPA is 3.8 and his college GPA is 4.0. If these data are used for computing correlation, what result would you expect? Write down your expectation before making any computation. Next, use SPSS or JMP to compute the correlation coefficient. Does the result match your expectation? Is it a valid result? Why or why not?

Assignment 5.2 (Canvas) Use visualization_data (Unit 4). Compute the Pearson’s r of GPA and SAT. There is no distinction between DV and IV. You can put them in any order. You can use SPSS, JMP, Excel (semi-hand calculation) or hand-calculation.