Basic Statistics Correlation Var Relationships Associations.

Presentation transcript:

Basic Statistics Correlation

Var Relationships Associations

The Need for a Measure of Relationship. Individual differences (variance): describe, predict, control, explain.

[Diagram: in research, independent variables X1, X2, X3 and a dependent variable Y. What information do they share, i.e., how do they covary?]

The Concept of Correlation: an association or relationship between two variables, X and Y. The variables covary, that is, they go together. Co-relate, co-relation: r.

Patterns of Covariation

[Figure: three patterns in which X and Y covary (go together): positive correlation, negative correlation, and zero or no correlation.]

Scatter Plots
Scatter plots allow us to visualize relationships. The chief purpose of the scatter diagram is to study the nature of the relationship between two variables:
- Linear/curvilinear relationship
- Direction of relationship
- Magnitude (size) of relationship

Scatter Plot A: an illustration of a perfect positive correlation. Axes: Variable X and Variable Y, each running from low to high. Each point represents both the X and Y scores; Y for a given X can be read as an exact value.

Scatter Plot B: an illustration of a positive correlation. Axes: Variable X and Variable Y, low to high. Y for a given X is an estimated value.

Scatter Plot C: an illustration of a perfect negative correlation. Axes: Variable X and Variable Y, low to high. Y for a given X can be read as an exact value.

Scatter Plot D: an illustration of a negative correlation. Axes: Variable X and Variable Y, low to high. Y for a given X is an estimated value.

Scatter Plot E: an illustration of a zero correlation. Axes: Variable X and Variable Y, low to high.

Scatter Plot F: an illustration of a curvilinear relationship. Axes: Variable X and Variable Y, low to high.
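
Why the linear/curvilinear distinction matters can be seen in a minimal sketch. The data below are hypothetical (they are not from the slides), and the helper name `pearson_r` is an assumption for illustration. A perfect straight line gives r = 1, while a perfectly predictable but U-shaped relationship gives r = 0, because Pearson's r only measures linear association.

```python
import math

def pearson_r(xs, ys):
    """Pearson r from deviation scores (hypothetical helper, not from the slides)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    ss_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return ss_xy / math.sqrt(ss_x * ss_y)

# Hypothetical X scores, symmetric around zero
xs = [-3, -2, -1, 0, 1, 2, 3]
linear = [2 * x + 1 for x in xs]   # straight line, as in Scatter Plot A
curved = [x ** 2 for x in xs]      # U-shaped, a curvilinear pattern like Scatter Plot F

r_linear = pearson_r(xs, linear)
r_curved = pearson_r(xs, curved)
print(r_linear, r_curved)
```

This is one reason to look at the scatter plot before trusting a coefficient near zero.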

The Measurement of Correlation. The degree of correlation between two variables can be described by such terms as “strong,” “low,” “positive,” or “moderate,” but these terms are not very precise. If a correlation coefficient is computed between two sets of scores, the relationship can be described more accurately. The Correlation Coefficient: a statistical summary of the degree and direction of relationship or association between two variables can be computed.

Pearson’s Product-Moment Correlation Coefficient r
- Direction of relationship: sign (+ or -)
- Magnitude: 0 through +1 or 0 through -1
[Scale: negative correlation, no relationship, positive correlation]

The Pearson Product-Moment Correlation Coefficient Recall that the formula for a variance is: If we replaced the second X that was squared with a second variable, Y, it would be: This is called a co-variance and is an index of the relationship between X and Y.
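
The two formulas this slide points to are missing from the transcript. A standard way to write them, assuming the sample form with N - 1 in the denominator (an assumption; the slide may have used N instead), is:

```latex
s_X^2 = \frac{\sum (X - \bar{X})(X - \bar{X})}{N - 1}
\qquad
\mathrm{cov}_{XY} = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{N - 1}
```

Whichever denominator is chosen, it cancels when the covariance is divided by the two standard deviations to form r, so the value of r does not depend on that choice.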

Conceptual Formula for Pearson r This formula may be rewritten to reflect the actual method of calculation

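Calculation of Pearson r
$$r = \frac{\sum (X - \bar{X})(Y - \bar{Y})}{\sqrt{\sum (X - \bar{X})^2 \, \sum (Y - \bar{Y})^2}}$$
You should notice that this formula is merely the sum of squares for covariance divided by the square root of the product of the sums of squares for X and Y.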

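Formulae for Sums of Squares
$$SS_X = \sum X^2 - \frac{(\sum X)^2}{N} \qquad SS_Y = \sum Y^2 - \frac{(\sum Y)^2}{N} \qquad SS_{XY} = \sum XY - \frac{(\sum X)(\sum Y)}{N}$$
Therefore, the formula for calculating r may be rewritten as:
$$r = \frac{SS_{XY}}{\sqrt{SS_X \, SS_Y}}$$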

Calculation of r Using Sums of Squares
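
The deviation-score form of r and the sums-of-squares (raw-score) form should give the same number. A minimal sketch that checks this on hypothetical data; the function names and the five score pairs are assumptions for illustration, not taken from the slides:

```python
import math

def r_from_deviations(xs, ys):
    """r from deviation scores: sum of cross-products over sqrt(SS_X * SS_Y)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    ss_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return ss_xy / math.sqrt(ss_x * ss_y)

def r_from_raw_sums(xs, ys):
    """r from raw-score totals, the form used for hand calculation."""
    n = len(xs)
    sum_x, sum_y = sum(xs), sum(ys)
    ss_x = sum(x * x for x in xs) - sum_x ** 2 / n
    ss_y = sum(y * y for y in ys) - sum_y ** 2 / n
    ss_xy = sum(x * y for x, y in zip(xs, ys)) - sum_x * sum_y / n
    return ss_xy / math.sqrt(ss_x * ss_y)

# Hypothetical scores for five people
hypothetical_x = [1, 2, 3, 4, 5]
hypothetical_y = [2, 1, 4, 3, 5]

r_dev = r_from_deviations(hypothetical_x, hypothetical_y)
r_raw = r_from_raw_sums(hypothetical_x, hypothetical_y)
print(r_dev, r_raw)
```

The raw-score version needs only the five column totals and N, which is why it is the one usually worked by hand.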

An Example. Suppose that a college statistics professor is interested in how the number of hours that students spend studying is related to how many errors they make on the mid-term examination. To determine the relationship, the professor collects the following data:

The Stats Professor’s Data
Columns: Student, Hours Studied (X), Errors (Y), X², Y², XY
Totals: ΣX = 70, ΣY = 73, ΣX² = 546, ΣY² = 695, ΣXY = 429

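The Data Needed to Calculate the Sums of Squares
Totals (N = 10): ΣX = 70, ΣY = 73, ΣX² = 546, ΣY² = 695, ΣXY = 429
$$SS_X = \sum X^2 - \frac{(\sum X)^2}{N} = 546 - \frac{(70)^2}{10} = 546 - 490 = 56$$
$$SS_Y = \sum Y^2 - \frac{(\sum Y)^2}{N} = 695 - \frac{(73)^2}{10} = 695 - 532.9 = 162.1$$
$$SS_{XY} = \sum XY - \frac{(\sum X)(\sum Y)}{N} = 429 - \frac{(70)(73)}{10} = 429 - 511 = -82$$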

Calculating the Correlation Coefficient
$$r = \frac{SS_{XY}}{\sqrt{SS_X \, SS_Y}} = \frac{-82}{\sqrt{(56)(162.1)}} = -0.86$$
Thus, the correlation between hours studied and errors made on the mid-term examination is -0.86, indicating that more time spent studying is related to fewer errors on the mid-term examination. Hopefully an obvious conclusion, but now a statistical one!
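
The professor's calculation can be reproduced from the column totals alone. A minimal sketch, assuming N = 10 (taken from the divisor in the sums-of-squares step); the function name is hypothetical:

```python
import math

def pearson_r_from_totals(n, sum_x, sum_y, sum_x2, sum_y2, sum_xy):
    """Return (SS_X, SS_Y, SS_XY, r) computed from raw-score totals."""
    ss_x = sum_x2 - sum_x ** 2 / n
    ss_y = sum_y2 - sum_y ** 2 / n
    ss_xy = sum_xy - sum_x * sum_y / n
    r = ss_xy / math.sqrt(ss_x * ss_y)
    return ss_x, ss_y, ss_xy, r

# Totals from the stats professor's data; n = 10 is inferred from the "/10" divisor
ss_x, ss_y, ss_xy, r = pearson_r_from_totals(
    n=10, sum_x=70, sum_y=73, sum_x2=546, sum_y2=695, sum_xy=429
)
print(round(ss_x, 1), round(ss_y, 1), round(ss_xy, 1), round(r, 2))
```

Rounded to two decimals, the result matches the -0.86 reported on the slide.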

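Pearson Product-Moment Correlation Coefficient r. [Scale from -1 to +1: perfect negative correlation at -1, zero correlation at 0, perfect positive correlation at +1; negative correlation lies below zero and positive correlation above it.]

[Scale of numerical values of r, running from negative correlation through zero correlation to positive correlation, with regions labeled perfect, strong, and moderate.]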

The Pearson r and Marginal Distribution. The marginal distribution of X is simply the distribution of the X’s; the marginal distribution of Y is the frequency distribution of the Y’s. [Figure: bivariate normal distribution showing the bivariate relationship between X and Y.]

The marginal distributions of X and Y are precisely the same shape. [Figure: distributions of the X variable and the Y variable.]

Interpreting r, the Correlation Coefficient
Recall that r includes two types of information:
- The direction of the relationship (+ or -)
- The magnitude of the relationship (0 to 1)
However, there is a more precise way to use the correlation coefficient, r, to interpret the magnitude of a relationship: the square of the correlation coefficient, or r². The square of r tells us what proportion of the variance of Y can be explained by X, or vice versa.

How does correlation explain variance? An illustration of how the squared correlation accounts for variance in X: r = .7, r² = .49. Suppose you wish to estimate Y for a given value of X: 49% of the variance is explained, and the rest is free to vary. [Figure: scatter plot of Variable X against Variable Y, low to high, with the explained portion marked.]
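
One way to see what "proportion of variance explained" means is to predict Y from X with a least-squares line and compare the leftover variation with the total. The sketch below uses hypothetical scores (not from the slides) and hypothetical variable names; it checks that r² equals 1 minus the unexplained share of the variation in Y.

```python
import math

# Hypothetical paired scores
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]

n = len(xs)
mean_x, mean_y = sum(xs) / n, sum(ys) / n
ss_x = sum((x - mean_x) ** 2 for x in xs)
ss_y = sum((y - mean_y) ** 2 for y in ys)          # total variation in Y
ss_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

r = ss_xy / math.sqrt(ss_x * ss_y)

# Least-squares line for predicting Y from X
slope = ss_xy / ss_x
intercept = mean_y - slope * mean_x
predicted = [intercept + slope * x for x in xs]

# Variation in Y left over after prediction ("free to vary")
ss_residual = sum((y - p) ** 2 for y, p in zip(ys, predicted))
proportion_explained = 1 - ss_residual / ss_y

print(round(r ** 2, 2), round(proportion_explained, 2))
```

For these scores r = .8, so 64% of the variation in Y is accounted for by X and 36% is left unexplained.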

Now, let’s look at some correlation coefficients and their corresponding scatter plots.

What is your estimate of r? r = .87, r² = .76 = 76%

r = -1.00, r² = 1.00 = 100%

r = +1.00, r² = 1.00 = 100%

r = .04, r² = .002 = .2%

r = -.44, r² = .19 = 19%

Pearson r assumes that we are using interval or ratio data. What do we do if one or both of the variables were measured at the ordinal level? If we replace the scores with ranks, we can use the same formula. However, the formula can be simplified when we are using ordinal data. The result is called the Spearman Rank-Order Correlation Coefficient.

Spearman’s Rank Order Correlation. As noted, the Spearman r_S is a special case of the Pearson r (when the data are ordinal). The formula, derived from the Pearson, is as follows, where d is the difference between the two ranks for each case and N is the number of pairs:
$$r_S = 1 - \frac{6 \sum d^2}{N(N^2 - 1)}$$
The characteristics and interpretation of a Spearman r_S are exactly the same as those of a Pearson r. That is, r_S ranges from -1 to +1, and the square provides an estimate of the shared variance.

Spearman Rank Order Correlation Coefficient One or both of the variables are in the form of ranks. Raw data may be converted to ranks, or ranks may be gathered as the original data. Example

Illustrated Calculation. Table columns: X, Y, d = X - Y, d²; N = 4.
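
The ranks used in the slide's table are not in the transcript, so the sketch below substitutes hypothetical ranks for N = 4 cases. It applies the usual difference-of-ranks formula, 1 - 6Σd² / (N(N² - 1)), which assumes no tied ranks, and checks that it agrees with a Pearson r computed directly on the ranks. All names and values are assumptions for illustration.

```python
import math

def spearman_from_ranks(rank_x, rank_y):
    """Spearman r_S from two lists of ranks (assumes no ties)."""
    n = len(rank_x)
    sum_d2 = sum((x - y) ** 2 for x, y in zip(rank_x, rank_y))
    return 1 - (6 * sum_d2) / (n * (n ** 2 - 1))

def pearson_r(xs, ys):
    """Pearson r from deviation scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    ss_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return ss_xy / math.sqrt(ss_x * ss_y)

# Hypothetical ranks for four cases (not the slide's data)
rank_x = [1, 2, 3, 4]
rank_y = [2, 1, 4, 3]

r_s = spearman_from_ranks(rank_x, rank_y)
r_on_ranks = pearson_r(rank_x, rank_y)
print(r_s, r_on_ranks)
```

Here every d is +1 or -1, so Σd² = 4 and r_S = 1 - 24/60 = .60, the same value Pearson's formula gives on the ranks.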

Choosing Between Pearson and Spearman
If the data are ordinal, we have no choice: we have to use Spearman. If the data are interval or ratio, we do have a choice.
- Pearson is more sensitive
- Spearman is easier to compute by hand
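
With interval or ratio data the two coefficients can disagree, because Spearman only uses the ordering of the scores. A minimal sketch on hypothetical data: Y rises with X every time but not in a straight line, so the ranks agree perfectly while Pearson r on the raw scores falls short of 1. The helper names are assumptions, and the rank conversion assumes no tied scores.

```python
import math

def to_ranks(values):
    """Convert raw scores to ranks, 1 = lowest (assumes no tied scores)."""
    rank_of = {value: position + 1 for position, value in enumerate(sorted(values))}
    return [rank_of[value] for value in values]

def pearson_r(xs, ys):
    """Pearson r from deviation scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    ss_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return ss_xy / math.sqrt(ss_x * ss_y)

# Hypothetical raw scores: Y always increases with X, but along a curve
raw_x = [1, 2, 3, 4, 5]
raw_y = [x ** 3 for x in raw_x]

pearson_raw = pearson_r(raw_x, raw_y)
spearman = pearson_r(to_ranks(raw_x), to_ranks(raw_y))  # Pearson on ranks
print(round(pearson_raw, 3), round(spearman, 3))
```

Computing Spearman as a Pearson r on the ranks, rather than with the d² shortcut, is a design choice here: it makes the link between the two coefficients explicit.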

Summary of Measures of Relationship
- Spearman Rank Correlation Coefficient (r_S)
- The Biserial Correlation Coefficient
- The Point-Biserial Correlation Coefficient
- The Phi Correlation Coefficient
- The Tetrachoric Correlation Coefficient
- The Rank-Biserial Correlation Coefficient
There are other correlation coefficients for other levels of measurement. However, we will only study three: the two we have already reviewed and, later, one more for nominal data.

Summarizing Correlations
- Pearson and Spearman correlation coefficients range from -1.0 to +1.0
- Pearson and Spearman correlation coefficients indicate both the direction and the magnitude of the relationship
- Correlation does NOT imply causation