Introduction to Statistics Introduction to Statistics Correlation Chapter 15 April 23-28, 2009 Classes #27-28.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Inference for Linear Regression (C27 BVD). * If we believe two variables may have a linear relationship, we may find a linear regression line to model.
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and Regression
Describing Relationships Using Correlation and Regression
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Correlation Correlation is the relationship between two quantitative variables. Correlation coefficient (r) measures the strength of the linear relationship.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
CORRELATON & REGRESSION
PPA 501 – Analytical Methods in Administration Lecture 8 – Linear Regression and Correlation.
PPA 415 – Research Methods in Public Administration
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Twelve Multiple Regression and Correlation Analysis GOALS When.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
CORRELATION COEFFICIENTS What Does a Correlation Coefficient Indicate? What is a Scatterplot? Correlation Coefficients What Could a Low r mean? What is.
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
SIMPLE LINEAR REGRESSION
Correlation and Regression Analysis
Week 9: Chapter 15, 17 (and 16) Association Between Variables Measured at the Interval-Ratio Level The Procedure in Steps.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Correlation and Regression Quantitative Methods in HPELS 440:210.
Correlation and Linear Regression
Correlation and Linear Regression
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
STATISTICS ELEMENTARY C.M. Pascual
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH Lesson 14 Correlation & Regression.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Linear Regression and Correlation
Correlation and Regression
Correlation.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 15 Correlation and Regression
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Anthony Greene1 Correlation The Association Between Variables.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Hypothesis of Association: Correlation
Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Elementary Statistics Correlation and Regression.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 15: Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Introduction to Statistics Introduction to Statistics Correlation Chapter 15 Apr 29-May 4, 2010 Classes #28-29.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
The Statistical Imagination Chapter 15. Correlation and Regression Part 2: Hypothesis Testing and Aspects of a Relationship.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Correlation. u Definition u Formula Positive Correlation r =
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Understand.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Solutions to Healey 2/3e #13.1 (1e #15.1) Turnout by Unemployment and by Negative Campaigning.
Chapter 12: Correlation and Linear Regression 1.
Chapter Thirteen McGraw-Hill/Irwin
Presentation transcript:

Introduction to Statistics Introduction to Statistics Correlation Chapter 15 April 23-28, 2009 Classes #27-28

Correlation A statistical technique that is used to measure and describe a relationship between two variables –For example: GPA and TD’s scored Statistics exam scores and amount of time spent studying

Notation A correlation requires two scores for each individual –One score from each of the two variables –They are normally identified as X and Y

Three characteristics of X and Y are being measured… The direction of the relationship –Positive or negative The form of the relationship –Usually linear form The strength or consistency of the relationship –Perfect correlation = 1.00; no consistency would be 0.00 –Therefore, a correlation measures the degree of relationship between two variables on a scale from 0.00 to 1.00.

Assumptions There are 3 main assumptions… –1. The dependent and independent are normally distributed. We can test this by looking at the histograms for the two variables –2. The relationship between X and Y is linear. We can check this by looking at the scattergram –3. The relationship is homoscedastic. We can test homoscedasticity by looking at the scattergram and observing that the data points form a “roughly symmetrical, cigar-shaped pattern” about the regression line. If the above 3 assumptions have been met, then we can use correlation and test r for significance

Pearson r The most commonly used correlation Measures the degree of straight-line relationship Computation: r = SP / (SS X )(SS Y )

Example 1 X Y X ,444 2,704 8,100 9,025 22,173 Y 2 25,600 32,400 44,100 57, ,100 XY 4,800 6,840 9,360 18,900 22,800 62,700 (  X) (  X 2 ) (  Y) (  Y 2 ) (  XY)

Example 1 SS X SS X = X2 X2 X2 X2 - (  X) 2 (  X) 2 = 22, = n 5 = 22, /5 = 22, ,605 = 3,568 SS Y = Y2 Y2 - (  Y) 2 = 192, = n 5 = 192, ,900/5 = 192, ,180 = 3,920

Example 1 SP =  XY  XY - (  X)(  Y) (  X)(  Y) = n 62,700 - (305)(970) 5 = 62, ,850/5 = 62, ,170 = 3,530

Example 1 r = SP / (SS X )(SS Y ) = 3,530 / (3,568)(3,920) = 3,530 / 13,986,560 = 3,530 / 3, =.944

Coefficient of Determination (r 2 ) The value r 2 is called the coefficient of determination because it measures the proportion in variability in one variable that can be determined from the relationship with the other variable –For example: A correlation of r =.42 (or r = -.42) means that r 2 =.17 (or 17%) of the variability in the Y scores can be predicted from the relationship with the X scores

Coefficient of Determination (r 2 ) and Interpret: The coefficient of determination is r 2 =.891. Education, by itself, explains 89.1% of the variation in voter turnout.

Example 2 A researcher predicts that there is a high correlation between years of education and voter turnout –She chooses Alamosa, Boston, Chicago, Detroit, and NYC to test her theory

Example 2 The scores on each variable are displayed in table format: –Y = % Turnout –X = Years of Education CityXY Alamosa Boston Chicago Detroit NYC13.070

Scatterplot The relationship between X and Y is linear.

Make a Computational Table XYX2X2 Y2Y2 XY ∑ X =∑Y =∑ X 2 =∑Y 2 = ∑XY =

Find Pearson’s r and Interpret:

Pearson’s r Had the relationship between % college educated and turnout, r =.32. –This relationship would have been positive and weak to moderate. Had the relationship between % college educated and turnout, r = –This relationship would have been negative and weak.

Find the Coefficient of Determination (r 2 ) and Interpret:

Hypothesis Testing with Pearson We can have a two-tailed hypothesis: H o : ρ = 0.0 H 1 : ρ ≠ 0.0 We can have a one-tailed hypothesis: H o : ρ = 0.0 H 1 : ρ 0.0) Note that ρ (rho) is the population parameter, while r is the sample statistic

Find r critical See Table B.6 (page 537) –You need to know the alpha level –You need to know the sample size –See that we always will use: df = n-2

Find r calculated See previous slides for formulas

Make you decision… r calculated < r critical then Retain H 0 r calculated > r critical then Reject H 0

Always include a brief summary of your results: Was it positive or negative? Was it significant ? Explain the correlation Explain the variation –Coefficient of Determination (r 2 )

Credits Example using Healey P. 418 Problem 15.1