With the growth of internet service providers, a researcher decides to examine whether there is a correlation between cost of internet service per.

Slides:



Advertisements
Similar presentations
Sleeping and Happiness
Advertisements

Managerial Economics in a Global Economy
Forecasting Using the Simple Linear Regression Model and Correlation
Inference for Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Cal State Northridge  320 Andrew Ainsworth PhD Regression.
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Linear Regression and Correlation
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
SIMPLE LINEAR REGRESSION
Simple Linear Regression Analysis
Lecture 5 Correlation and Regression
Lecture 16 Correlation and Coefficient of Correlation
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
CORRELATION & REGRESSION
Correlation and Regression
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Chapter 13 Multiple Regression
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different.
Example You give 100 random students a questionnaire designed to measure attitudes toward living in dormitories Scores range from 1 to 7 –(1 = unfavorable;
You can calculate: Central tendency Variability You could graph the data.
SPSS SPSS Problem # (7.19) 7.11 (b) You can calculate: Central tendency Variability You could graph the data.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Practice Does drinking milkshakes affect (alpha =.05) your weight? To see if milkshakes affect a persons weight you collected data from 5 sets of twins.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Practice You recently finished giving 5 Villanova students the MMPI paranoia measure. Determine if Villanova students’ paranoia score is significantly.
Linear Regression Hypothesis testing and Estimation.
CHAPTER 12 More About Regression
Chapter 14 Introduction to Multiple Regression
Regression and Correlation
Chapter 4 Basic Estimation Techniques
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Basic Estimation Techniques
CHAPTER 12 More About Regression
Review. Review Statistics Needed Need to find the best place to draw the regression line on a scatter plot Need to quantify the cluster.
Statistical Inference about Regression
Regression Chapter 8.
Correlation and Regression
You can calculate: Central tendency Variability You could graph the data.
Basic Practice of Statistics - 3rd Edition Inference for Regression
SIMPLE LINEAR REGRESSION
CHAPTER 12 More About Regression
SIMPLE LINEAR REGRESSION
CHAPTER 12 More About Regression
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Sleeping and Happiness
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

With the growth of internet service providers, a researcher decides to examine whether there is a correlation between cost of internet service per month (rounded to the nearest dollar) and degree of customer satisfaction (on a scale of 1 - 10 with a 1 being not at all satisfied and a 10 being extremely satisfied). The researcher only includes programs with comparable types of services. Determine if customers should be happy about paying more. dollars satisfaction 11 6 18 8 17 10 15 4 9 5 12 3 19 22 2 25

Practice Situation 1 Based on a sample of 100 subjects you find the correlation between extraversion is happiness is r=.15. Determine if this value is significantly different than zero. Situation 2 Based on a sample of 600 subjects you find the correlation between extraversion is happiness is r=.15. Determine if this value is significantly different than zero.

Step 1 Situation 1 H1: r is not equal to 0 H0: r is equal to zero The two variables are related to each other H0: r is equal to zero The two variables are not related to each other Situation 2

Step 2 Situation 1 df = 98 t crit = +1.985 and -1.984 Situation 2

Step 3 Situation 1 r = .15 Situation 2

Step 4 Situation 1 Situation 2

Step 5 Situation 1 If tobs falls in the critical region: Reject H0, and accept H1 If tobs does not fall in the critical region: Fail to reject H0 Situation 2

Step 6 Situation 1 Based on a sample of 100 subjects you find the correlation between extraversion is happiness is r=.15. Determine if this value is significantly different than zero. There is not a significant relationship between extraversion and happiness Situation 2 Based on a sample of 600 subjects you find the correlation between extraversion is happiness is r=.15. Determine if this value is significantly different than zero. There is a significant relationship between extraversion and happiness.

Practice You collect data from 53 females and find the correlation between candy and depression is -.40. Determine if this value is significantly different than zero. You collect data from 53 males and find the correlation between candy and depression is -.50. Determine if this value is significantly different than zero.

Practice You collect data from 53 females and find the correlation between candy and depression is -.40. t obs = 3.12 t crit = 2.00 You collect data from 53 males and find the correlation between candy and depression is -.50. t obs = 4.12

Practice You collect data from 53 females and find the correlation between candy and depression is -.40. You collect data from 53 males and find the correlation between candy and depression is -.50. Is the effect of candy significantly different for males and females?

Hypothesis H1: the two correlations are different H0: the two correlations are not different

Testing Differences Between Correlations Must be independent for this to work

When the population value of r is not zero the distribution of r values gets skewed Easy to fix! Use Fisher’s r transformation Page 746

Testing Differences Between Correlations Must be independent for this to work

Testing Differences Between Correlations

Testing Differences Between Correlations

Testing Differences Between Correlations

Testing Differences Between Correlations Note: what would the z value be if there was no difference between these two values (i.e., Ho was true)

Testing Differences Z = -.625 What is the probability of obtaining a Z score of this size or greater, if the difference between these two r values was zero? p = .267 If p is < .025 reject Ho and accept H1 If p is = or > .025 fail to reject Ho The two correlations are not significantly different than each other!

Remember this: Statistics Needed Need to find the best place to draw the regression line on a scatter plot Need to quantify the cluster of scores around this regression line (i.e., the correlation coefficient)

Regression allows us to predict! . . . . .

Straight Line Y = mX + b Where: Y and X are variables representing scores m = slope of the line (constant) b = intercept of the line with the Y axis (constant)

Excel Example

That’s nice but. . . . How do you figure out the best values to use for m and b ? First lets move into the language of regression

Straight Line Y = mX + b Where: Y and X are variables representing scores m = slope of the line (constant) b = intercept of the line with the Y axis (constant)

Regression Equation Y = a + bX Where: Y = value predicted from a particular X value a = point at which the regression line intersects the Y axis b = slope of the regression line X = X value for which you wish to predict a Y value

Practice Y = -7 + 2X What is the slope and the Y-intercept? Determine the value of Y for each X: X = 1, X = 3, X = 5, X = 10

Practice Y = -7 + 2X What is the slope and the Y-intercept? Determine the value of Y for each X: X = 1, X = 3, X = 5, X = 10 Y = -5, Y = -1, Y = 3, Y = 13

Finding a and b Uses the least squares method Minimizes Error Error = Y - Y  (Y - Y)2 is minimized

. . . . .

. . . . . Error = Y - Y  (Y - Y)2 is minimized Error = 1 Error = .5

Finding a and b Ingredients COVxy Sx2 Mean of Y and X

Regression

Regression Ingredients Mean Y =4.6 Mean X = 3 Covxy = 3.75 S2X = 2.50

Regression Ingredients Mean Y =4.6 Mean X = 3 Covxy = 3.75 S2x = 2.50

Regression Ingredients Mean Y =4.6 Mean X = 3 Covxy = 3.75 S2x = 2.50

Regression Equation Y = a + bx Equation for predicting smiling from talking Y = .10+ 1.50(x)

Regression Equation Y = .10+ 1.50(x) How many times would a person likely smile if they talked 15 times?

Regression Equation Y = .10+ 1.50(x) How many times would a person likely smile if they talked 15 times? 22.6 = .10+ 1.50(15)

Y = 0.1 + (1.5)X . . . . .

Y = 0.1 + (1.5)X X = 1; Y = 1.6 . . . . . .

Y = 0.1 + (1.5)X X = 5; Y = 7.60 . . . . . . .

Y = 0.1 + (1.5)X . . . . . . .

Mean Y = 14.50; Sy = 4.43 Mean X = 6.00; Sx= 2.16 Quantify the relationship with a correlation and draw a regression line that predicts aggression.

∑XY = 326 ∑Y = 58 ∑X = 24 N = 4

∑XY = 326 ∑Y = 58 ∑X = 24 N = 4

COV = -7.33 Sy = 4.43 Sx= 2.16

COV = -7.33 Sy = 4.43 Sx= 2.16

Regression Ingredients Mean Y =14.5 Mean X = 6 Covxy = -7.33 S2X = 4.67

Regression Ingredients Mean Y =14.5 Mean X = 6 Covxy = -7.33 S2X = 4.67

Regression Equation Y = a + bX Y = 23.92 + (-1.57)X

Y = 23.92 + (-1.57)X . 22 20 . 18 16 . 14 . 12 10

Y = 23.92 + (-1.57)X . . 22 20 . 18 16 . 14 . 12 10

Y = 23.92 + (-1.57)X . . 22 20 . 18 16 . 14 . 12 . 10

Y = 23.92 + (-1.57)X . . 22 20 . 18 16 . 14 . 12 . 10

Hypothesis Testing Have learned How to calculate r as an estimate of relationship between two variables How to calculate b as a measure of the rate of change of Y as a function of X Next determine if these values are significantly different than 0

Testing b The significance test for r and b are equivalent If X and Y are related (r), then it must be true that Y varies with X (b). Important to learn b significance tests for multiple regression

Steps for testing b value 1) State the hypothesis 2) Find t-critical 3) Calculate b value 4) Calculate t-observed 5) Decision 6) Put answer into words

Practice You are interested in if candy consumption significantly alters a persons depression. Create a graph showing the relationship between candy consumption and depression (note: you must figure out which is X and which is Y)

Practice Candy Depression Charlie 5 55 Augustus 7 43 Veruca 4 59 Mike 108 Violet 65

Step 1 H1: b is not equal to 0 H0: b is equal to zero

Step 2 Calculate df = N - 2 Page 747 df = 3 First Column are df Look at an alpha of .05 with two-tails t crit = 3.182 and -3.182

Step 3 Candy Depression Charlie 5 55 Augustus 7 43 Veruca 4 59 Mike 3 108 Violet 65 COV = -30.5 N = 5 r = -.81 Sy = 24.82 Sx = 1.52

Step 3 Y = 127 + -13.26(X) b = -13.26 COV = -30.5 N = 5 r = -.81 Sx = 1.52 Sy = 24.82

Step 4 Calculate t-observed b = Slope Sb = Standard error of slope

Step 4 Syx = Standard error of estimate Sx = Standard Deviation of X

Step 4 Sy = Standard Deviation of y r = correlation between x and y

Note

. . . . . Error = Y - Y  (Y - Y)2 is minimized Error = 1 Error = .5

Step 4 Sy = Standard Deviation of y r = correlation between x and y

Step 4 Syx = Standard error of estimate Sx = Standard Deviation of X

Step 4 Syx = Standard error of estimate Sx = Standard Deviation of X

Step 4 Calculate t-observed b = Slope Sb = Standard error of slope

Step 4 Calculate t-observed b = Slope Sb = Standard error of slope

Step 4 Note: same value at t-observed for r

Step 5 If tobs falls in the critical region: Reject H0, and accept H1 If tobs does not fall in the critical region: Fail to reject H0

t distribution tcrit = -3.182 tcrit = 3.182

t distribution tcrit = -3.182 tcrit = 3.182 -2.39

Step 5 If tobs falls in the critical region: Reject H0, and accept H1 If tobs does not fall in the critical region: Fail to reject H0

Practice

Practice Page 288 9.18

9.18 The regression equation for faculty shows that the best estimate of starting salary for faculty is $15,000 (intercept). For every additional year the salary increases on average by $900 (slope). For administrative staff the best estimate of starting salary is $10,000 (slope), for every additional year the salary increases on average by $1500 (slope). They will be equal at 8.33 years of service.

Practice Page 290 9.23

9.23 r = .68 r1 = .829 r = .51 r1 = .563 Z = .797 p = .2119 Correlations are not different from each other

SPSS Problem #3 Due March 14th Page 287 9.2 9.3 9.10 and create a graph by hand