13 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter Thirteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Linear Regression and Correlation.
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson11-1 Lesson 11: Linear Regression and Correlation.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Correlation and Regression Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Chapter 12a Simple Linear Regression
Ka-fu Wong © 2003 Chap Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Simple Linear Regression Analysis
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression Analysis
Correlation and Linear Regression
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Correlation and Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Linear Regression and Correlation
Correlation and Linear Regression
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Bivariate Regression (Part 1) Chapter1212 Visual Displays and Correlation Analysis Bivariate Regression Regression Terminology Ordinary Least Squares Formulas.
Chapter 6 & 7 Linear Regression & Correlation
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Examining Relationships in Quantitative Research
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
14- 1 Chapter Fourteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Lecture 10: Correlation and Regression Model.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 12 Simple Linear Regression.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 13 Simple Linear Regression
Simple Linear Regression
Chapter 11 Simple Regression
Chapter 13 Simple Linear Regression
Inferential Statistics and Probability a Holistic Approach
Multiple Regression Chapter 14.
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Chapter Thirteen McGraw-Hill/Irwin
Linear Regression and Correlation
Chapter 13 Simple Linear Regression
Presentation transcript:

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. When you have completed this chapter, you will be able to: Identify a relationship between variables on a scatter diagram Measure and interpret a degree of relationship by a coefficient of correlation Conduct a test of hypothesis about the coefficient of correlation in a population Identify the roles of dependent and independent variables, the concept of regression, and its distinction from the concept of correlation.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Conduct a test of hypothesis for a regression model and each coefficient of regression Conduct analysis of variance and calculate coefficient of determination. Estimate confidence and prediction intervals Measure and interpret the strength of relationship between two variables through a regression line and the technique of least squares.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. T erminology …is a chart that portrays the relationship between the two variables. Scatter Diagram Correlation Analysis …is a group of statistical techniques used to measure the strength of the association between two variables. Dependent Variable …is the variable being predicted or estimated. …provides the basis for estimation. It is the predictor variable. Independent Variable

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The Coefficient of Correlation… r … Is a measure of strength of the relationship between two variables … It requires interval or ratio-scaled data … It can range from to 1.00 …Values of or 1.00 indicate perfect and strong correlation …Values close to 0.0 indicate weak correlation … Negative values indicate an inverse relationship and positive values indicate a direct relationship

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved X Y Perfect Negative Correlation

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved X Y Perfect Positive Correlation

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved X Y Zero Correlation

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved X Y Example Strong Positive Correlation

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chart

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chart 13.4

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. How Income and Well-Being of Canadians are Related ( ) r = Estimate r

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Formula for Correlation Coefficient Formula for Correlation Coefficient sX SYsX SY x ( x - r =  )( y - ) ( n – 1 )  n xy – ( x )( y )  n x 2 – ( x ) 2  n y 2 – ( y ) 2  y _

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … represented by r 2 … is the proportion of the total variation in the dependent variable (Y) that is explained or accounted for by the variation in the independent variable (X). … it is the square of the coefficient of correlation … it ranges from 0 to 1 … it does not give any information on the direction of the relationship between the variables Coefficient of Determination

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Dan Ireland, the student body president, is concerned about the cost to students of textbooks. He believes there is a relationship between the number of pages in the text and the selling price of the book! To provide insight into the problem he selects a sample of eight (8) textbooks currently on sale in the bookstore. Draw a scatter diagram. Compute the correlation coefficient. Correlation Coefficient

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Book # Pages Price ($) Into to History Basic Algebra Intro. to Psych Intro. to Sociology Bus. Mgmt Intro to Biology Fund. of Jazz Intro. to Nursing Data Correlation Coefficient

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved Price ($) Pages Scatter Diagram of Number of Pages and Selling Price of Text Scatter Diagram

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Scatter Diagram Excel Printout Scatter Diagram Excel Printout

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Book # Pages Price ($) Into to History Basic Algebra Intro. to Psych Intro. to Sociology Bus. Mgmt Intro to Biology Fund. of Jazz Intro. to Nursing Total Correlation Coefficient x y xy x 2 y , ))( ( yx xy n r 2 )( x  2 xn     2 )( y  2 yn 

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Correlation Coefficient x y xy x 2 y  The correlation coefficient is 61.4%. This indicates a moderate association between the variables. 2 )636()606,51(8  )9004((8  )636)(4 900(  r = ))( ( yx xy n r 2 )( x  2 xn     2 )( y  2 yn  )200397(8 

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. H 0 is rejected if t>3.143 or if t< There are 6 df, found by n – 1 = 8 – 2 = 6. Let’s test the hypothesis that there is no correlation in the population. Use a.02 significance level. H 0 : r = 0 H 1 : r  0  = 0.02 State the null and alternate hypotheses Step 1 Select the level of significance Step 2 Identify the test statistic Step 3 State the decision rule Step 4 Compute the test statistic and make a decision Step 5...Step 5

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Compute the test statistic and make a decision Step 5 H 0 is not rejected. We cannot reject the hypothesis...that there is no correlation in the population. The amount of association could be due to chance. continued… Let’s test the hypothesis that there is no correlation in the population. Use a.02 significance level )614(    

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. We use the independent variable (X) to estimate the dependent variable (Y) Regression Analysis … both variables must be at least interval scale … the relationship between the variables is linear … least squares criterion is used to determine the equation… i.e. the term  (y – y) 2 is minimized ^

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Regression Equation …a is the Y-intercept … it is the estimated y value when x = 0 …the least squares principle is used to obtain a and b y = a + bx …y is the average predicted value of y for any x …b is the slope of the line, or the average change in y for each change of one unit in x

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. a y n b x n   b nxyxy nxx    ()()() ()()   22 Regression Equation y = a + bx

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Dan Ireland, the student body president, is concerned about the cost to students of textbooks. He believes there is a relationship between the number of pages in the text and the selling price of the book! To provide insight into the problem he selects a sample of eight (8) textbooks currently on sale in the bookstore. Develop a regression equation that can be used to estimate the selling price based on the number of pages !

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. x y xy x 2 y  8( ) – (4 900)(636) 8( ) – (4 900) 2 = = = = 48.0 = x Suggests …each extra page adds $0.05 to the price of a book; the y-intercept suggests that a book with 0 pages would cost $48. b nxyxyxy nx x    ()()() ()( )    2 2 a y n b x n   y = a + bx

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. …continued Find the estimated selling price of an 800 page book. Substituting 800 for x, The estimated selling price of an 800 page book is $89.14 y = x y = (800) = 89.14

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Using Excel Click on CHART WIZARD See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Click on XY (Scatter) Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. INPUT DATA range Click Next Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Complete INPUTTING of TITLES Click Next Click Finish Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. To “format the axes scales”… Right mouse click on one of the axes Complete INPUTTING of VALUES Click OK See Using Excel Click on Format Axis

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. To remove the Legend on the right side… Right mouse click and Click on Clear Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. To add the Regression Line and equation to this scatter plot… Right mouse click on one of the data points... Scroll down to Add Trendline... Click Using Excel See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … then CLICK on OPTIONS TAB Using Excel Click OK Choose Linear See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Check EQUATION and R-squared Value Click OK Using Excel See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. You can now interpret your results! Using Excel Concerned about the y intercept?

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Alternate Solution Formatting the axes… Resulted in …. a distortion of the y-intercept

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Using Excel See Click on Tools Click on DATA ANALYSIS See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Highlight REGRESSION Using Excel See …Click OK

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. INPUT NEEDS Using Excel See …Click OK

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Using Excel See

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is: y = x The regression equation is: y = x Using Excel

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The Standard Error of Estimate …this measures the scatter, or dispersion, of the observed values around the line of regression The formulas that are used to compute the standard error are:   S e 2 2     n xybyay 2 )( 2    n yy

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The Standard Error of Estimate Find the standard error of estimate for the problem involving the number of pages in a book and the selling price  28 )200,397( )636(48606,51    x y xy x 2 y  Previously:  S e 2 2     n xyxybyay

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Assumptions Underlying Linear Regression  For each value of x, there is a group of y values, and these y values are normally distributed  The means of these normal distributions of y values all lie on the straight line of regression  The standard deviations of these normal distributions are equal  The y values are statistically independent. This means that in the selection of a sample the y values chosen for a particular x value do not depend on the y values for any other x values

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Confidence Interval The confidence interval for the mean value of y for a given value of x is given by:  8 )4 900( ) (    )408.10(  Previously: x y xy x 2 y  )( )(     n x n e y 0 t α/2(n-2)  S xx 0  x

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Prediction Interval The prediction interval for an individual value of y for a given value of x is given by: )408.10(  8 )4 900( ) (     )( )(     n x n e y 0 t α/2(n-2)  S xx 0  x Previously: x y xy x 2 y 

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The estimated selling price for a book with 800 pages is $89.14 The standard error of estimate is $10.41 The 95 percent confidence interval for all books with 800 pages is $ $15.31 This means the limits are between $73.83 and $ The 95 percent prediction interval for a particular book with 800 pages is $ $29.72 The means the limits are between $59.42 and $ Summarizing the Results These results appear in the following MINITAB output.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is Price = Pages Predictor Coef SE Coef T P Constant Pages S = R-Sq = 37.7% R-Sq(adj) = 27.3% Analysis of Variance Source DF SS MS F P Regression Residual Error Total Predicted Values for New Observations New Obs Fit SE Fit 95.0% CI 95.0% PI (73.82,104.46) (59.41,118.88) Regression Analysis: Price versus Pages

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. EXCEL output: Price vs. Pages

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Test your learning … Click on… Online Learning Centre for quizzes extra content data sets searchable glossary access to Statistics Canada’s E-Stat data …and much more!

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. This completes Chapter 13