Bivariate Regression (Part 1) Chapter1212 Visual Displays and Correlation Analysis Bivariate Regression Regression Terminology Ordinary Least Squares Formulas.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Hypothesis Testing Steps in Hypothesis Testing:
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 12 Simple Linear Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
The Simple Regression Model
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
BCOR 1020 Business Statistics
BCOR 1020 Business Statistics Lecture 24 – April 17, 2008.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Simple Linear Regression and Correlation
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Chapter 7 Forecasting with Simple Regression
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Correlation and Linear Regression
Correlation and Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Correlation and Linear Regression
CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.
Simple Linear Regression Models
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Correlation and Regression
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Chapter 5: Regression Analysis Part 1: Simple Linear Regression.
Copyright © 2005 by the McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Managerial Economics Thomas Maurice eighth edition Chapter 4.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Correlation and Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Presentation transcript:

Bivariate Regression (Part 1) Chapter1212 Visual Displays and Correlation Analysis Bivariate Regression Regression Terminology Ordinary Least Squares Formulas Tests for Significance McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.

12A-2 Visual Displays and Correlation Analysis Begin the analysis of bivariate data (i.e., two variables) with a scatter plot.Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed data pair (x i, y i ) as a dot on an X/Y grid - indicates visually the strength of the relationship between the two variablesA scatter plot - displays each observed data pair (x i, y i ) as a dot on an X/Y grid - indicates visually the strength of the relationship between the two variables Visual Displays Visual Displays

12A-3 Visual Displays and Correlation Analysis Visual Displays Visual Displays Figure 12.1

12A-4 Visual Displays and Correlation Analysis The sample correlation coefficient (r) measures the degree of linearity in the relationship between X and Y.The sample correlation coefficient (r) measures the degree of linearity in the relationship between X and Y. -1 < r < +1 r = 0 indicates no linear relationshipr = 0 indicates no linear relationship In Excel, use =CORREL(array1,array2), where array1 is the range for X and array2 is the range for Y.In Excel, use =CORREL(array1,array2), where array1 is the range for X and array2 is the range for Y. Correlation Analysis Strong negative relationship Strong positive relationship

12A-5 Visual Displays and Correlation Analysis Correlation Analysis

12A-6 Visual Displays and Correlation Analysis Correlation Analysis Strong Positive Correlation Weak Positive Correlation

12A-7 Visual Displays and Correlation Analysis Correlation Analysis Weak Negative Correlation Strong Negative Correlation

12A-8 Visual Displays and Correlation Analysis Correlation Analysis No Correlation Nonlinear Relation

12A-9 Visual Displays and Correlation Analysis r is an estimate of the population correlation coefficient  (rho).r is an estimate of the population correlation coefficient  (rho). To test the hypothesis H 0 :  = 0, the test statistic is:To test the hypothesis H 0 :  = 0, the test statistic is: The critical value t  is obtained from Appendix D using = n – 2 degrees of freedom for any .The critical value t  is obtained from Appendix D using = n – 2 degrees of freedom for any . Find the p-value using Excel’s function =TDIST(t,deg_freedom,tails) or MINITAB.Find the p-value using Excel’s function =TDIST(t,deg_freedom,tails) or MINITAB. Tests for Significance calc

12A-10 Visual Displays and Correlation Analysis Equivalently, you can calculate the critical value for the correlation coefficient usingEquivalently, you can calculate the critical value for the correlation coefficient using This method gives a benchmark for the correlation coefficient.This method gives a benchmark for the correlation coefficient. However, there is no p-value and is inflexible if you change your mind about .However, there is no p-value and is inflexible if you change your mind about . Tests for Significance critical

12A-11 Visual Displays and Correlation Analysis Step 1: State the Hypotheses Determine whether you are using a one or two- tailed test and the level of significance (  ). H 0 :  = 0 H 1 :  ≠ 0Step 1: State the Hypotheses Determine whether you are using a one or two- tailed test and the level of significance (  ). H 0 :  = 0 H 1 :  ≠ 0 Step 2: Specify the Decision Rule For degrees of freedom = n -2, look up the critical value t  in Appendix D, then calculateStep 2: Specify the Decision Rule For degrees of freedom = n -2, look up the critical value t  in Appendix D, then calculate Steps in Testing if  = 0

12A-12 Visual Displays and Correlation Analysis Step 3: Calculate the Test StatisticStep 3: Calculate the Test Statistic Step 4: Make the Decision If the sample correlation coefficient r exceeds the critical value r , then reject H 0.Step 4: Make the Decision If the sample correlation coefficient r exceeds the critical value r , then reject H 0. If using the t statistic method, reject H 0 if t > t  or if the p-value t  or if the p-value < . Steps in Testing if  = 0 calc

12A-13 Visual Displays and Correlation Analysis A quick test for significance of a correlation at  =.05 is |r| > 2/ nA quick test for significance of a correlation at  =.05 is |r| > 2/ n Quick Rule for Significance Table 12.1

12A-14 Visual Displays and Correlation Analysis Autocorrelation is a special type of correlation analysis useful in business for time series data.Autocorrelation is a special type of correlation analysis useful in business for time series data. The autocorrelation coefficient is the simple correlation between y t and y t-k where k is any lagThe autocorrelation coefficient is the simple correlation between y t and y t-k where k is any lag Autocorrelation

12A-15 Bivariate Regression Bivariate Regression analyzes the relationship between two variables.Bivariate Regression analyzes the relationship between two variables. It specifies one dependent (response) variable and one independent (predictor) variable.It specifies one dependent (response) variable and one independent (predictor) variable. This hypothesized relationship may be linear, quadratic, or whatever.This hypothesized relationship may be linear, quadratic, or whatever. What is Bivariate Regression? What is Bivariate Regression?

12A-16 Bivariate Regression Model Form Model Form Figure 12.6

12A-17 Regression Terminology Unknown parameters are  0 Intercept  1 SlopeUnknown parameters are  0 Intercept  1 Slope The assumed model for a linear relationship isThe assumed model for a linear relationship is y i =  0 +  1 x i +  i for all observations (i = 1, 2, …, n) The error term is not observable, is assumed normally distributed with mean of 0 and standard deviation .The error term is not observable, is assumed normally distributed with mean of 0 and standard deviation . Models and Parameters Models and Parameters

12A-18 Regression Terminology The fitted model used to predict the expected value of Y for a given value of X isThe fitted model used to predict the expected value of Y for a given value of X is y i = b 0 + b 1 x i Models and Parameters Models and Parameters The fitted coefficients areThe fitted coefficients are b 0 the estimated intercept b 1 the estimated slope Residual is e i = y i - y i.Residual is e i = y i - y i. Residuals may be used to estimate , the standard deviation of the errors.Residuals may be used to estimate , the standard deviation of the errors. ^ ^

12A-19 Regression Terminology Step 1: - Highlight the data columns. - Click on the Chart Wizard and choose Scatter Plot - In the completed graph, click once on the points in the scatter plot to select the data - Right-click and choose Add Trend line - Choose Options and check Display EquationStep 1: - Highlight the data columns. - Click on the Chart Wizard and choose Scatter Plot - In the completed graph, click once on the points in the scatter plot to select the data - Right-click and choose Add Trend line - Choose Options and check Display Equation Fitting a Regression on a Scatter Plot in Excel Fitting a Regression on a Scatter Plot in Excel

12A-20 Regression Terminology Fitting a Regression on a Scatter Plot in Excel Fitting a Regression on a Scatter Plot in Excel Figure 12.8

12A-21 Regression Terminology

12A-22 Ordinary Least Squares Formulas The ordinary least squares method (OLS) estimates the slope and intercept of the regression line so that the residuals are small.The ordinary least squares method (OLS) estimates the slope and intercept of the regression line so that the residuals are small. The sum of the residuals = 0The sum of the residuals = 0 The sum of the squared residuals is SSEThe sum of the squared residuals is SSE Slope and Intercept Slope and Intercept

12A-23 Ordinary Least Squares Formulas The OLS estimator for the slope is:The OLS estimator for the slope is: The OLS estimator for the intercept is:The OLS estimator for the intercept is: Slope and Intercept Slope and Intercept or

12A-24 Ordinary Least Squares Formulas We want to explain the total variation in Y around its mean (SST for Total Sums of Squares)We want to explain the total variation in Y around its mean (SST for Total Sums of Squares) The regression sum of squares (SSR) is the explained variation in YThe regression sum of squares (SSR) is the explained variation in Y Assessing Fit Assessing Fit

12A-25 Ordinary Least Squares Formulas The error sum of squares (SSE) is the unexplained variation in YThe error sum of squares (SSE) is the unexplained variation in Y If the fit is good, SSE will be relatively small compared to SST.If the fit is good, SSE will be relatively small compared to SST. A perfect fit is indicated by an SSE = 0.A perfect fit is indicated by an SSE = 0. The magnitude of SSE depends on n and on the units of measurement.The magnitude of SSE depends on n and on the units of measurement. Assessing Fit Assessing Fit

12A-26 Ordinary Least Squares Formulas Coefficient of Determination Coefficient of Determination 0 < R 2 < 1 Often expressed as a percent, an R 2 = 1 (i.e., 100%) indicates perfect fit.Often expressed as a percent, an R 2 = 1 (i.e., 100%) indicates perfect fit. In a bivariate regression, R 2 = (r) 2In a bivariate regression, R 2 = (r) 2 R 2 is a measure of relative fit based on a comparison of SSR and SST.R 2 is a measure of relative fit based on a comparison of SSR and SST.

12A-27 Tests for Significance The standard error (s yx ) is an overall measure of model fit.The standard error (s yx ) is an overall measure of model fit. Standard Error of Regression Standard Error of Regression If the fitted model’s predictions are perfect (SSE = 0), then s yx = 0. Thus, a small s yx indicates a better fit.If the fitted model’s predictions are perfect (SSE = 0), then s yx = 0. Thus, a small s yx indicates a better fit. Used to construct confidence intervals.Used to construct confidence intervals. Magnitude of s yx depends on the units of measurement of Y and on data magnitude.Magnitude of s yx depends on the units of measurement of Y and on data magnitude.

12A-28 Tests for Significance Standard error of the slope:Standard error of the slope: Confidence Intervals for Slope and Intercept Confidence Intervals for Slope and Intercept Standard error of the intercept:Standard error of the intercept:

12A-29 Tests for Significance Confidence interval for the true slope:Confidence interval for the true slope: Confidence Intervals for Slope and Intercept Confidence Intervals for Slope and Intercept Confidence interval for the true intercept:Confidence interval for the true intercept:

12A-30 Tests for Significance If  1 = 0, then X cannot influence Y and the regression model collapses to a constant  0 plus random error.If  1 = 0, then X cannot influence Y and the regression model collapses to a constant  0 plus random error. The hypotheses to be tested are:The hypotheses to be tested are: Hypothesis Tests Hypothesis Tests

12A-31 Tests for Significance A t test is used with = n – 2 degrees of freedom The test statistics for the slope and intercept are:A t test is used with = n – 2 degrees of freedom The test statistics for the slope and intercept are: Hypothesis Tests Hypothesis Tests t n-2 is obtained from Appendix D or Excel for a given .t n-2 is obtained from Appendix D or Excel for a given . Reject H 0 if t > t  or if p-value t  or if p-value < . Slope: Intercept: calc

12A-32 Tests for Significance Using Excel Using Excel

12A-33 Tests for Significance Using MegaStat Using MegaStat

12A-34 Tests for Significance Using MINITAB Using MINITAB

12A-35 Tests for Significance Using MINITAB Using MINITAB

Applied Statistics in Business & Economics End of Chapter 12A 12A-36