14 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
Pengujian Parameter Regresi Pertemuan 26 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Objectives (BPS chapter 24)
Analysis of Economic Data
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Note 14 of 5E Statistics with Economics and Business Applications Chapter 12 Multiple Regression Analysis A brief exposition.
Linear Regression and Correlation
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 5 月 4 日 第十二週:複迴歸.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Twelve Multiple Regression and Correlation Analysis GOALS When.
Simple Linear Regression Analysis
Multiple Regression and Correlation Analysis
SIMPLE LINEAR REGRESSION
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Simple Linear Regression Analysis
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Lecture 5 Correlation and Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Hypothesis Testing in Linear Regression Analysis
Regression Method.
Chapter 12 Multiple Regression and Model Building.
Multiple Linear Regression and Correlation Analysis
Multiple Regression Analysis
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #9 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
CHAPTER 14 MULTIPLE REGRESSION
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
14- 1 Chapter Fourteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Multiple Regression
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Chapter 12 Simple Linear Regression.
Business Research Methods
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Multiple Linear Regression and Correlation Analysis Chapter 14.
Chapter 20 Linear and Multiple Regression
Inference for Least Squares Lines
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
John Loucks St. Edward’s University . SLIDES . BY.
Chapter 13 Simple Linear Regression
Quantitative Methods Simple Regression.
Multiple Regression Chapter 14.
Chapter Fourteen McGraw-Hill/Irwin
Chapter Thirteen McGraw-Hill/Irwin
Presentation transcript:

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. When you have completed this chapter, you will be able to: Understand the importance of an appropriate model specification and multiple regression analysis Comprehend the nature and technique of multiple regression models and the concept of partial regression coefficients. Use the estimation techniques for multiple regression models Conduct an analysis of variance of an estimated model

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Identify the problems raised, and the remedies thereof, by the presence of multicollinearity in the data sets Draw inferences about the assumed (true) model though a joint test of hypothesis (F test) on the coefficients of all variables Draw inferences about the importance of the independent variables through tests of hypothesis (t-tests) 5. Explain the goodness of fit of an estimated model. Identify the problems raised, and the remedies thereof, by the presence of outliers/influential observations in the data sets 9.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Comprehend the concept of partial correlations and its importance in multiple regression analysis Write a research report on an investigation using multiple regression analysis Use some simple remedial measures in the presence of violations of the model assumptions Identify the violation of model assumptions, including linearity, homoscedasticity, autocorrelation, and normality through simple diagnosic procedures.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved Apply some advanced diagnostic checks and remedies in multiple regression analysis Use qualitative variables, as well as their interactions with other independent variables through a joint test of hypothesis 14. Draw inferences about the importance of a subset of the importance in multiple regression analysis

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. For two independent variables, the general form of the multiple regression equation is: x 1 and x 2 are the independent variables. a is the y-intercept. Multiple Regression Analysis b1 is the net change in y for each unit change in x1 holding x2 constant. It is called …a partial regression coefficient, …a net regression coefficient, or …just a regression coefficient. yabxbx  112 2

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The general multiple regression with k independent variables is given by: The least squares criterion is used to develop this equation. Because determining b 1, b 2, etc. is very tedious, a software package such as Excel or MINITAB is recommended. Multiple Regression Analysis yabxbx  bx  k k ...

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … is measured in the same units as the dependent variable Multiple Standard Error of Estimate … is a measure of the effectiveness of the regression equation … i t is difficult to determine what is a large value and what is a small value of the standard error!

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Multiple Regression and Correlation Assumptions Multiple Regression and Correlation Assumptions … the independent variables and the dependent variables have a linear relationship … the dependent variable must be continuous and at least interval-scale … the variation in (y - y) or residual must be the same for all values of y. When this is the case, we say the difference exhibits homoscedasticity … the residuals should follow the normal distribution with mean of 0 … successive values of the dependent variable must be uncorrelated

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … reports the variation in the dependent variable … the variation is divided into two components: a. … the Explained Variation is that accounted for by the set of independent variable b. … the Unexplained or Random Variation is not accounted for by the independent variables The AVOVA Table

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. A correlation matrix is used to show all possible simple correlation coefficients among the variables … i t shows how strongly each independent variable is correlated with the dependent variable. …t he matrix is useful for locating correlated independent variables. Correlation Matrix

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Global Test The global test is used to investigate whether any of the independent variables have significant coefficients. 0 equal s allNot : 1  H0...: 210  H k  The hypotheses are: … continued

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The test statistic is the … F distribution with k (number of independent variables) and n-(k+1) degrees of freedom, where n is the sample size Global Test … continued

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Test for Individual Variables This test is… used to determine which independent variables have nonzero regression coefficients … the variables that have zero regression coefficients are usually dropped from the analysis … the test statistic is the t distribution with n-(k+1) degrees of freedom.

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. A market researcher for Super Dollar Super Markets is studying the yearly amount families of four or more spend on food. Three independent variables are thought to be related to yearly food expenditures (Food). Those variables are: … total family income (Income) in $00, … size of family (Size), and … whether the family has children in college (College)

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … gender … the part is acceptable or unacceptable … the voter will or will not vote for the incumbent … continued Note: … the following regarding the regression equation … the variable college is called a dummy or indicator variable. (It can take only one of two possible outcomes, i.e. a child is a college student or not) Other examples of dummy variables include… We usually code one value of the dummy variable as “1” and the other “0”

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … continued FamilyFoodIncomeSizeStudent

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Use a computer software package, such as MINITAB or Excel, to develop a correlation matrix. From the analysis provided by MINITAB, write out the regression equation: … continued What food expenditure would you estimate for a family of 4, with no college students, and an income of $50,000 (which is input as 500)?

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is Food = Income Size Student Predictor Coef SE Coef T P Constant Income Size Student S = R-Sq = 80.4% R-Sq(adj) = 73.1% Analysis of Variance Source DF SS MS F P Regression Residual Error Total … continued y = x x x 3

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is Food = Income Size Student Predictor Coef SE Coef T P Constant Income Size Student S=572.7 R-Sq = 80.4% R-Sq(adj) = 73.1% Analysis of Variance Source DF SS MS F P Regression Residual Error Total From the regression output we note: The coefficient of determination is 80.4 percent. From the regression output we note: The coefficient of determination is 80.4 percent. … continued This means that more than 80 percent of the variation in the amount spent on food is accounted for by the variables income, family size, and student

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is Food = Income Size Student Predictor Coef SE Coef T P Constant Income Size Student S=572.7 R-Sq = 80.4% R-Sq(adj) = 73.1% Analysis of Variance Source DF SS MS F P Regression Residual Error Total An additional family member will increase the amount spent per year on food by $748 … continued A family with a college student will spend $565 more per year on food than those without a college student

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … continued The correlation matrix is as follows: Food Income Size Income Size Student The strongest correlation between the dependent variable (Food) and an independent variable is between family size and amount spent on food. None of the correlations among the independent variables should cause problems. All are between –.70 and.70

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Find the estimated food expenditure for a family of 4 with a $500 (that is $50,000) income and no college student. … continued The regression equation is… Food = Income Size Student y = (500) + 748(4) + 565(0) = $4,491

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Decision: H 0 is rejected. Not all the regression coefficients are zero The regression equation is Food = Income Size Student Predictor Coef SE Coef T P Constant Income Size Student S=572.7 R-Sq = 80.4% R-Sq(adj) = 73.1% Analysis of Variance Source DF SS MS F P Regression Residual Error Total Conduct a global test of hypothesis to determine if any of the regression coefficients are not zero H 0 is rejected if F>4.07 …from the MINITAB output, the computed value of F is … continued   oneleastat: 1 H  0: 3210 H

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. … Using the 5% level of significance, reject H 0 if the P-value<.05 … continued From the MINITAB output, the only significant variable is FAMILY (family size) using the P-values (The other variables can be omitted from the model) H 12 0:  H 02 0:  The regression equation is Food = Income Size Student Predictor Coef SECoef T P Constant Income Size Student Conduct an individual test to determine which coefficients are not zero (This is the hypothesis for the independent variable family size)

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. The regression equation is Food = Income Size Student Predictor Coef SECoef Income Size Student S=572.7 R-Sq = 80.4% R-Sq(adj) = 73.1% Regression Analysis: Food versus Size The regression equation is Food = Size Predictor Coef SECoef Constant Size S = R-Sq = 76.8% R-Sq(adj) = 74.4% … continued Rerun the analysis using only the significant independent family size …and the R-square term was reduced by only 3.6 percent...the coefficient of determination is 76.8 percent (the two independent variables are dropped) …the new regression equation is: y = X 2

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Residuals should be approximately normally distributed Analysis Residuals … histograms and stem-and-leaf charts are useful in checking this requirement A residual is the difference between the actual value of y and the predicted value y … a plot of the residuals and their corresponding y values is used for showing that there are no trends or patterns in the residuals

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Analysis Residuals y Residuals Plot

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Analysis Residuals Frequency Residuals Histograms

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Test your learning … Click on… Online Learning Centre for quizzes extra content data sets searchable glossary access to Statistics Canada’s E-Stat data …and much more!

Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. This completes Chapter 14