LESSON 5 Multiple Regression Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-1.

Slides:



Advertisements
Similar presentations
Dummy variables Hill et al chapter 9. Parameters that vary between observations Assumption MR1 The parameters are the same for all observations. k= the.
Advertisements

Class 18 – Thursday, Nov. 11 Omitted Variables Bias
Multiple Regression and Model Building
Random Assignment Experiments
Using Indicator Variables
FIN822 Li11 Binary independent and dependent variables.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Qualitative Variables and
Multiple Regression [ Cross-Sectional Data ]
The Use and Interpretation of the Constant Term
Copyright © 2006 Pearson Addison-Wesley. All rights reserved. Lecture 12: Joint Hypothesis Tests (Chapter 9.1–9.3, 9.5–9.6)
1 Module II Lecture 3: Misspecification: Non-linearities Graduate School Quantitative Research Methods Gwilym Pryce.
Chapter 13 Multiple Regression
Linear Regression.
Chapter 12 Simple Regression
Interaksi Dalam Regresi (Lanjutan) Pertemuan 25 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Regresi dan Rancangan Faktorial Pertemuan 23 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter 12 Multiple Regression
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Functional Form, Scaling and Use of Dummy Variables Copyright © 2006 Pearson Addison-Wesley. All rights reserved
Economics 20 - Prof. Anderson
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Multiple Regression Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-1.
Topic 3: Regression.
Review.
Ch. 14: The Multiple Regression Model building
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 5. Dummy Variables.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
Multiple Regression 2 Sociology 5811 Lecture 23 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Multiple Linear Regression Analysis
Hypothesis Testing in Linear Regression Analysis
  What is Econometrics? Econometrics literally means “economic measurement” It is the quantitative measurement and analysis of actual economic and business.
Chapter 12 Multiple Regression and Model Building.
Multiple Regression. In the previous section, we examined simple regression, which has just one independent variable on the right side of the equation.
1 Research Method Lecture 6 (Ch7) Multiple regression with qualitative variables ©
Lecture 3-3 Summarizing r relationships among variables © 1.
1 Dummy Variables. 2 Topics for This Chapter 1. Intercept Dummy Variables 2. Slope Dummy Variables 3. Different Intercepts & Slopes 4. Testing Qualitative.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2003 Thomson/South-Western Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Chapter 14 Introduction to Multiple Regression
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Chap 14-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 14 Additional Topics in Regression Analysis Statistics for Business.
Multiple Regression 3 Sociology 5811 Lecture 24 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Multiple Linear Regression ● For k>1 number of explanatory variables. e.g.: – Exam grades as function of time devoted to study, as well as SAT scores.
Chapter 13 Multiple Regression
Welcome to Econ 420 Applied Regression Analysis Study Guide Week Seven.
Chapter 4 The Classical Model Copyright © 2011 Pearson Addison-Wesley. All rights reserved. Slides by Niels-Hugo Blunch Washington and Lee University.
9.1 Chapter 9: Dummy Variables A Dummy Variable: is a variable that can take on only 2 possible values: yes, no up, down male, female union member, non-union.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Business Research Methods
Multiple Regression Learning Objectives n Explain the Linear Multiple Regression Model n Interpret Linear Multiple Regression Computer Output n Test.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
Chapter 14 Introduction to Multiple Regression
Using Indicator Variables
Multiple Regression Analysis and Model Building
Multiple Regression Analysis with Qualitative Information
Prepared by Lee Revere and John Large
Chapter 8: DUMMY VARIABLE (D.V.) REGRESSION MODELS
Regression and Categorical Predictors
Chapter 9 Dummy Variables Undergraduated Econometrics Page 1
Financial Econometrics Fin. 505
Presentation transcript:

LESSON 5 Multiple Regression Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-1

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-2 Multiple Regression We know how to regress Y on a constant and a single X variable  1 is the change in Y from a 1-unit change in X

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-3 Multiple Regression (cont.) Usually we will want to include more than one independent variable. How can we extend our procedures to permit multiple X variables?

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-4 Gauss–Markov DGP with Multiple X ’s

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-5 BLUE Estimators Ordinary Least Squares is still BLUE The OLS formula for multiple X ’s requires matrix algebra, but is very similar to the formula for a single X

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-6 BLUE Estimators (cont.) Intuitions from the single variable formulas tend to generalize to multiple variables. We’ll trust the computer to get the formulas right. Let’s focus on interpretation.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-7 Single Variable Regression

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-8 Multiple Regression  1 is the change in Y from a 1-unit change in X 1

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 7-9 Multiple Regression (cont.) How can we interpret  1 now?  1 is the change in Y from a 1-unit change in X 1, holding X 2 …X k FIXED

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Multiple Regression (cont.)

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Multiple Regression (cont.) How do we implement multiple regression with our software?

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth Regress GDP growth from 1960–1985 on – GDP per capita in 1960 (GDP60) – Primary school enrollment in 1960 (PRIM60) – Secondary school enrollment in 1960 (SEC60) – Government spending as a share of GDP (G/Y) – Number of coups per year (REV) – Number of assassinations per year (ASSASSIN) – Measure of Investment Price Distortions (PPI60DEV)

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Hit Table Ext.1.1 A Multiple Regression Model of per Capita GDP Growth.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth (cont.) A 1-unit increase in GDP in 1960 predicts a unit decrease in GDP growth, holding fixed the level of PRIM60, SEC60, G/Y, REV, ASSASSIN, and PPI60DEV.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth (cont.) Before we controlled for other variables, we found a POSITIVE relationship between growth and GDP per capita in After controlling for measures of human capital and political stability, the relationship is negative, in accordance with “catch up” theory.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth (cont.) Countries with high values of GDP per capita in 1960 ALSO had high values of schooling and a low number of coups/assassinations. Part of the relationship between growth and GDP per capita is actually reflecting the influence of schooling and political stability. Holding those other variables constant lets us isolate the effect of just GDP per capita.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth The Growth of GDP from 1960–1985 was higher: 1.The lower starting GDP, and 2.The higher the initial level of human capital. Poor countries tended to “catch up” to richer countries as long as the poor country began with a comparable level of human capital, but not otherwise.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth (cont.) Bigger government consumption is correlated with lower growth; bigger government investment is only weakly correlated with growth. Politically unstable countries tended to have weaker growth. Price distortions are negatively related to growth.

Copyright © 2006 Pearson Addison-Wesley. All rights reserved Example: Growth (cont.) The analysis leaves largely unexplained the very slow growth of Sub-Saharan African countries and Latin American countries.

20 Omitted Variable Bias

21 Omitted variable bias, ctd.

22 Omitted variable bias, ctd.

23 The omitted variable bias formula:

24 Measures of Fit for Multiple Regression

25 S e and RMSE

26 R 2 and

27 R 2 and, ctd.

28 Measures of fit, ctd.

29 The Least Squares Assumptions for Multiple Regression

Testing Hypotheses Copyright © 2006 Pearson Addison-Wesley. All rights reserved t- test – individual test F-test – joint test

Dummy Variables Used to capture qualitative explanatory variables Used to capture any event that has only two possible outcomes e.g. race, gender, geographic region of residence etc.

Use of Intercept Dummy Most common use of dummy variables. Modifies the regression model intercept parameter e.g. Let test the “location”, “location” “location” model of real estate Suppose we take into account location near say a university or golf course

P t = β o + β 1 S t +β 2 D t + ε t St = square footage D = dummy variable to represent if the characteristic is present or not D = 1if property is in a desirable neighborhood 0if not in a desirable neighborhood

Effect of the dummy variable is best seen by examining the E(Pt). If model is specified correctly, E(ε t ) = 0 E(P t ) = ( β o + β 2 ) + β 1 S t when D=1 β o + β 1 S t when D = 0

B2 is the location premium in this case. It is the difference between the Price of a house in a desirable are and one in a not so desirable area, all things held constant The dummy variable is to capture the shift in the intercept as a result of some qualitative variable  Dt is an intercept dummy variable

Dt is treated as any explanatory variable. You can construct a confidence interval for B2 You can test if B2 is significantly different from zero. In such a test, if you accept Ho, then there is no difference between the two categories.

Application of Intercept Dummy Variable Wages = B0 + B1EXP + B2RACE +B3SEX + Et Race = 1 if white 0 if non white Sex = 1 if male 0 if female

WAGES = 40, EXP RACE +1082SEX Mean salary for black female 40, EXP Mean salary for white female 41, EXP +1102

Mean salary for Asian male Mean salary for white male What sucks more, being female or non white?

Determining the # of dummies to use If h categories, then use h-1 dummies Category left out defines reference group If you use h dummies you’d fall into the dummy trap

Slope Dummy Variables Allows for different slope in the relationship Use an interaction variable between the actual variable and a dummy variable e.g. Pt = Bo + B1Sqfootage+B2(Sqfootage*D)+et D= 1 desirable area, 0 otherwise

Captures the effect of location and size on the price of a house E(Pt) = B0 + (B1+B2)Sqfoot if D=1 = BO + B1Sqfoot if D = 0  in the desirable area, price per square foot is b1+b2, and it is b1 in other areas If we believe that a house location affects both the intercept and the slope then the model is

Pt = B0 +B1sqfoot +B2(sqfoot*D) + B3D +et

44 Dummies for Multiple Categories We can use dummy variables to control for something with multiple categories Suppose everyone in your data is either a HS dropout, HS grad only, or college grad To compare HS and college grads to HS dropouts, include 2 dummy variables hsgrad = 1 if HS grad only, 0 otherwise; and colgrad = 1 if college grad, 0 otherwise

45 Multiple Categories (cont) Any categorical variable can be turned into a set of dummy variables Because the base group is represented by the intercept, if there are n categories there should be n – 1 dummy variables If there are a lot of categories, it may make sense to group some together Example: top 10 ranking, 11 – 25, etc.

46 Interactions Among Dummies Interacting dummy variables is like subdividing the group Example: have dummies for male, as well as hsgrad and colgrad Add male*hsgrad and male*colgrad, for a total of 5 dummy variables –> 6 categories Base group is female HS dropouts hsgrad is for female HS grads, colgrad is for female college grads The interactions reflect male HS grads and male college grads

47 More on Dummy Interactions Formally, the model is y =  0 +  1 male +  2 hsgrad +  3 colgrad +  4 male*hsgrad +  5 male*colgrad +  1 x + u, then, for example: If male = 0 and hsgrad = 0 and colgrad = 0 y =  0 +  1 x + u If male = 0 and hsgrad = 1 and colgrad = 0 y =  0 +  2 hsgrad +  1 x + u If male = 1 and hsgrad = 0 and colgrad = 1 y =  0 +  1 male +  3 colgrad +  5 male*colgrad +  1 x + u

48 Other Interactions with Dummies Can also consider interacting a dummy variable, d, with a continuous variable, x y =  0 +  1 d +  1 x +  2 d*x + u If d = 0, then y =  0 +  1 x + u If d = 1, then y = (  0 +  1 ) + (  1 +  2 ) x + u This is interpreted as a change in the slope

49 y x y =  0 +  1 x y = (  0 +  0 ) + (  1 +  1 ) x Example of  0 > 0 and  1 < 0 d = 1 d = 0