DUMMY VARIABLES BY HARUNA ISSAHAKU Haruna Issahaku.

Slides:



Advertisements
Similar presentations
The Regression Equation  A predicted value on the DV in the bi-variate case is found with the following formula: Ŷ = a + B (X1)
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
The Gender Gap in Earning: Methods and Evidence Chapter 10.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Qualitative Variables and
Econ 140 Lecture 151 Multiple Regression Applications Lecture 15.
Summary of previous lecture Introduction of dummy variable
Differences in earnings and employment opportunities may arise even among equally skilled workers employed in the same job simply because of the workers’
Multiple Regression Involves the use of more than one independent variable. Multivariate analysis involves more than one dependent variable - OMS 633 Adding.
7 Dummy Variables Thus far, we have only considered variables with a QUANTITATIVE MEANING -ie: dollars, population, utility, etc. In this chapter we will.
Econ 140 Lecture 171 Multiple Regression Applications II &III Lecture 17.
Nemours Biomedical Research Statistics April 2, 2009 Tim Bunnell, Ph.D. & Jobayer Hossain, Ph.D. Nemours Bioinformatics Core Facility.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
7.1 Lecture #7 Studenmund(2006) Chapter 7 Objective: Applications of Dummy Independent Variables.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Multiple Regression 2 Sociology 5811 Lecture 23 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Dummy variable Up till now we have dealt exclusively with the variables which can measures in quantitative terms. But sometimes variables which we consider.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 25 Categorical Explanatory Variables.
Chapter 12 Multiple Regression and Model Building.
Multiple Regression. In the previous section, we examined simple regression, which has just one independent variable on the right side of the equation.
1 Research Method Lecture 6 (Ch7) Multiple regression with qualitative variables ©
Moderation & Mediation
Lecture 3-3 Summarizing r relationships among variables © 1.
Chapter 14 Introduction to Multiple Regression
CHAPTER 9 DUMMY VARIABLE REGRESSION MODELS
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Statistics and Econometrics for Business II Fall 2014 Instructor: Maksym Obrizan Lecture notes III # 2. Advanced topics in OLS regression # 3. Working.
1 Prices of Antique Clocks Antique clocks are sold at auction. We wish to investigate the relationship between the age of the clock and the auction price.
Chap 14-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics.
Chapter 13 Multiple Regression
Lecture 25 Summary of previous lecture Specification Bias Detection methods.
Overview of Regression Analysis. Conditional Mean We all know what a mean or average is. E.g. The mean annual earnings for year old working males.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Developing a Hiring System Measuring Applicant Qualifications or Statistics Can Be Your Friend!
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
EKONOMETRIKA 1 FE_UB By Al Muiz almuiz 2009.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
1/25 Introduction to Econometrics. 2/25 Econometrics Econometrics – „economic measurement“ „May be defined as the quantitative analysis of actual economic.
Lecturer: Ing. Martina Hanová, PhD. Business Modeling.
Chapter 6: Multiple Regression – Additional Topics
Heteroscedasticity Chapter 8
Chapter 15 Multiple Regression Model Building
QUALITATIVE EXPLANATORY VARIABLES REGRESSION MODELS
Chapter 14 Introduction to Multiple Regression
Multiple Regression Analysis with Qualitative Information
Ch. 2: The Simple Regression Model
Multiple Regression Analysis: OLS Asymptotics
Multiple Regression Analysis and Model Building
Essentials of Modern Business Statistics (7e)
Multiple Regression Analysis with Qualitative Information
Chapter 6: Multiple Regression – Additional Topics
Ch. 2: The Simple Regression Model
QUALITATIVE EXPLANATORY VARIABLES REGRESSION MODELS
Multiple Regression Analysis with Qualitative Information
Chapter 6: Multiple Regression – Additional Topics
Multiple Regression Analysis: Further Issues
Soc 3306a Lecture 11: Multivariate 4
Chapter 6: Multiple Regression – Additional Topics
Heteroskedasticity.
Chapter 8: DUMMY VARIABLE (D.V.) REGRESSION MODELS
The Simple Regression Model
Econometrics I Professor William Greene Stern School of Business
Multiple Regression Analysis with Qualitative Information
Regression and Categorical Predictors
Financial Econometrics Fin. 505
Presentation transcript:

DUMMY VARIABLES BY HARUNA ISSAHAKU Haruna Issahaku

Definition Dummy variables (DV) indicate the presence or absence of a ‘quality’ or an attribute such as male or female, black or white, north or south, etc. A dummy variable will take the value 1 or 0 according to whether or not the condition is present or absent for a particular observation. Haruna Issahaku

1 indicates the presence of the attribute 0 indicates the absence of the attribute Eg. 1 may represent female and 0 for male DV are sometimes called categorical variables, or qualitative variables or indicator variables ANOVA MODELS: Used to test the statistical significance of the relationship between a quantitative regressand and qualitative or dummy regressors Haruna Issahaku

ANCOVA MODELS: regression models containing an admixture of quantitative and dummy regressors. – They provide a method of statistically controlling for the effects of quantitative regressors called covariates or control variables in a model that includes both quantitative and qualitative regressors Haruna Issahaku

A single dummy independent variable Given the wage discrimination model Wage=hourly wage D=1 for females and 0 for males is the difference in hourly wage between males and females given the same level of education and the same error term u. is the mean hourly wage for males (the base or benchmark group) Haruna Issahaku

Why a single category is included in a single DV. equation Using two DV would introduce perfect colinearity because for example Female + male=1 Ie. Male is a perfect colinear function of female Introducing DV for both male and female is the simplest example of the so called dummy variable trap. It arises when too many DV describe a given number of groups Haruna Issahaku

Overcoming the DV trap One way of overcoming the DV trap is to drop the intercept and include the two categories Eg. Haruna Issahaku

Some interpretations Given Wage = female educ. Se= (0.12) (0.26) (0.049) N=526R-sqd= males receive a mean wage of Ghc2.91 per hour -on the average females receive Ghc1.81 less than their male counterparts holding education constant. -the average hourly wage of female is =Ghc1.10 Haruna Issahaku

Interpreting coefficients on DV when an explanatory v. is in log form Given the ff. model on the effects of training grants on hours of training by firms hrsemp=hours of training per employee at the firm level Grant is a DV =1 if a firm received a job training grant and 0 otherwise Haruna Issahaku

Sales=annual sales Employ=total no. of employees of the firm Grant coefficient of means controlling for sales and employment, firms that receive grants trained each worker on the average hours more The coefficient on log sales is small and insignificant Haruna Issahaku

The coefficient on log employ of means If a firm is 1% larger it trains its workers by (ie. 6.07/100) hours less Haruna Issahaku

Interpreting DV when dependent v. is in log form 1. given the housing price equation Colonial is a DV. =1 if house is of a colonial style and 0 otherwise Raw interpretation: holding other factors constant the difference in log(price) of a house of a colonial style is Correct interpretation: a house of the colonial style is predicted to sell for about 5.4% more, holding other factors constant Ie. When the dependent v. is in log form the coefficient on the DV multiplied by 100 is interpreted as percentage difference in the dependent variable holding other factors constant. Haruna Issahaku

2. Given a log hourly wage equation Using log(wage) as the dependent v. and adding quadratics in experience The coefficient on female implies for the same levels of experience and education women on the average earn 29.7% (ie. 100*0.297) less than men More accurate interpretation: a more accurate interpretation is obtained by using the formula Haruna Issahaku

100*[exp.(B)-1] Where B is the coefficient on the dummy variable Thus, 100*[exp.(-0.297)-1]=25.7 More accurately, on the average a woman’s wage is 25.7% below a comparable man’s wage Haruna Issahaku

Median hourly wage for males is calculated by taking the antilog of the intercept exp(0.417)=Ghc1.517 The median hourly wage of females Exp[0.417+(-0.279)]=Ghc1.148 Haruna Issahaku

ANOVA models with 2 DV. Model: hourly wage in relation to marital status and region of residence Y=hourly wage (Ghc) =marital status 1=married 0=otherwise =region of residence 1=south 0=otherwise Haruna Issahaku

Which is the benchmark group? Unmarried non-south residents All comparisms are made wrt this group as ff: -Unmarried non-south residents receive a mean hourly wage of GhC8.81 -Those married on average receive Ghc1.10 more than the unmarried non-south residents Haruna Issahaku

The mean hourly wage for the married is Ghc 9.91 (ie ) -the hourly wage of those from the south is lower by Ghc 1.67 The mean hourly wage of those from the south is Ghc7.14 (ie ) All are statistically significant. Haruna Issahaku

Interaction effects using DV. Given Female is a dummy=1 for females; 0=otherwise Married is a dummy 1=married 0=otherwise The differential effect of being a married female is Married females earn 35.12% more than single men with a mode of Ghc1.35 Haruna Issahaku

What will be the earnings for – 1. married men? – 2. single men? – 3. unmarried females? Haruna Issahaku

Uses of DV in applied econometric research Read Akoutsoyannis pp As proxies to qualitative variables 2. As proxies for numerical factors 3. Measuring shifts of a function over time 4. Measuring the change in parameters over time 5. As proxies for the dependent variable 6. For seasonal adjustment of time series Haruna Issahaku

Indicator vs effects coding Indicator coding: where the reference category is assigned zero across the set of DV. Effects coding: where the reference category is assigned a value of negative 1 across the set of DV. In indicator coding coefficients represent group deviations on the dependent variable from the reference group While in effect coding coefficients become group deviations on the dependent variable from the mean of the dependent variable across all groups Haruna Issahaku