Statistical Analysis SC504/HS927 Spring Term 2008 Session 6: Week 22: 29 th February OLS (3): multiple regression and dummy coding.

Slides:



Advertisements
Similar presentations
Dummy variables Hill et al chapter 9. Parameters that vary between observations Assumption MR1 The parameters are the same for all observations. k= the.
Advertisements

Quantitative Methods II
Continued Psy 524 Ainsworth
Applied Econometrics Second edition
Multivariate Data/Statistical Analysis SC504/HS927 Spring Term 2008 Week 18: Relationships between variables: simple ordinary least squares (OLS) regression.
Statistical Analysis SC504/HS927 Spring Term 2008
Qualitative predictor variables
USING DUMMY VARIABLES IN REGRESSION MODELS. Qualitative Variables Qualitative variables can be introduced into regression models using dummy variables.
The Regression Equation  A predicted value on the DV in the bi-variate case is found with the following formula: Ŷ = a + B (X1)
1 AAEC 4302 ADVANCED STATISTICAL METHODS IN AGRICULTURAL RESEARCH Chapter 7.3 Dummy Variables.
FIN822 Li11 Binary independent and dependent variables.
Qualitative Variables and
1 Multiple Regression Response, Y (numerical) Explanatory variables, X 1, X 2, …X k (numerical) New explanatory variables can be created from existing.
Regression With Categorical Variables. Overview Regression with Categorical Predictors Logistic Regression.
Econ 140 Lecture 151 Multiple Regression Applications Lecture 15.
Summary of previous lecture Introduction of dummy variable
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Multinomial Logistic Regression
January 6, afternoon session 1 Statistics Micro Mini Multiple Regression January 5-9, 2008 Beth Ayers.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Statistical Analysis SC504/HS927 Spring Term 2008 Session 5: Week 20: 15 th February OLS (2): assessing goodness of fit, extension to multiple regression.
Economics 20 - Prof. Anderson
Econ 140 Lecture 181 Multiple Regression Applications III Lecture 18.
FIN357 Li1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u Chapter 7. Dummy Variables.
Econ 140 Lecture 171 Multiple Regression Applications II &III Lecture 17.
Project One Fall II a: plot IIb: Survive = a + b*female + c*fare + e i. Is regression statistically significant?, yes, F-stat & probability ii.
Statistical Analysis SC504/HS927 Spring Term 2008 Session 7: Week 23: 7 th March 2008 Complex independent variables and regression diagnostics.
An Introduction to Logistic Regression
1Prof. Dr. Rainer Stachuletz Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u 5. Dummy Variables.
CHAPTER 6 ECONOMETRICS x x x x x Dummy Variable Regression Models Dummy, or indicator, variables take on values of 0 or 1 to indicate the presence or absence.
1 Javier Aparicio División de Estudios Políticos, CIDE Primavera Regresión.
Multiple Regression continued… STAT E-150 Statistical Methods.
Multiple Regression Analysis Multivariate Analysis.
1 MF-852 Financial Econometrics Lecture 9 Dummy Variables, Functional Form, Trends, and Tests for Structural Change Roy J. Epstein Fall 2003.
Lecture 3-3 Summarizing r relationships among variables © 1.
1 1 Slide © 2016 Cengage Learning. All Rights Reserved. The equation that describes how the dependent variable y is related to the independent variables.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Statistics and Econometrics for Business II Fall 2014 Instructor: Maksym Obrizan Lecture notes III # 2. Advanced topics in OLS regression # 3. Working.
1 G Lect 9M Example Nominal (Categorical) Variables as Explanatory Factors Coding of Nominal Explanatory Variables G Multiple Regression.
Multiple Regression Lab Chapter Topics Multiple Linear Regression Effects Levels of Measurement Dummy Variables 2.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Dummy Variable Regression Models chapter ten.
Regression analysis and multiple regression: Here’s the beef* *Graphic kindly provided by Microsoft.
Categorical Independent Variables STA302 Fall 2013.
Multiple Regression  Similar to simple regression, but with more than one independent variable R 2 has same interpretation R 2 has same interpretation.
9.1 Chapter 9: Dummy Variables A Dummy Variable: is a variable that can take on only 2 possible values: yes, no up, down male, female union member, non-union.
Dummy Variables; Multiple Regression July 21, 2008 Ivan Katchanovski, Ph.D. POL 242Y-Y.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
A first order model with one binary and one quantitative predictor variable.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Week 13 Application! DPPG 8504 Sara Spowart, PhD, MPA
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Categorical Independent Variables STA302 Fall 2015.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
DISCRIMINANT ANALYSIS. Discriminant Analysis  Discriminant analysis builds a predictive model for group membership. The model is composed of a discriminant.
Special Topics in Multiple Regression Analysis
Review Multiple Regression Multiple-Category Dummy Variables
Statistical Analysis of the Randomized Block Design
Multiple Regression Analysis with Qualitative Information
Supplement on Multiple Regression Interpretation
Multiple Regression Example
Multiple Regression Analysis with Qualitative Information
Applied Economic Analysis
Modeling with Dichotomous Dependent Variables
Regression and Categorical Predictors
General Linear Regression
Presentation transcript:

Statistical Analysis SC504/HS927 Spring Term 2008 Session 6: Week 22: 29 th February OLS (3): multiple regression and dummy coding

2 Example from Suicide data:

3 E.g.: Using the data set alcohol

4 Hierarchical Regression

5 ‘Dummy’ variables  variables given the values 1 or 0  typically to indicate ‘yes’ or ‘no’  e.g. 1=‘female’ 0=‘not female’ (i.e. male)  Dummy coding will change the value of the constant (intercept) but NOT the gradient (b)  NB: create one dummy variable even though there are two possible genders

6 Working out Predicted Y Value  You need to work out 2 regression equations  If Females = 1, then: Y = a + b 1 (age) + b 2 (1)  If Males = 0, then: Y = a + b 1 (age) + b 2 (0)

7 dummy variables with multiple categories  suppose you want to investigate the effect of housing tenure  you have a variable coded:  1=owns outright  2=owns with a mortgage  3=part owns, part rents  4=rents  5=rent free  NB: the coding is arbitrary. You could have 5=owns outright. 3= owns with a mortgage, 1=rents, 2=rent free, 4=part owns, part rents

8 Multiple categories (cont)  If the categorical variable has z categories, create z-1 dummy variables e.g.  d 1 =1 if owns with a mortgage, 0 otherwise  d 2 =1 if part owns, part rents, 0 otherwise  d 3 =1 if rents, 0 otherwise  d 4 =1 if rent free, 0 otherwise  The omitted category is known as the ‘reference’ or ‘baseline’ category  each case will have a maximum of one dummy coded 1, outright owners will have them all coded 0

9 Choice of reference category  Any category can be the reference  Choose for ease of and meaningful interpretation  e.g.  the ‘norm’ or most common category  the majority ethnic group in a study of the consequences of being from a minority ethnic group  unemployed people if you are studying the effects of unemployment