Outline When X’s are Dummy variables –EXAMPLE 1: USED CARS –EXAMPLE 2: RESTAURANT LOCATION Modeling a quadratic relationship –Restaurant Example.

Slides:



Advertisements
Similar presentations
Multiple Regression. Introduction In this chapter, we extend the simple linear regression model. Any number of independent variables is now allowed. We.
Advertisements

1 Chapter 9 Supplement Model Building. 2 Introduction Introduction Regression analysis is one of the most commonly used techniques in statistics. It is.
Lecture 17: Tues., March 16 Inference for simple linear regression (Ch ) R2 statistic (Ch ) Association is not causation (Ch ) Next.
Example 1 To predict the asking price of a used Chevrolet Camaro, the following data were collected on the car’s age and mileage. Data is stored in CAMARO1.
Examining Relationships Chapter 3. Least Squares Regression Line If the data in a scatterplot appears to be linear, we often like to model the data by.
Fundamentals of Real Estate Lecture 13 Spring, 2003 Copyright © Joseph A. Petry
Correlation and regression Dr. Ghada Abo-Zaid
1 Multiple Regression Chapter Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent.
1 Simple Linear Regression and Correlation The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES Assessing the model –T-tests –R-square.
1 Multiple Regression Model Error Term Assumptions –Example 1: Locating a motor inn Goodness of Fit (R-square) Validity of estimates (t-stats & F-stats)
Lecture 9- Chapter 19 Multiple regression Introduction In this chapter we extend the simple linear regression model and allow for any number of.
Lecture 26 Model Building (Chapters ) HW6 due Wednesday, April 23 rd by 5 p.m. Problem 3(d): Use JMP to calculate the prediction interval rather.
Simple Linear Regression
Class 19: Tuesday, Nov. 16 Specially Constructed Explanatory Variables.
1 Multiple Regression Chapter Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent.
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Lecture 22 Multiple Regression (Sections )
1 Multiple Regression. 2 Introduction In this chapter we extend the simple linear regression model, and allow for any number of independent variables.
Lecture 26 Omitted Variable Bias formula revisited Specially constructed variables –Interaction variables –Polynomial terms for curvature –Dummy variables.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
1 Lecture Eleven Probability Models. 2 Outline Bayesian Probability Duration Models.
Lecture 27 Polynomial Terms for Curvature Categorical Variables.
1 Simple Linear Regression and Correlation Chapter 17.
Lecture 23 Multiple Regression (Sections )
1 Lecture Eleven Probability Models. 2 Outline Bayesian Probability Duration Models.
Stat 112: Lecture 18 Notes Chapter 7.1: Using and Interpreting Indicator Variables. Visualizing polynomial regressions in multiple regression Review Problem.
Stat 112: Lecture 13 Notes Finish Chapter 5: –Review Predictions in Log-Log Transformation. –Polynomials and Transformations in Multiple Regression Start.
1 4. Multiple Regression I ECON 251 Research Methods.
Multiple Regression and Correlation Analysis
Lecture 22 – Thurs., Nov. 25 Nominal explanatory variables (Chapter 9.3) Inference for multiple regression (Chapter )
Lecture 17 Interaction Plots Simple Linear Regression (Chapter ) Homework 4 due Friday. JMP instructions for question are actually for.
Lecture 21 – Thurs., Nov. 20 Review of Interpreting Coefficients and Prediction in Multiple Regression Strategy for Data Analysis and Graphics (Chapters.
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Correlation and Linear Regression
Active Learning Lecture Slides
Multiple Regression Analysis
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
Economics 173 Business Statistics Lecture 22 Fall, 2001© Professor J. Petry
Economics 173 Business Statistics Lecture 20 Fall, 2001© Professor J. Petry
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Copyright © 2009 Cengage Learning 18.1 Chapter 20 Model Building.
Lecture 27 Chapter 20.3: Nominal Variables HW6 due by 5 p.m. Wednesday Office hour today after class. Extra office hour Wednesday from Final Exam:
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Economics 173 Business Statistics Lecture 19 Fall, 2001© Professor J. Petry
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
1 MGT 511: Hypothesis Testing and Regression Lecture 8: Framework for Multiple Regression Analysis K. Sudhir Yale SOM-EMBA.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Chapter 8: Simple Linear Regression Yang Zhenlin.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice- Hall, Inc. Chap 14-1 Business Statistics: A Decision-Making Approach 6 th Edition.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Go to Table of Content Correlation Go to Table of Content Mr.V.K Malhotra, the marketing manager of SP pickles pvt ltd was wondering about the reasons.
WELCOME TO THE PRESENTATION ON LINEAR REGRESSION ANALYSIS & CORRELATION (BI-VARIATE) ANALYSIS.
Statistics for Business and Economics Module 2: Regression and time series analysis Spring 2010 Lecture 6: Multiple Regression Model Building Priyantha.
1 Chapter 20 Model Building Introduction Regression analysis is one of the most commonly used techniques in statistics. It is considered powerful.
1 Assessment and Interpretation: MBA Program Admission Policy The dean of a large university wants to raise the admission standards to the popular MBA.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Lecture Eleven Probability Models.
Chapter 14 Introduction to Multiple Regression
Inference for Least Squares Lines
Linear Regression.
Examining Relationships
Multiple Regression Analysis and Model Building
Keller: Stats for Mgmt & Econ, 7th Ed Linear Regression Analysis
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
a.) What score represents the 90th percentiles?
Presentation transcript:

Outline When X’s are Dummy variables –EXAMPLE 1: USED CARS –EXAMPLE 2: RESTAURANT LOCATION Modeling a quadratic relationship –Restaurant Example

Qualitative Independent Variables In many real-life situations one or more independent variables are qualitative. Including qualitative variables in a regression analysis model is done via indicator variables. An indicator variable (I) can assume one out of two values, “zero” or “one”. 1 if a first condition out of two is met 0 if a second condition out of two is met I= 1 if data were collected before if data were collected after if the temperature was below 50 o 0 if the temperature was 50 o or more 1 if a degree earned is in Finance 0 if a degree earned is not in Finance

Example 1 The dealer believes that color is a variable that affects a car’s price. Three color categories are considered: –White –Silver –Other colors Note: Color is a qualitative variable. I 1 = 1 if the color is white 0 if the color is not white I 2 = 1 if the color is silver 0 if the color is not silver And what about “Other colors”? Set I 1 = 0 and I 2 = 0

Solution –the proposed model is y =  0 +  1 (Odometer) +  2 I 1 +  3 I 2 +  –The data To represent a qualitative variable that has m possible categories (levels), we must create m-1 indicator variables. White car Other color Silver color

There is insufficient evidence to infer that a white color car and a car of “Other color” sell for a different auction price. There is sufficient evidence to infer that a silver color car sells for a larger price than a car of the “Other color” category.

Price = (Odometer) (0) + 148(1) Price = (Odometer) (1) + 148(0) Price = (Odometer) (0) + 148(0) From Excel we get the regression equation PRICE = (ODOMETER)+45.2I I 2 For one additional mile the auction price decreases by 2.78 cents. Odometer Price A white car sells, on the average, for $45.2 more than a car of the “Other color” category (Odometer) (Odometer) (Odometer) A silver color car sells, on the average, for $148 more than a car of the “Other color” category The equation for a car of the “Other color” category. The equation for a car of white color The equation for a car of silver color

Example 2 Location for a new restaurant –A fast food restaurant chain tries to identify new locations that are likely to be profitable. –The primary market for such restaurants is middle-income adults and their children (between the age 5 and 12). –Which regression model should be proposed to predict the profitability of new locations?

Solution –The dependent variable will be Gross Revenue –There are quadratic relationships between Revenue and each predictor variable. Why? Members of middle-class families are more likely to visit a fast food family than members of poor or wealthy families. Income Low Middle High Revenue Families with very young or older kids will not visit the restaurant as frequent as families with mid-range ages of kids. age Revenue Low Middle High Revenue =  0 +  1 Income +  2 Age +  3 Income 2 +  4 Age 2 +  5 ( Income )( Age ) +  Revenue =  0 +  1 Income +  2 Age +  3 Income 2 +  4 Age 2 +  5 ( Income )( Age ) + 

Example 2 –To verify the validity of the model proposed in example 19.1, 25 areas with fast food restaurants were randomly selected. –Data collected included (see Xm19-02.xls): Previous year’s annual gross sales. Mean annual household income. Mean age of children

The model provides a good fit

The model can be used to make predictions. However, do not interpret the coefficients or test them. Multicollinearity is a problem!! In excel: Tools > Data Analysis > Correlation

Regression results of the modified model Multicolinearity is not a problem anymore