1 Quadratic Model In order to account for curvature in the relationship between an explanatory and a response variable, one often adds the square of the.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Inference for Regression
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Chapter 8 Linear Regression © 2010 Pearson Education 1.
1 Home Gas Consumption Interaction? Should there be a different slope for the relationship between Gas and Temp after insulation than before insulation?
1 Multiple Regression Response, Y (numerical) Explanatory variables, X 1, X 2, …X k (numerical) New explanatory variables can be created from existing.
LECTURE 3 Introduction to Linear Regression and Correlation Analysis
BA 555 Practical Business Analysis
Stat 512 – Lecture 18 Multiple Regression (Ch. 11)
Stat 512 – Lecture 17 Inference for Regression (9.5, 9.6)
Ch. 14: The Multiple Regression Model building
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Chapter 12 Section 1 Inference for Linear Regression.
Simple Linear Regression Analysis
Correlation & Regression
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 14 Introduction to Multiple Regression Sections 1, 2, 3, 4, 6.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Lecture 22 Dustin Lueker.  The sample mean of the difference scores is an estimator for the difference between the population means  We can now use.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
12a - 1 © 2000 Prentice-Hall, Inc. Statistics Multiple Regression and Model Building Chapter 12 part I.
Soc 3306a Multiple Regression Testing a Model and Interpreting Coefficients.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
1 Prices of Antique Clocks Antique clocks are sold at auction. We wish to investigate the relationship between the age of the clock and the auction price.
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Multiple Regression Analysis Regression analysis with two or more independent variables. Leads to an improvement.
BPA CSUB Prof. Yong Choi. Midwest Distribution 1. Create scatter plot Find out whether there is a linear relationship pattern or not Easy and simple using.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
REGRESSION REVISITED. PATTERNS IN SCATTER PLOTS OR LINE GRAPHS Pattern Pattern Strength Strength Regression Line Regression Line Linear Linear y = mx.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 11: Linear Regression E370, Spring From Simple Regression to Multiple Regression.
Yandell – Econ 216 Chap 15-1 Chapter 15 Multiple Regression Model Building.
Stats Methods at IC Lecture 3: Regression.
The simple linear regression model and parameter estimation
CHAPTER 3 Describing Relationships
Basic Estimation Techniques
Political Science 30: Political Inquiry
Chapter 12: Regression Diagnostics
Correlation and Simple Linear Regression
Basic Estimation Techniques
Stats Club Marnie Brennan
Multiple Regression Models
Correlation and Simple Linear Regression
STA 291 Summer 2008 Lecture 23 Dustin Lueker.
Simple Linear Regression and Correlation
Indicator Variables Response: Highway MPG
Algebra Review The equation of a straight line y = mx + b
STA 291 Spring 2008 Lecture 23 Dustin Lueker.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

1 Quadratic Model In order to account for curvature in the relationship between an explanatory and a response variable, one often adds the square of the explanatory variable to the simple linear model.

2 Quadratic Model Conditions on Independent Identically distributed Normally distributed with common standard deviation,

3 Example Response, Y: Population of the U.S. (millions) Explanatory, X: Year the census was taken.

4 Quadratic Model Predicted Population = –24.390*Year *Year 2 We cannot interpret the estimated slope coefficients because we cannot change Year by 1 while holding Year 2 constant.

5 Model Utility F= , P-value< The small P-value indicates that the quadratic model relating population to Year and Year 2 is statistically significant (useful).

6 Statistical Significance Year (added to Year 2 ) t=–39.87, P-value< The P-value is small, therefore the addition of Year is statistically significant.

7 Statistical Significance Year 2 (added to Year) t=42.10, P-value< The P-value is small, therefore the addition of Year 2 is statistically significant.

8 Quadratic Model R 2 =0.999 or 99.9% of the variation in population can be explained by the quadratic model. RMSE=3.03 million

9 Summary - Quadratic The model is useful. Each term is a statistically significant addition. 99.9% of the variation in population is explained by the quadratic model.

10 Prediction Year 2010 Predicted Population = – (2010) *(2010) 2 = million Not bad as the actually figure in 2010 was million.

11 Prediction for 2020 Year 2020 Predicted Population = – (2020) *(2020) 2 = million

12 Prediction Year 1800 Predicted Population = – (1800) *(1800) 2 = million Very close to the actual value of million

13

14

15 Plot of Residuals The residuals wiggle around the zero line. Hard to say whether this is a pattern or not. The residuals for 1940 and 1950 stick out. The quadratic model over predicts for these years.

16 Possible explanation census followed the Great Depression when immigration was very low census followed WW-II when birth rates and immigration were low.

17 Can we do better? Could try higher order polynomial terms like Year 3 or Year 4. Year 3 is statistically significant (P- value = ) in cubic model. Year 4 is not statistically significant in a quartic model.

18 Quadratic Model There is still the issue of trying to interpret the coefficients in the quadratic model. Again, creating a new explanatory variable, Year 2, has introduced multicollinearity into the quadratic model.

19

20 Correlation Year and Year 2 Correlation: r = For the values that Year takes on, there is an extremely strong positive linear correlation with Year 2.

21 Centering Center Year by subtracting off the mean before constructing the squared term in the quadratic model. Mean year is 1900.

22 Quadratic Model Predicted Population = – *Year *(Year – 1900) 2 Note that the estimated slope for year is exactly the same as in the simple linear model.

23

24 Correlation Year and (Year – 1900) 2 Correlation: r = – For the values that Year takes on, there is no linear correlation with (Year – 1900) 2.

25 Centering Centering has completely removed the multicollinearity resulting from the inclusion of the quadratic term in the quadratic model.

26 Quadratic Model Predicted Population = *(Year – 1900) *(Year – 1900) 2 The predicted population in 1900 is million.

27 Quadratic Model Predicted Population = *(Year – 1900) *(Year – 1900) 2 For each additional year, the population goes up, on average, million.

28 Quadratic Model Predicted Population = *(Year – 1900) *(Year – 1900) 2 In addition to the average change per year, there is a bigger adjustment to this rate of change the further away you are from 1900.

29 Ten Year Change 1880: Pred = million 1890: Pred = million Difference of million Average increase of million per year.

30 Ten Year Change 1980: Pred = million 1990: Pred = million Difference of million Average increase of million per year.