Regression Models Residuals and Diagnosing the Quality of a Model.

Slides:



Advertisements
Similar presentations
Chapter 9: Simple Regression Continued
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
BA 275 Quantitative Business Methods
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2013.
Simple Linear Regression and Correlation
Objectives (BPS chapter 24)
Classical Regression III
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
The Simple Regression Model
Chapter Topics Types of Regression Models
Chapter 11 Multiple Regression.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Introduction to Regression Analysis, Chapter 13,
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Simple Linear Regression Analysis
Lecture 5 Correlation and Regression
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Introduction to Linear Regression and Correlation Analysis
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Chapter 11 Simple Regression
Analysis of Variance: Some Review and Some New Ideas
Correlation and Regression
© 1998, Geoff Kuenning Linear Regression Models What is a (good) model? Estimating model parameters Allocating variation Confidence intervals for regressions.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Byron Gangnes Econ 427 lecture 3 slides. Byron Gangnes A scatterplot.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Multivariate Models Analysis of Variance and Regression Using Dummy Variables.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Regression & Correlation. Review: Types of Variables & Steps in Analysis.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Linear Regression Models Andy Wang CIS Computer Systems Performance Analysis.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Chapter 13 Simple Linear Regression
Chapter 14 Introduction to Multiple Regression
Chapter 20 Linear and Multiple Regression
Inference for Least Squares Lines
Introduction to Regression Analysis
Chapter 13 Simple Linear Regression
BIVARIATE REGRESSION AND CORRELATION
Linear Regression Models
Analysis of Variance and Regression Using Dummy Variables
Residuals and Diagnosing the Quality of a Model
BA 275 Quantitative Business Methods
Analysis of Variance: Some Review and Some New Ideas
PENGOLAHAN DAN PENYAJIAN
Multivariate Models Regression.
Presentation transcript:

Regression Models Residuals and Diagnosing the Quality of a Model

Visualizing Regression Models

Collinearity

An Omitted Variable?

Models A Model: A statement of the relationship between a phenomenon to be explained and the factors, or variables, which explain it. Steps in the Process of Quantitative Analysis: –Specification of the model –Estimation of the model –Evaluation of the model

Thus far… We’ve discussed… –The specification of a model, –The estimation of a model and how to read and interpret the statistics we’ve produced: coefficients, t tests, F tests, R Square Now we need to evaluate the model for problems and further elaboration.

We need to evaluate The variation in the predicted values and the difference between the Yi and the predicted Y. That difference is called a “residual.” We can analyze the residuals to see how good the equation is, and whether there are problems with the model that need correction or improvement.

More statistics… Standard Error of the Estimate: The square root of the average squared error of prediction is used as a measure of the accuracy of prediction. (p. 281 and 340 in the text). For the population: For the sample:

Standard Error of the Estimate Used to calculate a confidence interval around the predicted y. As a rule of thumb, multiply the SEE by 2 and add and subtract from the predicted Ys to determine a measure of the variability of the prediction at a 95% confidence level. At the mean of the independent variable: the standard error of the prediction = SEE/(square root of n).

Hypothetical Example 55 predicted value is X Y residual is 6.2

Example from last week…. Newval = a + b1(Newsize) + b2(Families) + b3(Eastside) + b4(South) Dep Var: NEWVAL N: 467 Multiple R: 0.75 Squared multiple R: 0.56 Adjusted squared multiple R: 0.55 Standard error of estimate: Effect Coefficient Std Error Std Coef Tolerance t P(2 Tail) CONSTANT NEWSIZE FAMILIES EASTSIDE SOUTH

To understand the principles, let’s simplify…. We return to the bivariate case: House value is a function of the size of the building. Regression models assume that the errors of prediction are homoscedastic, not autocorrelated, normally distributed, and not correlated with the independent variables. That is, the error term should be noise. Now we ask: –1. how accurate our prediction is, –2. what are the characteristics of the residuals or the error term.

Model of Housing Values and Building Size Dep Var: NEWVAL N: 467 Multiple R: Squared multiple R: Adjusted squared multiple R: Standard error of estimate: Effect Coefficient Std Error Std Coef Tolerance t P(2 Tail) CONSTANT NEWSIZE Analysis of Variance Source Sum-of-Squares df Mean-Square F-ratio P Regression Residual

Scatterplot of Newsize and Newval

Scatterplot, cont.

95% Confidence Intervals for Mean Predictions of Y (left) and Individual Predictions of Y (right)

Hypothetical Example 55 predicted value is X Y residual is 6.2

Analysis of Residuals ESTIMATE NEWVAL RESIDUAL N of cases Minimum Maximum Range Sum Median Mean % CI Upper % CI Lower Std. Error Standard Dev Variance C.V E+14 Skewness(G1) SE Skewness Kurtosis(G2) SE Kurtosis

Visualizing Regression Models

Collinearity

An Omitted Variable?