Feb 21, 2006Lecture 6Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing.

Slides:



Advertisements
Similar presentations
Correlation and regression
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Inference for Regression
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Linear Regression with One Regression
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Lecture 24: Thurs., April 8th
March 7, 2006Lecture 8bSlide #1 Class Analysis Review Predict expected temperature change (c4_34_tc), using the following independent variables: –Age (c5_3_age)
Simple Linear Regression Analysis
Regression Diagnostics Checking Assumptions and Data.
BCOR 1020 Business Statistics
Business Statistics - QBM117 Statistical inference for regression.
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Linear Regression 2 Sociology 5811 Lecture 21 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Simple Linear Regression Analysis
Relationships Among Variables
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Simple Linear Regression Models
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
What does it mean? The variance of the error term is not constant
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 1 Slide Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Economics 173 Business Statistics Lecture 20 Fall, 2001© Professor J. Petry
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
Week 5Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing Residuals.
Week 7Review SessionSlide #1 Review Regression Model Assumptions Calculus –Derivation, critical points (Excel, plots, etc.) –Deriving OLS estimators (b.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
April 4, 2006Lecture 11Slide #1 Diagnostics for Multivariate Regression Analysis Homeworks Assumptions of OLS Revisited Visual Diagnostic Techniques Non-linearity.
Correlation & Regression Analysis
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Review Session Linear Regression. Correlation Pearson’s r –Measures the strength and type of a relationship between the x and y variables –Ranges from.
Robust Regression. Regression Methods  We are going to look at three approaches to robust regression:  Regression with robust standard errors  Regression.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Stat 112 Notes 14 Assessing the assumptions of the multiple regression model and remedies when assumptions are not met (Chapter 6).
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Regression Analysis AGEC 784.
Inference for Least Squares Lines
I271B Quantitative Methods
BA 275 Quantitative Business Methods
CHAPTER 29: Multiple Regression*
Product moment correlation
Regression Forecasting and Model Building
Chapter 13 Additional Topics in Regression Analysis
Essentials of Statistics for Business and Economics (8e)
Presentation transcript:

Feb 21, 2006Lecture 6Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing Residuals Review for Exam 2

Feb 21, 2006Lecture 6Slide #2 Homework Review Use variables to make an “egalitarian” index for your dependent variable (Y) Use p98_ideo (ideology) as the independent variable (X) to predict egalitarianism. Fully interpret the results

Feb 21, 2006Lecture 6Slide #3 Adjusted R 2 : An Alternative “Goodness of Fit” Measure Recall that R 2 is calculated as: Hypothetically, as K approaches n, R 2 approaches one (why?) – “degrees of freedom” Adjusted R 2 compensates for that tendency “explained sum of squares”“total sum of squares”

Feb 21, 2006Lecture 6Slide #4 Calculating Adjusted R 2 The bigger the sample size (n), the smaller the adjustment The more complex the model (the bigger K is), the larger the adjustment The bigger R 2 is, the smaller the adjustment

Feb 21, 2006Lecture 6Slide #5 Residual Analysis: Trouble Shooting Conceptual use of residuals –e, or what the model can’t explain Visual Diagnostics –Ideal: a “Sneeze plot” –Diagnostics using Residual Plots: Checking for heteroscedasticity Checking for non-linearity Checking for outliers Saving and Analyzing Residuals in Stata

Feb 21, 2006Lecture 6Slide #6 Review: Assumptions Necessary for Estimating Linear Models 1.Errors have identical distributions Zero mean, same variance, across the range of X 2.Errors are independent of X and other  i 3.Errors are normally distributed   i =0 X

Feb 21, 2006Lecture 6Slide #7 The Ideal: Sneeze Splatter e Predicted Y Problems: It is possible to “over-interpret” residual plots; it is also possible to miss patterns when there are large numbers of observations

Feb 21, 2006Lecture 6Slide #8 Heteroscedasticity e Predicted Y Problem: Standard errors are not constant; hypothesis tests invalid

Feb 21, 2006Lecture 6Slide #9 Non-Linearity e Predicted Y Problem: Biased estimated coefficients, inefficient model

Feb 21, 2006Lecture 6Slide #10 Checking for Outliers e Predicted Y Problem: Under-specified model; measurement error Residuals for model using all data Possible Outliers Residuals for model with outliers deleted

Feb 21, 2006Lecture 6Slide #11 Stata Regression Model: Regressing “Militant” onto “Egal” Y is index of 81, 82 (rev’d), 84-86: 1=low militant; 7=high militant X is “Egal” -- an index of =reject egal; 7=accept egal

Feb 21, 2006Lecture 6Slide #12 Regression Output

Feb 21, 2006Lecture 6Slide #13 Regression Plot

Feb 21, 2006Lecture 6Slide #14 Predicted Case Confidence Bands

Feb 21, 2006Lecture 6Slide #15 Residual Plot Probable Outliers

Feb 21, 2006Lecture 6Slide #16 Examination of Residuals gsort e (or you can use “-e”) list resp_id egal militant yhat e in 1/5 Use the respondent ID number to find the relevant observation in the data set

Feb 21, 2006Lecture 6Slide #17 Residuals v. Predicted Values Using an “ocular test,” neither non-linearity nor heteroscedasticity are obvious here. But should we trust our eyeballs?

Feb 21, 2006Lecture 6Slide #18 Formal Test for Non-linearity: Omitted Variables Tests whether adding 2nd, 3rd and 4th powers of X will improve the fit of the model: Y=b 0 +b 1 X+b 2 X 2 +b 3 X 3 +b 4 X 4 +e

Feb 21, 2006Lecture 6Slide #19 Formal Tests for Heteroscedasticity Tests to see whether the squared standardized residuals are linearly related to the predicted value of Y: std(e 2 )=b 0 +b 1 (Predicted Y)

Feb 21, 2006Lecture 6Slide #20 Case-wise Influence Analysis The Leverage versus Squared Residual Plot

Feb 21, 2006Lecture 6Slide #21 What to Do? Nonlinearity –Polynomial regression: try X and X 2 –Variable transformation: logged variables –Use non-OLS regression Heteroscedasticity –Re-specify model Omitted variables? Use non-OLS regression (WLS) Influential and Deviant Cases –Evaluate the cases –Run with controls (multivariate model) –Omit cases (last option)

Feb 21, 2006Lecture 6Slide #22 Review Basic statistics and functions Calculus –Derivatives and critical values Deriving b 1 and b 0 –Intuitive meaning of formulas Interpreting OLS bivariate regression output Basic regression diagnostics