LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis

Slides:



Advertisements
Similar presentations
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Advertisements

1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Linear Regression with One Regression
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
Pengujian Parameter Koefisien Korelasi Pertemuan 04 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter Topics Types of Regression Models
Regression Diagnostics - I
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
Chapter 11: Inference for Distributions
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Business Statistics - QBM117 Statistical inference for regression.
Introduction to Regression Analysis, Chapter 13,
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Chapter 13: Inference in Regression
BPS - 3rd Ed. Chapter 211 Inference for Regression.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide Simple Linear Regression Part A n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n.
© 2003 Prentice-Hall, Inc.Chap 13-1 Basic Business Statistics (9 th Edition) Chapter 13 Simple Linear Regression.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Design and Data Analysis in Psychology I Salvador Chacón Moscoso Susana Sanduvete Chaves School of Psychology Dpt. Experimental Psychology 1.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Week 5Slide #1 Adjusted R 2, Residuals, and Review Adjusted R 2 Residual Analysis Stata Regression Output revisited –The Overall Model –Analyzing Residuals.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
732G21/732G28/732A35 Lecture 3. Properties of the model errors ε 4. ε are assumed to be normally distributed
LESSON 4.1. MULTIPLE LINEAR REGRESSION 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
REGRESSION AND CORRELATION SIMPLE LINEAR REGRESSION 10.2 SCATTER DIAGRAM 10.3 GRAPHICAL METHOD FOR DETERMINING REGRESSION 10.4 LEAST SQUARE METHOD.
BPS - 5th Ed. Chapter 231 Inference for Regression.
3.3. SIMPLE LINEAR REGRESSION: DUMMY VARIABLES 1 Design and Data Analysis in Psychology II Salvador Chacón Moscoso Susana Sanduvete Chaves.
Design and Data Analysis in Psychology II
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Statistics for Managers using Microsoft Excel 3rd Edition
Virtual COMSATS Inferential Statistics Lecture-26
Inferences for Regression
Inference for Regression
Chapter 12: Regression Diagnostics
Slides by JOHN LOUCKS St. Edward’s University.
Regression model Y represents a value of the response variable.
Diagnostics and Transformation for SLR
Lecture 14 Review of Lecture 13 What we’ll talk about today?
BA 275 Quantitative Business Methods
Residuals The residuals are estimate of the error
Motivational Examples Three Types of Unusual Observations
Design and Data Analysis in Psychology I
Unit 3 – Linear regression
Basic Practice of Statistics - 3rd Edition Inference for Regression
Three Measures of Influence
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Regression Assumptions
Chapter 14 Inference for Regression
Inference for Regression Slope
Adequacy of Linear Regression Models
Adequacy of Linear Regression Models
Making Inferences about Slopes
Inferences for Regression
Diagnostics and Transformation for SLR
3.2. SIMPLE LINEAR REGRESSION
Regression and Correlation of Data
Regression Assumptions
Introductory Statistics
Presentation transcript:

LESSON 4.4. MULTIPLE LINEAR REGRESSION. Residual Analysis Design and Data Analysis in Psychology II Susana Sanduvete Chaves Salvador Chacón Moscoso

Type of residuals Residuals (ordinary): difference between the observation (Y) and prediction( ). The in residue ei is a random variable has the following properties : Under the assumption of normality is obtained:

Type of residuals Standardized residuals: errors after being established (zero mean and variance close to 1). Helps to distinguish huge residuals.

Type of residuals Outlier: one that has a large residue. Subjective criteria. The most common is to consider an outlier when its standardized residual is bigger than 2. The larger the standardized residual, more unusual is the observation.

Type of residuals Outliers are important because their inclusion or not in the sample can differ greatly estimated regression line. It is necessary to study direct scores with high standardized residuals. There are many causes that prompt the existence of outliers. Some of them are: The observed point is an error (in measurement, in the transcription of data, etc.), but the fitted model is adequate. The observed point is correct but the model fit is not, due to possible different reasons: Because the relationship between the two variables is linear in a certain range but it is not linear to the point where it is observed. There is a strong heteroscedasticity with some observations that are separated from the tag. There is a classification variable that has not been taken into account.

Type of residuals Studentized Residual: It is calculated the same way as standardized, but calculating the residual variance (sR) from the whole sample, except the residue of the observation under study. Thus, dependence between numerator and denominator disappears.

Type of residuals If n is high, the standardized and studentized residuals acquire close values. Under the normality hypothesis, it is verified that ti follows a t distribution with n- 3 degrees of freedom.

Type of residuals Eliminated residuals: Difference between the value observed in the answer and the prediction, when the whole sample is used, except the measurement that is being studied. If the measurement has a huge influence in the calculation of the regression line, the ordinary and eliminated residuals are different; in other cases, both values will be similar.

Graphics of residuals The Box-Plot and the histogram of standardized residuals provide information about their distribution. If the sample size is low, instead the histogram of residuals the dot-plot or the stem and leaf plot are used; their interpretations are the same.

Graphics of residuals It implies the existence of a hidden variable.

Graphics of residuals Dot-plot of a group of residuals.

Graphics of residuals-predictions There is no problem detected.  

Graphics of residuals-predictions The linear fitness is not adequate.

Graphics of residuals-predictions Linear fitness wrongly calculated.

Graphics of residuals-predictions There is heteroscedasticity.

Graphics of residuals-predictions Non-linear fitness and heteroscedasticity.

Graphics of residuals-predictions There are some outliers.