Adequacy of Linear Regression Models

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Lesson 10: Linear Regression and Correlation
Kin 304 Regression Linear Regression Least Sum of Squares
Forecasting Using the Simple Linear Regression Model and Correlation
Regresi Linear Sederhana Pertemuan 01 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Correlation and Regression
Sociology 601 Class 17: October 28, 2009 Review (linear regression) –new terms and concepts –assumptions –reading regression computer outputs Correlation.
Chapter 12 Simple Linear Regression
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Chapter Topics Types of Regression Models
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
Simple Linear Regression Analysis
Introduction to Probability and Statistics Linear Regression and Correlation.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation & Regression
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Relationship of two variables
Chapter 6 & 7 Linear Regression & Correlation
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Bivariate Distributions Overview. I. Exploring Data Describing patterns and departures from patterns (20%-30%) Exploring analysis of data makes use of.
Regression Regression relationship = trend + scatter
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Creating a Residual Plot and Investigating the Correlation Coefficient.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Chapter 11: Linear Regression and Correlation Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Transforming Numerical Methods Education for the STEM Undergraduate Regression
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Topic 13: Quantitative-Quantitative Association Part 1:
Lecture 11: Simple Linear Regression
Simple Linear Regression
Regression Analysis AGEC 784.
Part 5 - Chapter
AP Statistics Chapter 14 Section 1.
Kin 304 Regression Linear Regression Least Sum of Squares
10.3 Coefficient of Determination and Standard Error of the Estimate
BPK 304W Regression Linear Regression Least Sum of Squares
AP Stats: 3.3 Least-Squares Regression Line
Linear Regression.
PENGOLAHAN DAN PENYAJIAN
Linear regression Fitting a straight line to observations.
REGRESI.
Chapter 14 Inference for Regression
Correlation and Regression
Adequacy of Linear Regression Models
Correlation and Regression
Linear Regression and Correlation
Adequacy of Linear Regression Models
Topic 8 Correlation and Regression Analysis
Linear Regression and Correlation
Adequacy of Linear Regression Models
Algebra Review The equation of a straight line y = mx + b
3.2. SIMPLE LINEAR REGRESSION
Regression and Correlation of Data
Correlation & Trend Lines
Presentation transcript:

Adequacy of Linear Regression Models http://numericalmethods.eng.usf.edu Transforming Numerical Methods Education for STEM Undergraduates 5/3/2019 http://numericalmethods.eng.usf.edu 1

Data

Therm exp coeff vs temperature α 60 6.36 40 6.24 20 6.12 6.00 -20 5.86 -40 5.72 -60 5.58 -80 5.43 T α -100 5.28 -120 5.09 -140 4.91 -160 4.72 -180 4.52 -200 4.30 -220 4.08 -240 3.83 T α -280 3.33 -300 3.07 -320 2.76 -340 2.45

Is this adequate? Straight Line Model

Quality of Fitted Data Does the model describe the data adequately? How well does the model predict the response variable predictably?

Linear Regression Models Limit our discussion to adequacy of straight-line regression models

Four checks Does the model look like it explains the data? Do 95%of the residuals fall with ±2 standard error of estimate? Is the coefficient of determination acceptable? Does the model meet the assumption of random errors?

Check 1: Does the model look like it explains the data?

Data and Model -340 -260 -180 -100 -20 60 2.45 3.58 4.52 5.28 5.86 6.36

Check 2. Do 95%of the residuals fall with ±2 standard error of estimate?

Standard error of estimate

Standard Error of Estimate -340 -260 -180 -100 -20 60 2.45 3.58 4.52 5.28 5.86 6.36 2.7357 3.5114 4.2871 5.0629 5.8386 6.6143 -0.28571 0.068571 0.23286 0.21714 0.021429 -0.25429

Standard Error of Estimate

Standard Error of Estimate      

Scaled Residuals 95% of the scaled residuals need to be in [-2,2]

Scaled Residuals Ti αi Residual Scaled Residual -340 -260 -180 -100 -20 60 2.45 3.58 4.52 5.28 5.86 6.36 -0.28571 0.068571 0.23286 0.21714 0.021429 -0.25429 -1.1364 0.27275 0.92622 0.86369 0.085235 -1.0115

3. Find the coefficient of determination

Coefficient of determination

Sum of square of residuals between data and mean y x

Sum of square of residuals between observed and predicted y x

Calculation of St -340 -260 -180 -100 -20 60 2.45 3.58 4.52 5.28 5.86 6.36 -2.2250 -1.0950 0.15500 0.60500 1.1850 1.6850

Calculation of Sr -340 -260 -180 -100 -20 60 2.45 3.58 4.52 5.28 5.86 6.36 2.7357 3.5114 4.2871 5.0629 5.8386 6.6143 -0.28571 0.068571 0.23286 0.21714 0.021429 -0.25429

Coefficient of determination

Limits of Coefficient of Determination

Correlation coefficient How do you know if r is positive or negative ?

What does a particular value of |r| mean? 0.8 to 1.0 - Very strong relationship  0.6 to 0.8 - Strong relationship  0.4 to 0.6 - Moderate relationship  0.2 to 0.4 - Weak relationship  0.0 to 0.2 - Weak or no relationship 

Final Exam Grade

Final Exam Grade vs Pre-Req GPA

Redoing Check 1, 2 and 3 with 20 data points

Check 1:Plot Model and Data α 60 6.36 40 6.24 20 6.12 6.00 -20 5.86 -40 5.2 -60 5.58 -80 5.43 T α -100 5.28 -120 5.09 -140 4.91 -160 4.72 -180 4.52 -200 4.30 -220 4.08 -240 3.83 -280 3.33 -300 3.07 -320 2.76 -340 2.45

Check 2: Using Standard Error of Estimate

Check 3: Using Coefficient of Determination      

Check 4. Does the model meet assumption of random errors?

Model meets assumption of random errors Residuals are negative as well as positive Variation of residuals as a function of the independent variable is random Residuals follow a normal distribution There is no autocorrelation between the data points.

Are residuals negative and positive?

Is variation of residuals as a function of independent variable random?

Do the residuals follow normal distribution?

END

What polynomial model to choose if one needs to be chosen?

Which model to choose?

Optimum Polynomial: Wrong Criterion

Optimum Polynomial of Regression

END

Effect of an Outlier

Effect of Outlier

Effect of Outlier