Linear Lack of Fit (LOF) Test An F test for checking whether a linear regression function is inadequate in describing the trend in the data.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Hypothesis Testing Steps in Hypothesis Testing:
Inference for Regression
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Simple Linear Regression
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Independent Sample T-test Formula
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
PSY 307 – Statistics for the Behavioral Sciences
1 Pertemuan 13 Uji Koefisien Korelasi dan Regresi Matakuliah: A0392 – Statistik Ekonomi Tahun: 2006.
Simple Linear Regression Analysis
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Simple Linear Regression and Correlation
Chapter 7 Forecasting with Simple Regression
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Linear Regression/Correlation
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Hypothesis tests for slopes in multiple linear regression model Using the general linear test and sequential sums of squares.
Review Guess the correlation. A.-2.0 B.-0.9 C.-0.1 D.0.1 E.0.9.
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Simple linear regression Linear regression with one predictor variable.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 1 Slide Simple Linear Regression Coefficient of Determination Chapter 14 BA 303 – Spring 2011.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
One-Factor Analysis of Variance A method to compare two or more (normal) population means.
Basic concept Measures of central tendency Measures of central tendency Measures of dispersion & variability.
One-Way Analysis of Variance … to compare 2 or population means.
Prediction concerning the response Y. Where does this topic fit in? Model formulation Model estimation Model evaluation Model use.
An alternative approach to testing for a linear association The Analysis of Variance (ANOVA) Table.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Sequential sums of squares … or … extra sums of squares.
Analisa Regresi Week 7 The Multiple Linear Regression Model
Lack of Fit (LOF) Test A formal F test for checking whether a specific type of regression function adequately fits the data.
Multiple regression. Example: Brain and body size predictive of intelligence? Sample of n = 38 college students Response (Y): intelligence based on the.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Diagnostics – Part II Using statistical tests to check to see if the assumptions we made about the model are realistic.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
The general linear test approach to regression analysis.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Inference for  0 and 1 Confidence intervals and hypothesis tests.
The p-value approach to Hypothesis Testing
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
1 1 Slide © 2011 Cengage Learning Assumptions About the Error Term  1. The error  is a random variable with mean of zero. 2. The variance of , denoted.
Summary of the Statistics used in Multiple Regression.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
Psychology 202a Advanced Psychological Statistics October 27, 2015.
Analysis of variance approach to regression analysis … an (alternative) approach to testing for a linear association.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Chapter 20 Linear and Multiple Regression
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Chapter 13 Simple Linear Regression
Prediction of new observations
Hypothesis testing and Estimation
Goodness of Fit The sum of squared deviations from the mean of a variable can be decomposed as follows: TSS = ESS + RSS This decomposition can be used.
Simple Linear Regression
The Analysis of Variance
F test for Lack of Fit The lack of fit test..
Presentation transcript:

Linear Lack of Fit (LOF) Test An F test for checking whether a linear regression function is inadequate in describing the trend in the data

Where does this topic fit in? Model formulation Model estimation Model evaluation Model use

Example 1 Do the data suggest that a linear function is inadequate in describing the relationship between skin cancer mortality and latitude?

Example 2 Do the data suggest that a linear function is inadequate in describing the relationship between the length and weight of an alligator?

Example 3 Do the data suggest that a linear function is inadequate in describing the relationship between iron content and weight loss due to corrosion?

Some notation

Decomposing the error

The basic idea Break down the residual error (“error sum of squares – SSE) into two components: –a component that is due to lack of model fit (“lack of fit sum of squares” – SSLF) –a component that is due to pure random error (“pure error sum of squares” – SSPE) If the lack of fit sum of squares is a large component of the residual error, it suggests that a linear function is inadequate.

A geometric decomposition

The decomposition holds for the sum of the squared deviations, too: Error sum of squares (SSE) Lack of fit sum of squares (SSLF) Pure error sum of squares (SSPE)

Breakdown of degrees of freedom Degrees of freedom associated with SSE Degrees of freedom associated with SSLF Degrees of freedom associated with SSPE

Definitions of Mean Squares And, the pure error mean square (MSPE) is defined as: The lack of fit mean square (MSLF) is defined as:

Expected Mean Squares If μ i = β 0 +β 1 X i, we’d expect the ratio MSLF/MSPE to be … If μ i ≠ β 0 +β 1 X i, we’d expect the ratio MSLF/MSPE to be … Use ratio, MSLF/MSPE, to reject whether or not μ i = β 0 +β 1 X i.

Expanded Analysis of Variance Table SourceDFSSMSF Regression1 Residual error n-2 Lack of fitc-2 Pure errorn-c Totaln-1

The formal lack of fit F-test Null hypothesis H 0 : μ i = β 0 +β 1 X i Alternative hypothesis H A : μ i ≠ β 0 +β 1 X i Test statistic P-value = What is the probability that we’d get an F* statistic as large as we did, if the null hypothesis is true? The P-value is determined by comparing F* to an F distribution with c-2 numerator degree of freedom and n-c denominator degrees of freedom.

LOF Test in Minitab Stat >> Regression >> Regression … Specify predictor and response. Under Options… –under Lack of Fit Tests, select the box labeled Pure error. Select OK.

Decomposing the error

Is there lack of linear fit? Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Decomposing the error

Is there lack of linear fit? Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 1 Do the data suggest that a linear function is not adequate in describing the relationship between skin cancer mortality and latitude?

Example 1: Mortality and Latitude Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 2 Do the data suggest that a linear function is not adequate in describing the relationship between the length and weight of an alligator?

Example 2: Alligator length and weight Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 3 Do the data suggest that a linear function is not adequate in describing the relationship between iron content and weight loss due to corrosion?

Example 3: Iron and corrosion Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 4 Do the data suggest that a linear function is not adequate in describing the relationship between mileage and groove depth?

Example 4: Tread wear Analysis of Variance Source DF SS MS F P Regression Residual Error Total No replicates. Cannot do pure error test.

When is it okay to perform the LOF Test? When the “INE” part of the “LINE” assumptions are met. The LOF test requires repeat observations, called replicates, for at least one of the values of the predictor X.