Lack of Fit (LOF) Test A formal F test for checking whether a specific type of regression function adequately fits the data.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Objectives (BPS chapter 24)
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Simple Linear Regression Estimates for single and mean responses.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 11 th Edition.
Note 14 of 5E Statistics with Economics and Business Applications Chapter 12 Multiple Regression Analysis A brief exposition.
Every achievement originates from the seed of determination. 1Random Effect.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 11 th Edition.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
REGRESSION AND CORRELATION
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Simple Linear Regression and Correlation
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Hypothesis tests for slopes in multiple linear regression model Using the general linear test and sequential sums of squares.
Regression and Correlation Methods Judy Zhong Ph.D.
Simple linear regression Linear regression with one predictor variable.
Linear Lack of Fit (LOF) Test An F test for checking whether a linear regression function is inadequate in describing the trend in the data.
Inferences in Regression and Correlation Analysis Ayona Chatterjee Spring 2008 Math 4803/5803.
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Prediction concerning the response Y. Where does this topic fit in? Model formulation Model estimation Model evaluation Model use.
Introduction to Probability and Statistics Thirteenth Edition Chapter 12 Linear Regression and Correlation.
An alternative approach to testing for a linear association The Analysis of Variance (ANOVA) Table.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Chap 13-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 13-1 Chapter 13 Simple Linear Regression Basic Business Statistics 12.
Copyright ©2011 Nelson Education Limited Linear Regression and Correlation CHAPTER 12.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Solutions to Tutorial 5 Problems Source Sum of Squares df Mean Square F-test Regression Residual Total ANOVA Table Variable.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Sequential sums of squares … or … extra sums of squares.
Analisa Regresi Week 7 The Multiple Linear Regression Model
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
14- 1 Chapter Fourteen McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Multiple regression. Example: Brain and body size predictive of intelligence? Sample of n = 38 college students Response (Y): intelligence based on the.
Diagnostics – Part II Using statistical tests to check to see if the assumptions we made about the model are realistic.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
The general linear test approach to regression analysis.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
KNN Ch. 3 Diagnostics and Remedial Measures Applied Regression Analysis BUSI 6220.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Chapter 12 Simple Linear Regression.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Inference for  0 and 1 Confidence intervals and hypothesis tests.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
Regression Analysis Presentation 13. Regression In Chapter 15, we looked at associations between two categorical variables. We will now focus on relationships.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
Analysis of variance approach to regression analysis … an (alternative) approach to testing for a linear association.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Announcements There’s an in class exam one week from today (4/30). It will not include ANOVA or regression. On Thursday, I will list covered material and.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Cases of F-test Problems with Examples
Prediction and Prediction Intervals
Hypothesis testing and Estimation
Full Model: contain ALL coefficients of interest
Simple Linear Regression
Essentials of Statistics for Business and Economics (8e)
F test for Lack of Fit The lack of fit test..
Presentation transcript:

Lack of Fit (LOF) Test A formal F test for checking whether a specific type of regression function adequately fits the data

Example 1 Do the data suggest that a linear function is adequate in describing the relationship between skin cancer mortality and latitude?

Example 2 Do the data suggest that a linear function is adequate in describing the relationship between the length and weight of an alligator?

Example 3 Do the data suggest that a linear function is adequate in describing the relationship between iron content and weight loss due to corrosion?

Lack of fit test for a linear function … the basic idea Use general linear test approach. Full model is most general model with no restrictions on the means μ j at each X j level. Reduced model assumes that the μ j are a linear function of the X j, i.e., μ j = β 0 + β 1 X j. Determine SSE(F), SSE(R), and F statistic. If the P-value is small, reject the reduced model (H 0 : No lack of fit (linear)) in favor of the full model (H A : Lack of fit (not linear)).

Assumptions and requirements The Y observations for a given X level are independent. The Y observations for a given X level are normally distributed. The distribution of Y for each level of X has the same variance. LOF test requires repeat observations, called replications (or replicates), for at least one of the X values.

Notation iron wgtloss c different levels of X (c=7 with X 1 =0.01, X 2 =0.48, …, X 7 =1.96) n j = number of replicates for j th level of X (X j ) (n 1 =3, n 2 =2, …, n 7 =2) for a total of n = n 1 + … + n c observations. Y ij = observed value of the response variable for the i th replicate of X j (Y 11 =127.6, Y 21 =130.1, …, Y 27 =86.2)

The Full Model Assume nothing about (or “put no structure on”) the means of the responses, μ j, at the j th level of X: Make usual assumptions about error terms (ε ij ): normal, mean 0, constant variance σ 2. Least squares estimates of μ j are sample means of responses at X j level. “Pure error sum of squares”

The Reduced Model Assume the means of the responses, μ j, are linearly related to the j th level of X (same model as before, just modified subscripts): Make usual assumptions about error terms (ε ij ): normal, mean 0, constant variance σ 2. Least squares estimates of μ j are as usual. “Error sum of squares”

Error sum of squares decomposition error deviationpure error deviationlack of fit deviation

The F test

The Decision (Intuitively) If the largest portion of the error sum of squares is due to lack of fit, the F test should be large. A large F* statistic leads to a small P-value (determined by F(c-2, n-2) distribution). If P-value is small, reject null and conclude significant lack of (linear) fit.

LOF Test summarized in an ANOVA Table

LOF Test in Minitab Stat >> Regression >> Regression … Specify predictor and response. Under Options…, under Lack of Fit Tests, select box labeled “Pure error.” Select OK. Select OK. ANOVA table appears in session window.

Example 1 Do the data suggest that a linear function is adequate in describing the relationship between skin cancer mortality and latitude?

Example 1: Mortality and Latitude Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 2 Do the data suggest that a linear function is adequate in describing the relationship between the length and weight of an alligator?

Example 2: Alligator length and weight Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Example 3 Do the data suggest that a linear function is adequate in describing the relationship between iron content and weight loss due to corrosion?

Example 3: Iron and corrosion Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total rows with no replicates

Closing comment #1 The t-test or F=MSR/MSE test only tests whether there is a linear relation between the predictor and response (β 1 ≠0) or not (β 1 =0). Failing to reject the null does not imply that there is no relation between the predictor and response.

Example: Closing comment #1

The regression equation is Y* = X Predictor Coef SE Coef T P Constant X S = R-Sq = 0.1% R-Sq(adj) = 0.0% Analysis of Variance Source DF SS MS F P Regression Residual Error Lack of Fit Pure Error Total

Closing comments #2, #3 We used general linear test approach to test appropriateness of a linear function. It can just as easily be used to test for appropriateness of other functions (quadratic, cubic). The alternative H A : Lack of fit (not linear) includes all possible regression functions other than a linear one. Use residuals to help identify what type of function is appropriate.