Model Adequacy Checking in the ANOVA Text reference, Section 3-4, pg

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Chapter 14, part D Statistical Significance. IV. Model Assumptions The error term is a normally distributed random variable and The variance of  is constant.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
1 Design of Engineering Experiments Part 3 – The Blocking Principle Text Reference, Chapter 4 Blocking and nuisance factors The randomized complete block.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Analysis of Variance Outlines: Designing Engineering Experiments
Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Design of Experiments and Analysis of Variance
Analysis of Variance (ANOVA) Statistics for the Social Sciences Psychology 340 Spring 2010.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Chapter 13 Multiple Regression
Chapter 10 Simple Regression.
Part I – MULTIVARIATE ANALYSIS
Chapter 12 Multiple Regression
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Comparing Means.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Chapter 3 Experiments with a Single Factor: The Analysis of Variance
Final Review Session.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Chapter 11 Multiple Regression.
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
One-way Between Groups Analysis of Variance
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Chapter 2 Simple Comparative Experiments
Inferences About Process Quality
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
5-3 Inference on the Means of Two Populations, Variances Unknown
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Simple Linear Regression Analysis
13 Design and Analysis of Single-Factor Experiments:
Copyright © 2009 Pearson Education, Inc. Chapter 28 Analysis of Variance.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. *Chapter 28 Analysis of Variance.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Chapter 12 Multiple Regression and Model Building.
Comparing Means. Anova F-test can be used to determine whether the expected responses at the t levels of an experimental factor differ from each other.
ITK-226 Statistika & Rancangan Percobaan Dicky Dermawan
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
ANOVA (Analysis of Variance) by Aziza Munir
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance & Post-ANOVA ANALYSIS IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping.
Design of Engineering Experiments Part 2 – Basic Statistical Concepts
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
DOX 6E Montgomery1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: One-way ANOVA Marshall University Genomics Core.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Statistics for the Social Sciences Psychology 340 Spring 2009 Analysis of Variance (ANOVA)
Chapter 11: The ANalysis Of Variance (ANOVA)
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
Analysis of Variance STAT E-150 Statistical Methods.
Chapters Way Analysis of Variance - Completely Randomized Design.
Model adequacy checking in the ANOVA Checking assumptions is important –Normality –Constant variance –Independence –Have we fit the right model? Later.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
1 Chapter 5.8 What if We Have More Than Two Samples?
Step 1: Specify a null hypothesis
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Inference for Regression
CHAPTER 29: Multiple Regression*
Comparing Several Means: ANOVA
6-1 Introduction To Empirical Models
Presentation transcript:

Model Adequacy Checking in the ANOVA Text reference, Section 3-4, pg Checking assumptions is important Have we fit the right model? Normality Independence Constant variance Later we will talk about what to do if some of these assumptions are violated

Model Adequacy Checking in the ANOVA Violations of the basic assumptions and model adequacy can be investigated by examination of residuals Residuals (see text, Sec. 3-4, pg. 75) Statistical software usually generates the residuals Residual plots are very useful Normal probability plot of residuals

Other Important Residual Plots

Other Important Residual Plots A tendency of positive/negative residuals indicates correlation – violation of independence assumption Special attention is needed on uneven spread on the two ends in residuals versus time plot Peak discharge vs. fitted value (Ex. 3-5, page 81) – a violation of independence or constant variance assumptions Figure 3-7 Plot of residuals versus fitted value (Ex. 3-5)

Outliers When the residual is much larger/smaller than any of the others First check if it is a human mistake in recording /calculation If it cannot be rejected based on reasonable non-statistical grounds, it must be specially studied At least two analyses (with and without the outlier) can be made Procedure to detect outliers Visual examination Using standardized residuals If eij: N(0,s2), then dij: N(0,1). Therefore, 68% of dij should fall within the limits ±1; 95% of dij should fall within the limits ±2; all dij should fall within the limits ±3; A residual bigger than 3 or 4 standard deviations from zero is a potential outlier.

Post-ANOVA Comparison of Means The analysis of variance tests the hypothesis of equal treatment means Assume that residual analysis is satisfactory If that hypothesis is rejected, we don’t know which specific means are different Determining which specific means differ following an ANOVA is called the multiple comparisons problem There are lots of ways to do this…see text, Section 3-5, pg. 87 We will use pairwise t-tests on means…sometimes called Fisher’s Least Significant Difference (or Fisher’s LSD) Method

Contrasts Used for multiple comparisons A contrast is a linear combination of parameters with The hypotheses are Hypothesis testing can be done using a t-test: The null hypothesis would be rejected if |to| > ta/2,N-a

Contrasts (cont.) Hypothesis testing can be done using an F-test: The null hypothesis would be rejected if Fo > Fa,1,N-a The 100(1-a) percent confidence interval on the contrast is

Comparing Pairs of Treatment Means Comparing all pairs of a treatment means <=> testing Ho: mi = mj for all i  j. Tukey’s test Two means are significantly different if the absolute value of their sample differences exceeds for equal sample sizes Overall significance level is a for unequal sample sizes Overall significance level is at most a

100(1-a) percent confidence intervals (Tukey’s test) for equal sample sizes for unequal sample sizes

Example: etch rate experiment a = 0.05 f = 16 (degrees of freedom for error) q0.05(4,16) = 4.05 (Table VII),

The Fisher Least Significant Difference (LSD) Method Two means are significantly different if where for equal sample sizes for unequal sample sizes For Example 3-1

Duncan’s Multiple Range Test (page 100) The Newman-Keuls Test (page 102) Are they the same? Which one to use? Opinions differ LSD method and Duncan’s multiple range test are the most powerful ones Tukey method controls the overall error rate

Design-Expert Output Treatment Means (Adjusted, If Necessary) Estimated Standard Mean Error 1-160 551.20 8.17 2-180 587.40 8.17 3-200 625.40 8.17 4-220 707.00 8.17 Mean Standard t for H0 Treatment Difference DF Error Coeff=0 Prob > |t| 1 vs 2 -36.20 1 11.55 -3.13 0.0064 1 vs 3 -74.20 1 11.55 -6.42 <0.0001 1 vs 4 -155.80 1 11.55 -13.49 < 0.0001 2 vs 3 -38.00 1 11.55 -3.29 0.0046 2 vs 4 -119.60 1 11.55 -10.35 < 0.0001 3 vs 4 -81.60 1 11.55 -7.06 < 0.0001

For the Case of Quantitative Factors, a Regression Model is often Useful Quantitative factors: ones whose levels can be associated with points on a numerical scale Qualitative factors: Whose levels cannot be arranged in order of magnitude The experimenter is often interested in developing an interpolation equation for the response variable in the experiment – empirical model Approximations: y = b0 + b1 x + b2 x2 + e (a quadratic model) y = b0 + b1 x + b2 x2 + b3 x3 + e (a cubic model) The method of least squares can be used to estimate the parameters. Balance between “goodness-of-fit” and “generality”