Chapter 3 Experiments with a Single Factor: The Analysis of Variance

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 The Analysis of Variance.
Advertisements

Chapter 3 Experiments with a Single Factor: The Analysis of Variance
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Analysis of Variance (ANOVA) ANOVA methods are widely used for comparing 2 or more population means from populations that are approximately normal in distribution.
Chapter 6 The 2k Factorial Design
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Analysis of Variance Outlines: Designing Engineering Experiments
Model Adequacy Checking in the ANOVA Text reference, Section 3-4, pg
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Chapter 5 Introduction to Factorial Designs
1 Chapter 6 The 2 k Factorial Design Introduction The special cases of the general factorial design (Chapter 5) k factors and each factor has only.
Part I – MULTIVARIATE ANALYSIS
Chapter 3 Analysis of Variance
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
1 Chapter 5 Introduction to Factorial Designs Basic Definitions and Principles Study the effects of two or more factors. Factorial designs Crossed:
Lecture 9: One Way ANOVA Between Subjects
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Chapter 11 Multiple Regression.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Chapter 2 Simple Comparative Experiments
Chapter Ten The Analysis Of Variance. ANOVA Definitions > Factor The characteristic that differentiates the treatment or populations from one another.
Inferences About Process Quality
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
13 Design and Analysis of Single-Factor Experiments:
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Linear Contrasts and Multiple Comparisons (Chapter 9)
Regression and Correlation Methods Judy Zhong Ph.D.
The Chi-Square Distribution 1. The student will be able to  Perform a Goodness of Fit hypothesis test  Perform a Test of Independence hypothesis test.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
ITK-226 Statistika & Rancangan Percobaan Dicky Dermawan
Design of Engineering Experiments Part 4 – Introduction to Factorials
1 The General Factorial Design Two-factor factorial design General case: Factor A – a levels Factor B – b levels … n replicate (n  2) Total observations:
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
STA305 week21 The One-Factor Model Statistical model is used to describe data. It is an equation that shows the dependence of the response variable upon.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Design of Engineering Experiments Part 2 – Basic Statistical Concepts
1 The Drilling Experiment Example 6-3, pg. 237 A = drill load, B = flow, C = speed, D = type of mud, y = advance rate of the drill.
Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =
1 The 2 3 Factorial Design Standard order: (1), a, b, ab, c, ac, bc, abc.
Orthogonal Linear Contrasts This is a technique for partitioning ANOVA sum of squares into individual degrees of freedom.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
1 The Two-Factor Mixed Model Two factors, factorial experiment, factor A fixed, factor B random (Section 13-3, pg. 495) The model parameters are NID random.
Sample Size Determination Text, Section 3-7, pg. 101 FAQ in designed experiments (what’s the number of replicates to run?) Answer depends on lots of things;
DOX 6E Montgomery1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Remember You just invented a “magic math pill” that will increase test scores. On the day of the first test you give the pill to 4 subjects. When these.
Chapter 11: The ANalysis Of Variance (ANOVA)
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
Chapters Way Analysis of Variance - Completely Randomized Design.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
1 Design of Engineering Experiments – The 2 k Factorial Design Text reference, Chapter 6 Special case of the general factorial design; k factors, all at.
Model adequacy checking in the ANOVA Checking assumptions is important –Normality –Constant variance –Independence –Have we fit the right model? Later.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
1 Chapter 5.8 What if We Have More Than Two Samples?
Design and Analysis of Experiments
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
Chapter 2 Simple Comparative Experiments
Chapter 5 Introduction to Factorial Designs
Chapter 11 Analysis of Variance
Chapter 11: The ANalysis Of Variance (ANOVA)
1-Way Analysis of Variance - Completely Randomized Design
Design and Analysis of Experiments
Presentation transcript:

Chapter 3 Experiments with a Single Factor: The Analysis of Variance

3.1 An Example Chapter 2: A signal-factor experiment with two levels of the factor Consider signal-factor experiments with a levels of the factor, a  2 Example: The tensile strength of a new synthetic fiber. The weight percent of cotton Five levels: 15%, 20%, 25%, 30%, 35% a = 5 and n = 5

Does changing the cotton weight percent change the mean tensile strength? Is there an optimum level for cotton content?

3.2 The Analysis of Variance a levels (treatments) of a factor and n replicates for each level. yij: the jth observation taken under factor level or treatment i.

Models for the Data Means model: yij is the ijth observation, i is the mean of the ith factor level ij is a random error with mean zero Let i =  + i ,  is the overall mean and i is the ith treatment effect Effects model:

Linear statistical model One-way or Signal-factor analysis of variance model Completely randomized design: the experiments are performed in random order so that the environment in which the treatment are applied is as uniform as possible. For hypothesis testing, the model errors are assumed to be normally and independently distributed random variables with mean zero and variance, 2, i.e. yij ~ N(+i, 2) Fixed effect model: a levels have been specifically chosen by the experimenter.

3.3 Analysis of the Fixed Effects Model Interested in testing the equality of the a treatment means, and E(yij) = i =  + i, i = 1,2, …, a H0: 1 = … = a v.s. H1: i  j, for at least one pair (i, j) Constraint: H0: 1 = … = a =0 v.s. H1: i  0, for at least one i

Notations: 3.3.1 Decomposition of the Total Sum of Squares Total variability into its component parts. The total sum of squares (a measure of overall variability in the data) Degree of freedom: an – 1 = N – 1

SSTreatment: sum of squares of the differences between the treatment averages (sum of squares due to treatments) and the grand average, and a – 1 degree of freedom SSE: sum of squares of the differences of observations within treatments from the treatment average (sum of squares due to error), and N – a degrees of freedom.

A large value of SSTreatments reflects large differences in treatment means A small value of SSTreatments likely indicates no differences in treatment means dfTotal = dfTreatment + dfError If there are no differences between a treatment means,

Mean squares: 3.3.2 Statistical Analysis Assumption: ij are normally and independently distributed with mean zero and variance 2

SST/2 ~ Chi-square (N – 1), SSE/2 ~ Chi-square (N – a), SSTreatments/2 ~ Chi-square (a – 1), and SSE/2 and SSTreatments/2 are independent (Theorem 3.1) H0: 1 = … = a =0 v.s. H1: i  0, for at least one i

Reject H0 if F0 > F, a-1, N-a Rewrite the sum of squares: See page 71

Response:. Strength. ANOVA for Selected Factorial Model Response: Strength ANOVA for Selected Factorial Model Analysis of variance table [Partial sum of squares] Sum of Mean F Source Squares DF Square Value Prob > F Model 475.76 4 118.94 14.76 < 0.0001 A 475.76 4 118.94 14.76 < 0.0001 Pure Error 161.20 20 8.06 Cor Total 636.96 24 Std. Dev. 2.84 R-Squared 0.7469 Mean 15.04 Adj R-Squared 0.6963 C.V. 18.88 Pred R-Squared 0.6046 PRESS 251.88 Adeq Precision 9.294

3.3.3 Estimation of the Model Parameters Model: yij =  + i + ij Estimators: Confidence intervals:

Example 3.3 (page 75) Simultaneous Confidence Intervals (Bonferroni method): Construct a set of r simultaneous confidence intervals on treatment means which is at least 100(1-): 100(1-/r) C.I.’s 3.3.4 Unbalanced Data Let ni observations be taken under treatment i, i=1,2,…,a, N = i ni,

1. The test statistic is relatively insensitive to small departures from the assumption of equal variance for the a treatments if the sample sizes are equal. 2. The power of the test is maximized if the samples are of equal size.

3.4 Model Adequacy Checking Assumptions: yij ~ N(+i, 2) The examination of residuals Definition of residual: The residuals should be structureless.

3.4.1 The Normality Assumption Plot a histogram of the residuals Plot a normal probability plot of the residuals See Table 3-6

May be Slightly skewed (right tail is longer than left tail) Light tail (the left tail of error is thinner than the tail part of standard normal) Outliers The possible causes of outliers: calculations, data coding, copy error,…. Sometimes outliers are more informative than the rest of the data.

Detect outliers: Examine the standardized residuals, 3.4.2 Plot of Residuals in Time Sequence Plotting the residuals in time order of data collection is helpful in detecting correlation between the residuals. Independence assumption

3.4.3 Plot of Residuals Versus Fitted Values Plot the residuals versus the fitted values Structureless

Variance-stabilizing transformation Nonconstant variance: the variance of the observations increases as the magnitude of the observation increase, i.e. yij  2 If the factor levels having the larger variance also have small sample sizes, the actual type I error rate is larger than anticipated. Variance-stabilizing transformation Poisson Square root transformation Lognormal Logarithmic transformation Binomial Arcsin transformation

Statistical Tests for Equality Variance: Bartlett’s test: Reject null hypothesis if

Example 3.4: the test statistic is Bartlett’s test is sensitive to the normality assumption The modified Levene test: Use the absolute deviation of the observation in each treatment from the treatment median. Mean deviations are equal => the variance of the observations in all treatments will be the same. The test statistic for Levene’s test is the ANOVA F statistic for testing equality of means.

Example 3.5: Four methods of estimating flood flow frequency procedure (see Table 3.7) ANOVA table (Table 3.8) The plot of residuals v.s. fitted values (Figure 3.7) Modified Levene’s test: F0 = 4.55 with P-value = 0.0137. Reject the null hypothesis of equal variances.

Find y* = y that yields a constant variance. *  +-1 Let E(y) =  and y   Find y* = y that yields a constant variance. *  +-1 Variance-Stabilizing Transformations * and   = 1 -  Transformation *constant 1 No transformation *  1/2 ½ Square root *   Log *  3/2 3/2 -1/2 Reciprocal square root *  2 2 -1 Reciprocal

How to find : Use See Figure 3.8, Table 3.10 and Figure 3.9

3.5 Practical Interpretation of Results Conduct the experiment => perform the statistical analysis => investigate the underlying assumptions => draw practical conclusion 3.5.1 A Regression Model Qualitative factor: compare the difference between the levels of the factors. Quantitative factor: develop an interpolation equation for the response variable.

Regression analysis See Figure 3.1 Final Equation in Terms of Actual Factors: Strength = +62.61143 -9.01143* Cotton Weight % +0.48143 * Cotton Weight %^2 -7.60000E-003 * Cotton Weight %^3 This is an empirical model of the experimental results

3.5.2 Comparisons Among Treatment Means If that hypothesis is rejected, we don’t know which specific means are different Determining which specific means differ following an ANOVA is called the multiple comparisons problem 3.5.3 Graphical Comparisons of Means

3.5.4 Contrast A contrast: a linear combination of the parameters of the form H0:  = 0 v.s. H1:   0 Two methods for this testing.

The first method:

The second method:

The C.I. for a contrast,  Unequal Sample Size

3.5.5 Orthogonal Contrast Two contrasts with coefficients, {ci} and {di}, are orthogonal if ci di = 0 For a treatments, the set of a – 1 orthogonal contrasts partition the sum of squares due to treatments into a – 1 independent single-degree-of-freedom components. Thus, tests performed on orthogonal contrasts are independent. See Example 3.6 (Page 94)

3.5.6 Scheffe’s Method for Comparing All Contrasts Scheffe (1953) proposed a method for comparing any and all possible contrasts between treatment means. See Page 95 and 96

3.5.7 Comparing Pairs of Treatment Means Compare all pairs of a treatment means Tukey’s Test: The studentized range statistic: See Example 3.7

Sometimes overall F test from ANOVA is significant, but the pairwise comparison of mean fails to reveal any significant differences. The F test is simultaneously considering all possible contrasts involving the treatment means, not just pairwise comparisons. The Fisher Least Significant Difference (LSD) Method For H0: i = j

The least significant difference (LSD): See Example 3.8 Duncan’s Multiple Range Test The a treatment averages are arranged in ascending order, and the standard error of each average is determined as

Assume equal sample size, the significant ranges are Total a(a-1)/2 pairs Example 3.9 The Newman-Keuls Test Similar as Duncan’s multiple range test The critical values:

3.5.8 Comparing Treatment Means with a Control Assume one of the treatments is a control, and the analyst is interested in comparing each of the other a – 1 treatment means with the control. Test H0: i = a v.s. H1: : i  a, i = 1,2,…, a – 1 Dunnett (1964) Compute Reject H0 if Example 3.10

3.7 Determining Sample Size Determine the number of replicates to run 3.7.1 Operating Characteristic Curves (OC Curves) OC curves: a plot of type II error probability of a statistical test,

F0 = MSTreatment / MSE ~ noncentral F If H0 is false, then F0 = MSTreatment / MSE ~ noncentral F with degree of freedom a – 1 and N – a and noncentrality parameter  Chart V of the Appendix Determine Let i be the specified treatments. Then estimates of i : For 2, from prior experience, a previous experiment or a preliminary test or a judgment estimate.

Example 3.11 Difficulty: How to select a set of treatment means on which the sample size decision should be based. Another approach: Select a sample size such that if the difference between any two treatment means exceeds a specified value the null hypothesis should be rejected.

3.7.2 Specifying a Standard Deviation Increase Let P be a percentage for increase in standard deviation of an observation. Then For example (Page 110): If P = 20, then

3.7.3 Confidence Interval Estimation Method Use Confidence interval. For example: we want 95% C.I. on the difference in mean tensile strength for any two cotton weight percentages to be  5 psi and  = 3. See Page 110.

3.9 The Regression Approach to the Analysis of Variance Model: yij =  + i + ij

The normal equations Apply the constraint Then estimations are Regression sum of squares (the reduction due to fitting the full model)

The error sum of squares: Find the sum of squares resulting from the treatment effects:

The testing statistic for H0: 1 = … = a