Presentation is loading. Please wait.

Presentation is loading. Please wait.

Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance.

Similar presentations


Presentation on theme: "Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance."— Presentation transcript:

1 Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance

2 Copyright © Cengage Learning. All rights reserved. 11.2 Two-Factor ANOVA with K ij  1

3 3 Two-factor Anova With K ij  1 We analyzed data from a two-factor experiment in which there was one observation for each of the IJ combinations of factor levels. The  ij ’s were assumed to have an additive structure with  ij =  +  i +  j,  I =  j = 0. Additivity means that the difference in true average responses for any two levels of the factors is the same for each level of the other factor.

4 4 Two-factor Anova With K ij  1 For example,  ij –  I’j = (  +  i +  j ) – (  +  i’ +  j ) =  i –  I’, independent of the level j of the second factor. This is shown in Figure 11.1(a), in which the lines connecting true average responses are parallel. Mean responses for two types of model: (a) additive Figure 11.1(a)

5 5 Two-factor Anova With K ij  1 Figure 11.1(b) depicts a set of true average responses that does not have additive structure. Mean responses for two types of model: (b) nonadditive Figure 11.2 (b)

6 6 Two-factor Anova With K ij  1 The lines connecting these  Ij ’s are not parallel, which means that the difference in true average responses for different levels of one factor does depend on the level of the other factor. When additivity does not hold, we say that there is interaction between the different levels of the factors. The assumption of additivity allowed us to obtain an estimator of the random error variance  2 (MSE) that was unbiased whether or not either null hypothesis of interest was true.

7 7 Two-factor Anova With K ij  1 When K i,j  1 for at least one (i, j ) pair, a valid estimator of  2 can be obtained without assuming additivity. Our focus here will be on the case K ij = K  1, so the number of observations per “cell” (for each combination of levels) is constant.

8 8 Fixed Effects Parameters and Hypotheses

9 9 Rather than use the  Ij ’s themselves as model parameters, it is customary to use an equivalent set that reveals more clearly the role of interaction. Notation Thus  is the expected response averaged over all levels of both factors (the true grand mean),  i  is the expected response averaged over levels of the second factor when the first factor A is held at level i, and similarly for   j. (11.7)

10 10 Fixed Effects Parameters and Hypotheses Definition  i =  i  –  = the effect of factor A at level i  j =  j  –  = the effect of factor A at level j γ ij =  ij – (  +  i +  j ) = interaction between factor A at level i and factor B at level j from which  ij =  +  i +  j + γ ij (11.9) (11.8)

11 11 Fixed Effects Parameters and Hypotheses The model is additive if and only if all γ ij ’s = 0. The γ i j ’s are referred to as the interaction parameters. The ai s are called the main effects for factor A, and the bj’s are the main effects for factor B. Although there are I  i ’s, J  j ’s, and IJ γ i j ’s in addition to , the conditions  i = 0,  j = 0,  j γ ij = 0 for any i, and  j γ ij = 0 for any j [all by virtue of (11.7) and (11.8)] imply that only IJ of these new parameters are independently determined: , I – 1 of the  i ’s J – 1 of the  j ’s, and (I – 1)(J – 1) of the γ ij ’s.

12 12 Fixed Effects Parameters and Hypotheses There are now three sets of hypotheses to be considered: H 0AB : γ ij = 0 for all i, j versus H aAB : at least one γ ij  0 H 0A :  1 = … =  1 = 0 versus H aA : at least one  i  0 H 0B :  1 = … =  j = 0 versus H aB : at least one  j  0 The no-interaction hypothesis H 0AB is usually tested first. If H 0AB is not rejected, then the other two hypotheses can be tested to see whether the main effects are significant.

13 13 Fixed Effects Parameters and Hypotheses If H 0AB is rejected and H 0A is then tested and not rejected, the resulting model  ij =  +  j + γ ij does not lend itself to straightforward interpretation. In such a case, it is best to construct a picture similar to that of Figure 11.1(b) to try to visualize the way in which the factors interact. Mean responses for two types of model: nonadditive Figure 11.1(b)

14 14 The Model and Test Procedures

15 15 The Model and Test Procedures We now use triple subscripts for both random variables and observed values, with X ijk and x ijk referring to the kth observation (replication) when factor A is at level I and factor B is at level j. The fixed effects model is X ijk =  +  i +  j + γ ij +  ijk i = 1, …, I, j = 1, …, J, k = 1, …, K where the  ijk ’s are independent and normally distributed, each with mean 0 and variance  2. (11.20)

16 16 The Model and Test Procedures Again, a dot in place of a subscript denotes summation over all values of that subscript, and a horizontal bar indicates averaging. Thus X ij  is the total of all K observations made for factor A at level i and factor B at level j [all observations in the (i, j)th cell], and X ij  is the average of these K observations.

17 17 The Model and Test Procedures Definition Test procedures are based on the following sums of squares:

18 18 The Model and Test Procedures The fundamental identity is SST = SSA + SSB + SSAB + SSE SSAB is referred to as interaction sum of squares. cont’d

19 19 The Model and Test Procedures Total variation is thus partitioned into four pieces: unexplained (SSE—which would be present whether or not any of the three null hypotheses was true) and three pieces that may be attributed to the truth or falsity of the three H 0 s. Each of four mean squares is defined by MS = SS/df. The expected mean squares suggest that each set of hypotheses should be tested using the appropriate ratio of mean squares with MSE in the denominator: E(MSE) =  2

20 20 The Model and Test Procedures Each of the three mean square ratios can be shown to have an F distribution when the associated H 0 is true, which yields the following level  test procedures.

21 21 The Model and Test Procedures Hypotheses Test Statistic Value Rejection Region

22 22 Example 7 Lightweight aggregate asphalt mix has been found to have lower thermal conductivity than a conventional mix, which is desirable. The article “Influence of Selected Mix Design Factors on the Thermal Behavior of Lightweight Aggregate Asphalt Mixes” (J. of Testing and Eval., 2008: 1–8) reported on an experiment in which various thermal properties of mixes were determined.

23 23 Example 7 Three different binder grades were used in combination with three different coarse aggregate contents (%), with two observations made for each such combination, resulting in the conductivity data (W/m  °K) that appears in Table 11.6. Conductivity Data for Example 7 Table 11.6 cont’d

24 24 Example 7 Here I = J = 3 and K = 2 for a total of IJK = 18 observations. The results of the analysis are summarized in the ANOVA table which appears as Table 11.7 (a table with additional information appeared in the cited paper). ANOVA Table for Example 7 Table 11.7 cont’d

25 25 Example 7 The P-value for testing for the presence of interaction effects is.414, which is clearly larger than any reasonable significance level. Alternatively, f AB = 1.10  F.10,4,9 = 2.69, so the interaction null hypothesis cannot be rejected even at the largest significance level that would be used in practice. Thus it appears that there is no interaction between the two factors. cont’d

26 26 Example 7 However, both main effects are significant at the 5% significance level (.002 .05 and.000 .05; alternatively both corresponding F ratios greatly exceed F.05,2,9 = 4.26). So it appears that true average conductivity depends on which grade is used and also on the level of coarse-aggregate content. cont’d

27 27 Example 7 Figure 11.5(a) shows an interaction plot for the conductivity data. Interaction Plots for the Asphalt Data of Example 7. Response variable is conductivity. Figure 11.5 (a) cont’d

28 28 Example 7 Notice the nearly parallel sets of line segments for the three different asphalt grades, in agreement with the F test that shows no significant interaction effects. True average conductivity appears to decrease as aggregate content decreases. cont’d

29 29 Example 7 Figure 11.5(b) shows an interaction plot for the response variable thermal diffusivity, values of which appear in the cited article. Interaction Plots for the Asphalt Data of Example 7. (b) Response variable is diffusivity Figure 11.5 cont’d

30 30 Example 7 The bottom two sets of line segments are close to parallel, but differ markedly from those for PG64; in fact, the F ratio for interaction effects is highly significant here. Plausibility of the normality and constant variance assumptions can be assessed by constructing plots. Define the predicted (i.e., fitted) values to be the cell means: For example, the predicted value for grade PG58 and aggregate content 38 is = (.835 +.845)  2 =.840 for k = 1, 2. The residuals are the differences between the observations and corresponding predicted values: cont’d

31 31 Example 7 A normal probability plot of the residuals is shown in Figure 11.6(a). The pattern is sufficiently linear that there should be no concern about lack of normality. Plots for Checking Normality and Constant Variance Assumptions in Example 7 Figure 11.6(a) cont’d

32 32 Example 7 The plot of residuals against predicted values in Figure 11.6(b) shows a bit less spread on the right than on the left, but not enough of a differential to be worrisome; constant variance seems to be a reasonable assumption. Plots for Checking Normality and Constant Variance Assumptions in Example 7 Figure 11.6(b) cont’d

33 33 Multiple Comparisons

34 34 Multiple Comparisons When the no-interaction hypothesis H 0AB is not rejected and at least one of the two main effect null hypotheses is rejected, Tukey’s method can be used to identify significant differences in levels. For identifying differences among the  i ’s when H 0A is rejected, 1. Obtain Q ,I,IJ(K – 1), where the second subscript I identifies the number of levels being compared and the third subscript refers to the number of degrees of freedom for error.

35 35 Multiple Comparisons 2. Compute, where JK is the number of observations averaged to obtain each of the x i   ’s compared in Step 3. 3. Order the x i   ’s from smallest to largest and, as before, underscore all pairs that differ by less than w. Pairs not underscored correspond to significantly different levels of factor A. To identify different levels of factor B when H 0B is rejected, replace the second subscript in Q by J, replace JK by IK in w, and replace x i  by x  j .

36 36 Example 8 Example 7… continued I = J = 3 for both factor A (grade) and factor B (aggregate content). With  =.05 and error df = IJ(K – 1) = 9, Q.05,3,9 = 3.95. The yardstick for identifying significant differences is then The grade sample means in increasing order are.8033,.8180, and.8297.

37 37 Example 8 Only the difference between the two largest means is smaller than w. This gives the underscoring pattern PG70PG58PG64 Grades PG58 and PG64 do not appear to differ significantly from one another in effect on true average conductivity, but both differ from the PG70 grade. cont’d

38 38 Example 8 The ordered means for factor B are.7883,.8227, and.8400. All three pairs of means differ by more than.00139, so there are no underscoring lines. True average conductivity appears to be different for all three levels of aggregate content. cont’d

39 39 Models with Mixed and Random Effects

40 40 Models with Mixed and Random Effects In some problems, the levels of either factor may have been chosen from a large population of possible levels, so that the effects contributed by the factor are random rather than fixed. As in earlier section, if both factors contribute random effects, the model is referred to as a random effects model, whereas if one factor is fixed and the other is random, a mixed effects model results.

41 41 Models with Mixed and Random Effects We will now consider the analysis for a mixed effects model in which factor A (rows) is the fixed factor and factor B (columns) is the random factor. Definition The mixed effects model when factor A is fixed and factor B is random is X ijk =  +  i + B j + G ij + ε ijk i = 1, …, I, j = 1, …, J, k = 1, …, K

42 42 Models with Mixed and Random Effects Here  and  i ’s are constants with  i = 0, and the B j ’s, G ij ’s, and ε ijk ’s are independent, normally distributed random variables with expected value 0 and variances and respectively. The relevant hypotheses here are somewhat different from those for the fixed effects model. H 0A :  1 =  2 = … =  1 = 0 versus H aA : at least one  i ≠ 0

43 43 Models with Mixed and Random Effects It is customary to test H 0A and H 0B only if the no-interaction hypothesis H 0G cannot be rejected. The expected mean squares for the mixed model are E(MSE) =  2 E(MSB) =  2 + K  2 G + IK  2 B E(MSB) =  2 + K  2 G

44 44 Models with Mixed and Random Effects The ratio f AB = MSAB/MSE is again appropriate for testing the no-interaction hypothesis, with H 0G rejected if f AB  F ,(I – 1)(J – 1),IJ(K – 1). However, for testing H 0A versus H aA, the expected mean squares suggest that although the numerator of the F ratio should still be MSA, the denominator should be MSAB rather than MSE. MSAB is also the denominator of the F ratio for testing H 0B.

45 45 Models with Mixed and Random Effects For testing H 0A versus H aA (A fixed, B random), the test statistic value is f A = MSA/MSAB, and the rejection region is f B  F , J – 1(I – 1)(J – 1).

46 46 Example 9 A process engineer has identified two potential causes of electric motor vibration, the material used for the motor casing (factor A) and the supply source of bearings used in the motor (factor B). The accompanying data on the amount of vibration (microns) resulted from an experiment in which motors with casings made of steel, aluminum, and plastic were constructed using bearings supplied by five randomly selected sources.

47 47 Example 9 Only the three casing materials used in the experiment are under consideration for use in production, so factor A is fixed. cont’d

48 48 Example 9 However, the five supply sources were randomly selected from a much larger population, so factor B is random. The relevant null hypotheses are H 0A :  1 =  2 =  3 = 0 H 0B :  2 B = 0 H 0AB :  2 G = 0 cont’d

49 49 Example 9 Minitab output appears in Figure 11.7. Output from Minitab’s balanced ANOVA option for the data of Example 9 Figure 11.7 cont’d

50 50 Example 9 The P-value column in the ANOVA table indicates that the latter two null hypotheses should be rejected at significance level.05. Different casing materials by themselves do not appear to affect vibration, but interaction between material and supplier is a significant source of variation in vibration. cont’d

51 51 Models with Mixed and Random Effects When at least two of the K ij ’s are unequal, the ANOVA computations are much more complex than for the case K ij = K. In addition, there is controversy as to which test procedures should be used.


Download ppt "Copyright © Cengage Learning. All rights reserved. 11 Multifactor Analysis of Variance."

Similar presentations


Ads by Google