Download presentation
Presentation is loading. Please wait.
Published byHarvey Baldwin Long Modified over 9 years ago
1
TOPIC 11 Analysis of Variance
2
Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand mean, ANOVA test of equality of population means
3
Road Map Factorial Design Decision Making One/Two SamplesAnalysis of Variance Completely Randomized Design χ 2 Tests Randomized Block Design
4
Completely Randomized Design In many situations, you need to examine difference among more than two groups (populations). The group involved can be classified according to factor level of interest (treatments). For example, a factor such as baking temperature may have several groups defined by numerical levels such as 300 o, 350 o, 400 o, 450 o and a factor such as preferred supplier for a certain manufacturer may have several groups defined by categorical levels such as Supplier 1, Supplier 2, Supplier 3, Supplier 4. When there is a single factor, the experimental design is called a completely randomized design.
5
One Factor Design Experiment Supplier 1Supplier 2Supplier 3Supplier 4 18.5 24.0 17.2 19.9 18.0 26.3 25.3 24.0 21.2 24.5 20.6 25.2 20.8 24.7 22.9 25.4 19.9 22.6 17.5 20.4 Sample Mean19.5224.2622.8421.16 Grand Mean21.945 Sample Standard Deviation2.691.922.132.98 Sample of tensile strength of synthetic fibers from four different suppliers Do the synthetic fibers from each of four suppliers have equal strength?
6
One Way ANOVA The ANOVA procedure used for the completely randomized design is referred to as the One-Way ANOVA It is the extension of the t-test for the difference between two means. Although ANOVA is the acronym for analysis of variance, the term is misleading because the objective is to analyze differences among the group means, not the variances. By analyzing the variation among and within the groups, you can make conclusions about possible differences in group means.
7
Partitioning the Total Variation In One Way ANOVA, the total variation is subdivided into two parts: Variation that is due to differences among the treatments Variation that is due to differences within the treatments The symbol k is used to indicate the number of treatments Total Variation (SST) d.f. = n - 1 Total Variation (SST) d.f. = n - 1 Treatment Variation (SSTr) d.f. = k - 1 Treatment Variation (SSTr) d.f. = k - 1 Random Error Variation (SSE) d.f. = n - k Random Error Variation (SSE) d.f. = n - k Partitioning the Total Variation: SST = SSTr + SSE
8
Hypothesis to be Tested Assumptions: k groups represent populations Its values are randomly and independently selected Following a normal distribution Having equal variances Refer back to the table of synthetic fibers from four suppliers. The null hypothesis of no differences in the population means is tested against the alternative that at least two of the k treatment means differ (or not all μ j are equal, where j = 1,2,…, k) At least one of the k treatment means differ
9
Sums of Squares Formula We divide the total variation into variation among the treatments and variation within the treatments. The total variation is presented by the sum of squares total (SST) where = Grand mean X Treat. 1Treat. 2Treat. 3 Response, X
10
Sums of Squares Formula The variation among the treatments is presented by the sum of squares treatments (SSTr) X X3X3 X2X2 X1X1 Treat. 1Treat. 2Treat. 3 Response, X
11
The within-group variation is given by the sum of squares within treatments (SSE) Sums of Squares Formula X2X2 X1X1 X3X3 Treat. 1Treat. 2Treat. 3 Response, X
12
where = sample mean of treatment j = i-th value of treatment j n j = sample size of treatment j n = total number of values in all treatments = n 1 + n 2 + … + n j Sums of Squares Formula
13
To convert the sums of squares to mean squares, we divide SSTr, SSE and SST by degrees of freedom. We have MSTr (mean square treatments), MSE (mean square error), and MST (mean squares total) Mean Squares Total degrees of freedom = (k - 1) + (n – k) = n - 1
14
Source of Variation Degrees of Freedom Sum of Squares Mean Square (Variance) F Treatmentk - 1SSTr MSTr = SSTr/(k - 1) MSTr MSE Errorn - kSSE MSE = SSE/(n - k) Totaln - 1 SST= SSTr+SSE One-Way ANOVA Summary Table
15
F Test for Differences Among More than Two Means MSA and MSE provide estimates of the overall variance in the data. To test the null hypothesis: At least one of the k treatment means differ against you compute the One-Way ANOVA F test statistic, which is given by Rejection Region Critical Value F α from F-distribution with (k-1) numerator and (n-k) denominator degrees of freedom Reject H 0 if F > F α, Otherwise, do not reject
16
As production manager, you want to see if three filling machines have different mean filling times. You assign 15 similarly trained and experienced workers, 5 per machine, to the machines. At the.05 level of significance, is there a difference in mean filling times? Mach1 Mach2 Mach3 25.4023.4020.00 26.3121.8022.20 24.1023.5019.75 23.7422.7520.60 25.1021.6020.40 Mach1 Mach2 Mach3 25.4023.4020.00 26.3121.8022.20 24.1023.5019.75 23.7422.7520.60 25.1021.6020.40 One-Way ANOVA Example
17
Treatment k – 1 = 3 - 1 = 2 SSTr = 47.1640 MSTr = 23.5820 MSTr MSE = 25.60 Error n – k = 15 - 3 = 12 SSE = 11.0532 MSE =.9211 Total n – 1 = 15 - 1 = 14 SST = 58.2172 Source of Variation Degrees of Freedom Sum of Squares Mean Square (Variance) F Example Solution
18
F 0F α = 3.89 H 0 : 1 = 2 = 3 H a : Not All Equal =.05 1 = 2 2 = 12 Critical Value(s): Test Statistic: Decision: Conclusion: Reject H 0 at =.05 There is evidence population means are different =.05 F MSTr MSE 235820 9211 25.6.. Example Solution
19
You’re a trainer for Microsoft Corp. Is there a difference in mean learning times of 12 people using 4 different training methods ( =.05)? M1M2M3M4 10111318 916823 59925 Use the following values. ExerciseExercise SSTr = 348 SSE = 80
20
Factorial Design Decision Making One/Two SamplesAnalysis of Variance Completely Randomized Design χ 2 Tests Road Map Randomized Block Design
21
The Randomized Block Design A method to analyze more than two treatments using repeated measures or matched samples (related population) The items or individuals that have been matched (or from repeated measurements) are called blocks. Experimental situations that used blocks are called randomized block design. The blocks remove as much variability as possible from the random error so that the differences among the treatments are more evident.
22
The Randomized Block Design BRAND Golfer Brand ABCD Hit 3 Hit 1 Hit 4 Hit 2 Hit 2 Hit 4 Hit 3 Hit 1 Hit 4 Hit 3 Hit 1 Hit 2 1 2 10 Randomized Block Design Completely Randomized Design Blocks
23
Partitioning the Total Variation Then we need to break the within treatment variation into variation due to differences among the blocks (SSB) and variation due to random error (SSE) Total Variation (SST) d.f. = n - 1 Total Variation (SST) d.f. = n - 1 Among-Treatment Variation (SSTr) d.f. = k - 1 Among-Treatment Variation (SSTr) d.f. = k - 1 Within-Treatment Variation (SSE) d.f. = n - k Within-Treatment Variation (SSE) d.f. = n - k Among-Block Variation (SSB) d.f. = b - 1 Among-Block Variation (SSB) d.f. = b - 1 Random-Error Variation (SSE) d.f. = (b - 1)(k - 1) Random-Error Variation (SSE) d.f. = (b - 1)(k - 1) Partitioning the Total Variation: SST = SSTr + SSB + SSE
24
Sums of Squares Formula Total variation in randomized block design where = Grand mean Among treatment variation in randomized block design where
25
Among block variation in randomized block design Random error in randomized block design Sums of Squares Formula where
26
You divide each of the sums of squares by its associated degrees of freedom, The Mean Squares The Mean Squares
27
The null hypothesis Randomized Block F Tests is tested against F test statistic You reject the null hypothesis at the α level if F α from F distribution with (k-1) numerator and (k- 1) (b-1) denominator degrees of freedom At least one of the k treatment means differ
28
The null hypothesis F Tests for Block Effects is tested against F test statistic You reject the null hypothesis at the α level if F α from F distribution with (b-1) numerator and (k- 1) (b-1) denominator degrees of freedom At least one of the b block means differ
29
A production manager wants to see if three assembly methods have different mean assembly times (in minutes). Five employees were selected at random and assigned to use each assembly method. At the.05 level of significance, is there a difference in mean assembly times? Employee Method 1 Method 2 Method 3 15.43.64.0 24.13.82.9 36.15.64.3 43.62.32.6 55.34.73.4 Randomized Block Design Example
30
Treatment (Methods) 3 - 1 = 2SSTr= 5.43 MSTr = 2.71 MSTr MSE = 12.9 Error 15 - 3 - 5 + 1 = 8 SSE = 1.68 MSE =.21 Total 15 - 1 = 14 SST = 17.8 Source of Variation Degrees of Freedom Sum of Squares Mean Square (Variance) F Block (Employee) 5 - 1 = 4SSBL= 10.69 MSB = 2.67 MSB MSE = 12.7 Example Solution
31
F 0F α = 4.46 H 0 : 1 = 2 = 3 H a : Not all equal =.05 1 = 2 2 = 8 Critical Value(s): Test Statistic: Decision: Conclusion: Reject H 0 at =.05 There is evidence population means are different =.05 FTFT MSTr MSE 2.71.21 12.9 Example Solution
32
A fast-food chain wants to evaluate the service at four restaurant. The customer service director for the cahin hires six investigators with varied experiences in food service to act as raters. To reduce the effect the variability from rater to rater, you use a randomized block design with raters serving as the blocks. The four restaurants are the groups of interest. The six raters evaluate at each of the four restaurants in a random order. A rating scale from 0 (low) to 100 (high) is used. ExerciseExercise Use the 0.05 level of significance to test for differences among the restaurants. Check also the effectiveness of blocking.
33
Road Map Decision Making One/Two SamplesAnalysis of Variance Completely Randomized Design χ 2 Tests Factorial Design
34
The Factorial Design When there are two factors simultaneously evaluated, the experimental design is called a two factor factorial design (or just, factorial design) We can explore interaction between variables Data from a two-factor factorial design are analyzed using Two-Way ANOVA (or two-way table) Let the two factors be Factor A and Factor B We are going to only deal the equal number of replicates for each combination of the level of factor A with those of factor B
35
Example of Two Factors Design Tensile strength of parachutes woven by two types of looms, using synthetic fibers from four suppliers Loom (Factor A) Supplier (Factor B) 1234 Jetta 20.6 18.0 19.0 21.3 13.2 22.6 24.6 19.6 23.8 27.1 27.7 18.6 20.8 25.1 17.7 21.5 20.0 21.1 23.9 16.0 Turk 18.5 24. 17.2 19.9 18.0 26.3 25.3 24.0 21.2 24.5 20.6 25.2 20.8 24.7 22.9 25.4 19.9 22.6 17.5 20.4 We want to evaluate the different suppliers but also to determine whether parachutes woven on the Jetta looms are as strong as those woven on Turk looms.
36
Two Way ANOVA Procedure The following definitions are needed to develop two-way ANOVA where = grand mean = mean of the i-th level of factor A (where i = 1,2, …, a) = mean of the j-th level of factor B (where j = 1,2, …, b) = mean of the cell ij, the combination of the i-th level of factor A and the j-th level of factor B = number of levels of factor A and B, respectively = number of replicates for each cell (combination of a particular level of factor A and that of factor B)
37
Main and Interaction Effects No A effect; B main effect 1 2 3 Mean response Level of factor A Level 1, factor B Level 2, factor B 1 2 3 Mean response Level of factor A Level 1, factor B Level 2, factor B A main effect; insignificant B effect A and B main effects, no interaction 1 2 3 Mean response Level of factor A Level 1, factor B Level 2, factor B A and B interact 1 2 3 Mean response Level of factor A Level 1, factor B Level 2, factor B
38
Partitioning the Total Variation Then we need to break the group variability into three components plus one random variation or error Total Sum of Squares (SST) d.f. = n - 1 Total Sum of Squares (SST) d.f. = n - 1 Interaction Sum of Squares Factors A and B (SSI) d.f. = (a – 1) (b – 1) Interaction Sum of Squares Factors A and B (SSI) d.f. = (a – 1) (b – 1) Sum of Squares Random Error (SSE) d.f. = n - ab Sum of Squares Random Error (SSE) d.f. = n - ab Main Effect Sum of Squares Factor A (SSA) d.f. = a - 1 Main Effect Sum of Squares Factor A (SSA) d.f. = a - 1 Main Effect Sum of Squares Factor B (SSB) d.f. = b - 1 Main Effect Sum of Squares Factor B (SSB) d.f. = b - 1 Partitioning the Total Variation: SST = SSA + SSB + SSI + SSE
39
Sum of Squares Formula The computation for total variation: Factor A variation: Factor B variation:
40
Sum of Squares Formula Interaction variation: Random Error:
41
The Mean Squares If you divide each of the sums of squares by its associated degrees of freedom, you have the four variances or mean square terms.
42
There are three distinct tests to perform F Test in Two-Way ANOVA 1)Test for Main Effect of Factor A F test statistic You reject the null hypothesis at the α level if F α from F distribution with (a-1) numerator and (n- ab) denominator degrees of freedom
43
F Test in Two-Way ANOVA 2)Test for Main Effect of Factor B F test statistic You reject the null hypothesis at the α level if F α from F distribution with (b-1) numerator and (n- ab) denominator degrees of freedom
44
F Test in Two-Way ANOVA 3)Test for Factor Interaction F test statistic You reject the null hypothesis at the α level if F α from F distribution with (a-1)(b-1) numerator and (n-ab) denominator degrees of freedom
45
Human Resources wants to determine if training time is different based on motivation level and training method. Conduct the appropriate ANOVA tests. Use α =.05 for each test (Interaction, Motivation and Training Method). Training Method (Factor B) Factor Levels Self– paced ClassroomComputer 15 hr.10 hr.22 hr. Motivation (Factor A) High 11 hr.12 hr.17 hr. 27 hr.15 hr. 31 hr. Low 29 hr. 17 hr. 49 hr. Factorial Design Example
46
Source of Variation Degrees of Freedom Sum of Squares Mean Square F A (Row) 1546.75 B (Column) 2531.5265.75 AB (Interaction) 2123.561.76 Error 6188.531.42 Total11SST Same as other designs 17.40 8.46 1.97 Example Solution
47
H 0 : The factors do not interact H a : The factors interact =.05 1 = 2 2 = 6 Critical Value(s): Test Statistic: Decision: Conclusion: F 0F α = 5.14 =.05 Do not reject at =.05 There is no evidence the factors interact Example Solution
48
H 0 : H a : = 1 = 2 = Critical Value(s): Test Statistic: Decision: Conclusion: F 0F α = 5.99 =.05 No difference between motivation levels Motivation levels differ.05 1 6 Reject at =.05 There is evidence motivation levels differ Example Solution
49
H 0 : H a : = 1 = 2 = Critical Value(s): Test Statistic: Decision: Conclusion: F 0F α = 5.14 =.05 No difference between training methods Training methods differ.05 2 6 Reject at =.05 There is evidence training methods differ Example Solution
50
ExerciseExercise Tensile strength of parachutes woven by two types of looms, using synthetic fibers from four suppliers Loom (Factor A) Supplier (Factor B) 1234 Jetta 20.6 18.0 19.0 21.3 13.2 22.6 24.6 19.6 23.8 27.1 27.7 18.6 20.8 25.1 17.7 21.5 20.0 21.1 23.9 16.0 Turk 18.5 24. 17.2 19.9 18.0 26.3 25.3 24.0 21.2 24.5 20.6 25.2 20.8 24.7 22.9 25.4 19.9 22.6 17.5 20.4 Using 0.05 level of significance, determine whether there is evidence of an interaction between the loom and the supplier, a difference between the two looms, and a difference among the suppliers.
51
ExerciseExercise The sums of squares are already given,
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.