Download presentation
Presentation is loading. Please wait.
Published byPoul Paulsen Modified over 6 years ago
1
5-5 Inference on the Ratio of Variances of Two Normal Populations
5-5.1 Hypothesis Testing on the ratio of Two Variances We wish to test the hypotheses: The development of a test procedure for these hypotheses requires a new probability distribution, the F distribution.
2
5-5 Inference on the Ratio of Variances of Two Normal Populations
The F Distribution
3
5-5 Inference on the Ratio of Variances of Two Normal Populations
5-5.1 The F Distribution
4
5-5 Inference on the Ratio of Variances of Two Normal Populations
The Test Procedure
5
5-5 Inference on the Ratio of Variances of Two Normal Populations
The Test Procedure
6
5-5 Inference on the Ratio of Variances of Two Normal Populations
The Test Procedure
7
5-5 Inference on the Ratio of Variances
of Two Normal Populations
8
5-5 Inference on the Ratio of Variances of Two Normal Populations
5-5.2 Confidence Interval on the Ratio of Two Variances
9
5-5 Inference on the Ratio of Variances
of Two Normal Populations
10
5-5 Inference on the Ratio of Variances
of Two Normal Populations
11
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance (Replicates) Treatments
12
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance The levels of the factor are sometimes called treatments. Each treatment has six observations or replicates. The runs are run in random order.
13
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
14
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance Table 5-6. Typical Data for a Single-Factor Experiment Treatment Observations Totals Averages 1 y11 y12 β¦ y1n y1. π¦ 1. 2 y21 y22 y2n y2. π¦ 2. . a ya1 ya2 yan ya. π¦ a. y.. π¦ .. π¦ π. = π=1 π π¦ ππ π¦ π. = π¦ π. π π=1, 2, β¦, π π¦ .. = π=1 π π=1 π π¦ ππ π¦ .. = π¦ .. π
15
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance Linear Statistical Model π ππ = ΞΌ π + π ππ π=1 π π π =0
16
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
17
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
18
5-8 What If We Have More Than
Two Samples?
19
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
20
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
21
5-8 What If We Have More Than
Two Samples?
22
5-8 What If We Have More Than Two Samples?
5-8.1 Completely Randomized Experiment and Analysis of Variance
23
5-8 What If We Have More Than Two Samples?
Which means differ?
24
5-8 What If We Have More Than Two Samples?
Multiple Comparisons If we wish to make all pairwise comparisons π π β π π we need to worry about the probability of a type I error on each test and also the probability of making at least one type I error on all the tests. This latter error is referred to as the experiment-wise error rate or the overall error rate. A number of procedures are available to protect the overall error rate. They all compare the differences π π β π π to a cut-off value. The cut-off values all have the form (π‘πππππ π£πππ’π)Γ πππΈΓ( 1 π π π π )
25
Steps in Multiple Comparisons
5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Calculate π π β π π . Determine cut-off. Any difference in 1) larger than the cut-off have corresponding means that are significantly different from one another. Fisherβs Least Significant Difference (LSD). This does not protect the overall error rate. The overall error rate will be approximately 1β (1βπΌ) π , where c is the number of comparisons being made. πΏππ·= π‘ πΌ 2 , πβπ πππΈΓ( 1 π π π π ) If all π π =π then πΏππ·= π‘ πΌ 2 πππΈΓ 2 π
26
Steps in Multiple Comparisons
5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Tukeyβs Highest Significance Difference (HSD). This controls overall error rate. The overall error rate will be a. π»ππ·= π πβπ, π πππΈ ( 1 π π π π )/2 If all π π =π then HSD= π πΌ, π πππΈ( 1 π ) An alternative in unbalanced designs is HSD= π πβπ, π πππΈ( 1 minβ‘( π π ) ) With Tukeyβs you need both the degrees of freedom (n β r) and also the number of means being compared (r).
27
Steps in Multiple Comparisons
5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Studentβs, Neumanβs, Keuls (SNK). This overall error rate is between that of Fisherβs and Tukeyβs. SNK= π πβπ, π πππΈ ( 1 π π π π )/2 d is the number of means apart the two means being compared are in the ordered list of means (d between 2 and r). Notice, more than one cut-off is needed for the SNK. The SNK is done sequentially, starting with the means the farthest apart. Once a pair of means is found to be not significantly different, all pairs closer together are deemed not significantly different.
28
5-8 What If We Have More Than
Two Samples?
29
5-8 What If We Have More Than Two Samples?
πΏππ·= π‘ πΌ 2 , πβπ πππΈΓ( 1 π π π π ) = π‘ 0.025, (24β4) Γ =(2.086)(1.473) = 3.073 π»ππ·= π πβπ, π πππΈ ( 1 π π π π )/2 = π 20, ( )/2 =(3.96)(1.042) = 4.125 SππΎ= π πβπ, π πππΈ ( 1 π π π π )/2 = π 20, ( )/2 =(2.95)(1.042) = 3.074 π 20, 2 =2.95 π 20, 3 =3.58 π 20, 4 =3.96
30
5-8 What If We Have More Than Two Samples?
Cut-off value for Fisherβs LSD = 3.073 Cut-off value for Tukeyβs HSD = 4.125 Cut-off value for SNK = (d=2) 3.731 (d=3) 4.125 (d=4) 20 15 10 5 22.2 17 15.7 12.2* 7* 5.7* - 6.5* 1.3 5.2* * Significantly different.
31
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
32
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
33
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
34
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
35
5-8 What If We Have More Than Two Samples?
proc format; value hc 1=' 5%' 2='10%' 3='15%' 4='20%'; DATA ex514; INPUT hc strength format hc hc.; CARDS; ods graphics on; proc anova data=ex514; class hc; model strength= hc; means hc/lsd snk tukey; TITLE 'proc anova balanced 1-way anova'; proc glm data=ex514 plots = diagnostics; TITLE 'proc glm 1-way anova'; RUN; ods graphics off; QUIT;
36
5-8 What If We Have More Than
Two Samples?
37
5-8 What If We Have More Than
Two Samples?
38
5-8 What If We Have More Than
Two Samples?
39
5-8 What If We Have More Than
Two Samples?
40
5-8 What If We Have More Than
Two Samples?
41
5-8 What If We Have More Than
Two Samples?
42
5-8 What If We Have More Than
Two Samples?
43
5-8 What If We Have More Than
Two Samples?
44
5-8 What If We Have More Than
Two Samples?
45
5-8 What If We Have More Than
Two Samples?
46
5-8 What If We Have More Than
Two Samples?
47
Kruskal-Wallis Nonparametric Test
5-8 What If We Have More Than Two Samples? Kruskal-Wallis Nonparametric Test H0: All k populations have the same distribution H1: Not all populations have the same distribution. Test Statistic: π»= 12 π(π+1) π=1 π π
π 2 π π β3(π+1) Here the Ri are the sum of the ranks for each population. The ranks are found by ranking the data in all populations combined. The n is the total sample size and ni is the sample size of the sample from the ith population. Rejection Region: π»> Ο 2 πΌ;πβ1 Paired comparison of population i to j: π·=| π
π β π
π |> Ο 2 πΌ;πβ1 π(π+1) π π π π
48
5-8 What If We Have More Than Two Samples?
Example An article in Fortune compared rent in five American cities: New York, Chicago, Detroit, Tampa, and Orlando. The following data are small random samples of rents (in dollars) in the five cities. The New York data are Manhattan only. Conduct the Kruskal-Wallis test to determine whether evidence exists that there are significant differences in the rents in these cities. If differences exit, where are they? OPTIONS NOOVP NODATE NONUMBER LS=80; proc format; value city 1=' New York' 2='Chicago' 3='Detroit' 4='Tampa' 5='Orlando'; DATA rent; INPUT city rent format city city.; CARDS; ods graphics on; proc npar1way data=rent wilcoxon; class city; var rent; TITLE 'Kruskal-Wallis Test'; PROC RANK DATA=RENT OUT=RRENT; VAR RENT; PROC GLM DATA=RRENT; CLASS CITY; MODEL RENT = CITY; LSMEANS CITY/ CL PDIFF=ALL; RUN; ods graphics off; QUIT;
49
5-8 What If We Have More Than
Two Samples?
50
5-8 What If We Have More Than
Two Samples?
51
5-8 What If We Have More Than
Two Samples?
52
5-8 What If We Have More Than
Two Samples?
53
5-8 What If We Have More Than
Two Samples?
54
5-8 What If We Have More Than
Two Samples?
55
5-8 What If We Have More Than
Two Samples?
56
5-8 What If We Have More Than
Two Samples?
57
5-8 What If We Have More Than Two Samples?
Cutoff for pairwise comparisons Tampa vs. Orlando Ο πΌ;πβ1 2 π(π+1) π π π π = Ο ;4 36(36+1) = 18.06 Chicago vs. NY Ο πΌ;πβ1 2 π(π+1) π π π π = Ο ;4 36(36+1) = 16.80 NYw 8 Chicago Detroit Tampa Orlando 32.6 24 16.9 8.8 8.2 24.4* 15.8 8.7 0.6 - 23.8* 15.2 8.1 15.7 7.1 8.6 * Significantly different.
58
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The randomized block design is an extension of the paired t-test to situations where the factor of interest has more than two levels.
59
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment For example, consider the situation where two different methods were used to predict the shear strength of steel plate girders. Say we use four girders as the experimental units.
60
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment
61
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The appropriate linear statistical model: We assume treatments and blocks are initially fixed effects blocks do not interact
62
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The hypotheses of interest are:
63
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment
64
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The mean squares are:
65
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The expected values of these mean squares are:
66
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment
67
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment
68
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment The model for the randomized block design is the same as the two-way factorial where the interaction terms are assumed to be zero. The main question of interest is whether the means are the same for the different treatment levels. The blocks are used to explain some of the variability and many times to simplify the mechanics of collecting the data. For instance if the blocks are cities, it is much easier to collect data in one city at a time. The SSTR is the same as SSA in the two-way factorial, SSBL is the same as the SSB in the two-way factorial. Notice SSE in the corresponding two-way factorial model has zero degrees of freedom when you have only one observation per cell. The two-way factorial with only one observation is analyzed the same as the randomized block, and also assumes no interaction between the two factors.
69
5-8 What If We Have More Than
Two Samples?
70
5-8 What If We Have More Than
Two Samples?
71
5-8 What If We Have More Than
Two Samples?
72
5-8 What If We Have More Than
Two Samples?
73
5-8 What If We Have More Than Two Samples?
Which means differ?
74
5-8 What If We Have More Than Two Samples?
5-8.2 Randomized Complete Block Experiment
75
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
76
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
77
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
78
5-8 What If We Have More Than Two Samples?
Residual Analysis and Model Checking
79
5-8 What If We Have More Than Two Samples?
Example 5-15 OPTIONS NOOVP NODATE NONUMBER LS=80; DATA ex515; DO type=1 to 4; DO fabsam=1 TO 5; INPUT strength OUTPUT; END; END; CARDS; PROC GLM DATA=ex515; CLASS type fabsam; MODEL strength= type fabsam; MEANS TYPE/SNK; output out=new p=phat r=resid; TITLE 'Randomized Block Design FabSample: Block, ChemType: Treatment'; proc boxplot; plot strength*type; proc plot data=new; plot resid*phat; /* Residual Plot */ plot resid*type; /* Residual by Chemical Type */ plot resid*fabsam; /* Residuals by block */ proc anova data=ex515; class type; model strength=type; means type/snk; TITLE 'one-way anova'; run; quit;
80
5-8 What If We Have More Than
Two Samples?
81
5-8 What If We Have More Than
Two Samples?
82
5-8 What If We Have More Than
Two Samples?
83
5-8 What If We Have More Than
Two Samples?
84
5-8 What If We Have More Than
Two Samples?
85
5-8 What If We Have More Than
Two Samples?
86
5-8 What If We Have More Than
Two Samples?
87
5-8 What If We Have More Than
Two Samples?
88
5-8 What If We Have More Than
Two Samples?
89
5-8 What If We Have More Than
Two Samples?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.