Factorial Experiments Analysis of Variance (ANOVA) Experimental Design.

Slides:



Advertisements
Similar presentations
Latin Square Designs. Selected Latin Squares 3 x 34 x 4 A B CA B C DA B C DA B C DA B C D B C AB A D CB C D AB D A CB A D C C A BC D B AC D A BC A D BC.
Advertisements

Multiple Comparisons in Factorial Experiments
Incomplete Block Designs. Randomized Block Design We want to compare t treatments Group the N = bt experimental units into b homogeneous blocks of size.
ANOVA TABLE Factorial Experiment Completely Randomized Design.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Comparing k Populations Means – One way Analysis of Variance (ANOVA)
Design of Engineering Experiments - Experiments with Random Factors
1 Multifactor ANOVA. 2 What We Will Learn Two-factor ANOVA K ij =1 Two-factor ANOVA K ij =1 –Interaction –Tukey’s with multiple comparisons –Concept of.
Experimental Design Terminology  An Experimental Unit is the entity on which measurement or an observation is made. For example, subjects are experimental.
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Statistics Design of Experiment.
Factorial Experiments
Latin Square Designs. Selected Latin Squares 3 x 34 x 4 A B CA B C DA B C DA B C DA B C D B C AB A D CB C D AB D A CB A D C C A BC D B AC D A BC A D BC.
The Randomized Block Design. Suppose a researcher is interested in how several treatments affect a continuous response variable (Y). The treatments may.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Repeated Measures Designs. In a Repeated Measures Design We have experimental units that may be grouped according to one or several factors (the grouping.
The Randomized Block Design. Suppose a researcher is interested in how several treatments affect a continuous response variable (Y). The treatments may.
MANOVA Multivariate Analysis of Variance. One way Analysis of Variance (ANOVA) Comparing k Populations.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
MANOVA Multivariate Analysis of Variance. One way Analysis of Variance (ANOVA) Comparing k Populations.
Randomized Block Design Blocks All treats appear once in each block.
Repeated Measures Designs. In a Repeated Measures Design We have experimental units that may be grouped according to one or several factors (the grouping.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
DOX 6E Montgomery1 Design of Engineering Experiments Part 9 – Experiments with Random Factors Text reference, Chapter 13, Pg. 484 Previous chapters have.
Design Of Experiments With Several Factors
Repeated Measures Designs. In a Repeated Measures Design We have experimental units that may be grouped according to one or several factors (the grouping.
1 The Two-Factor Mixed Model Two factors, factorial experiment, factor A fixed, factor B random (Section 13-3, pg. 495) The model parameters are NID random.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Latin Square Designs KNNL – Sections Description Experiment with r treatments, and 2 blocking factors: rows (r levels) and columns (r levels)
Comparing k Populations Means – One way Analysis of Variance (ANOVA)
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 18 Random Effects.
Chapter 13 Design of Experiments. Introduction “Listening” or passive statistical tools: control charts. “Conversational” or active tools: Experimental.
Orthogonal Linear Contrasts A technique for partitioning ANOVA sum of squares into individual degrees of freedom.
Multivariate Analysis of Variance
Chapter 4 Analysis of Variance
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Other experimental designs Randomized Block design Repeated Measures designs.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
ANOVA TABLE Factorial Experiment Completely Randomized Design.
ANOVA Overview of Major Designs. Between or Within Subjects Between-subjects (completely randomized) designs –Subjects are nested within treatment conditions.
The p-value approach to Hypothesis Testing
1 1 Slide Slides by JOHN LOUCKS St. Edward’s University.
Factorial Experiments Analysis of Variance Experimental Design.
The Analysis of Covariance ANACOVA. Multiple Regression 1.Dependent variable Y (continuous) 2.Continuous independent variables X 1, X 2, …, X p The continuous.
1 Topic 14 – Experimental Design Crossover Nested Factors Repeated Measures.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
Two-Factor Study with Random Effects In some experiments the levels of both factors A & B are chosen at random from a larger set of possible factor levels.
Factorial BG ANOVA Psy 420 Ainsworth. Topics in Factorial Designs Factorial? Crossing and Nesting Assumptions Analysis Traditional and Regression Approaches.
Experimental Designs The objective of Experimental design is to reduce the magnitude of random error resulting in more powerful tests to detect experimental.
Comparing k Populations Means – One way Analysis of Variance (ANOVA)
Comparing Three or More Means
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Statistics for Business and Economics (13e)
Comparing k Populations
Econ 3790: Business and Economic Statistics
Comparing k Populations
Statistics for the Social Sciences
Chapter 11 Analysis of Variance
Latin Square Designs KNNL – Sections
Factorial Experiments
Latin Square Designs.
Chapter 10 – Part II Analysis of Variance
One way Analysis of Variance (ANOVA)
Presentation transcript:

Factorial Experiments Analysis of Variance (ANOVA) Experimental Design

Dependent variable Y k Categorical independent variables A, B, C, … (the Factors) Let –a = the number of categories (levels) of A –b = the number of categories (levels) of B –c = the number of categories (levels) of C –etc.

Random Effects and Fixed Effects Factors

A factor is called a fixed effects factors if the levels of the factor are a fixed set of levels and the conclusions of any analysis is in relationship to these levels. If the levels have been selected at random from a population of levels the factor is called a random effects factor The conclusions of the analysis will be directed at the population of levels and not only the levels selected for the experiment

Example - Random Effects In this Example a Taxi company is interested in comparing the effects of three brands of tires (A, B and C) on mileage (mpg). Mileage will also be effected by driver. The company selects b = 4 drivers at random from its collection of drivers. Each driver has n = 3 opportunities to use each brand of tire in which mileage is measured. Dependent –Mileage Independent –Tire brand (A, B, C), Fixed Effect Factor –Driver (1, 2, 3, 4), Random Effects factor

Comments The ANOVA Table will be the same for performing tests with respect to Source, SS, df and MS. The differences will occur in the denominator of the F – ratios. The denominators of the F ratios are determined by evaluating Expected Mean Squares for each effect.

Example: 3 factors A, B and C fixed SourceEMSF A B C AB AC BC ABC Error

Example: 3 factors A, B, C – all are random effects SourceEMSF A B C AB AC BC ABC Error

Example: 3 factors A fixed, B, C random SourceEMSF A B C AB AC BC ABC Error

Example: 3 factors A, B fixed, C random SourceEMSF A B C AB AC BC ABC Error

Rules for determining Expected Mean Squares (EMS) in an Anova Table 1.Schultz E. F., Jr. “Rules of Thumb for Determining Expectations of Mean Squares in Analysis of Variance,”Biometrics, Vol 11, 1955, Both fixed and random effects Formulated by Schultz [1]

1.The EMS for Error is  2. 2.The EMS for each ANOVA term contains two or more terms the first of which is  2. 3.All other terms in each EMS contain both coefficients and subscripts (the total number of letters being one more than the number of factors) (if number of factors is k = 3, then the number of letters is 4) 4.The subscript of  2 in the last term of each EMS is the same as the treatment designation.

5.The subscripts of all  2 other than the first contain the treatment designation. These are written with the combination involving the most letters written first and ending with the treatment designation. 6.When a capital letter is omitted from a subscript, the corresponding small letter appears in the coefficient. 7.For each EMS in the table ignore the letter or letters that designate the effect. If any of the remaining letters designate a fixed effect, delete that term from the EMS.

8.Replace  2 whose subscripts are composed entirely of fixed effects by the appropriate sum.

Example - Random Effects In this Example a Taxi company is interested in comparing the effects of three brands of tires (A, B and C) on mileage (mpg). Mileage will also be effected by driver. The company selects at random b = 4 drivers at random from its collection of drivers. Each driver has n = 3 opportunities to use each brand of tire in which mileage is measured. Dependent –Mileage Independent –Tire brand (A, B, C), Fixed Effect Factor –Driver (1, 2, 3, 4), Random Effects factor

The Data

Asking SPSS to perform Univariate ANOVA

Select the dependent variable, fixed factors, random factors

The Output The divisor for both the fixed and the random main effect is MS AB This is contrary to the advice of some texts

The Anova table for the two factor model (A – fixed, B - random) SourceSSdfMSEMSF ASS A a -1MS A MS A /MS AB BSS A b - 1MS B MS B /MS Error ABSS AB (a -1)(b -1)MS AB MS AB /MS Error ErrorSS Error ab(n – 1)MS Error Note: The divisor for testing the main effects of A is no longer MS Error but MS AB. References Guenther, W. C. “Analysis of Variance” Prentice Hall, 1964

The Anova table for the two factor model (A – fixed, B - random) SourceSSdfMSEMSF ASS A a -1MS A MS A /MS AB BSS A b - 1MS B MS B /MS AB ABSS AB (a -1)(b -1)MS AB MS AB /MS Error ErrorSS Error ab(n – 1)MS Error Note: In this case the divisor for testing the main effects of A is MS AB. This is the approach used by SPSS. References Searle “Linear Models” John Wiley, 1964

Crossed and Nested Factors

The factors A, B are called crossed if every level of A appears with every level of B in the treatment combinations. Levels of B Levels of A

Factor B is said to be nested within factor A if the levels of B differ for each level of A. Levels of B Levels of A

Example: A company has a = 4 plants for producing paper. Each plant has 6 machines for producing the paper. The company is interested in how paper strength (Y) differs from plant to plant and from machine to machine within plant Plants Machines

Machines (B) are nested within plants (A) The model for a two factor experiment with B nested within A.

The ANOVA table SourceSSdfMSFp - value ASS A a - 1MS A MS A /MS Error B(A)B(A)SS B(A) a(b – 1)MS B(A) MS B(A) /MS Error ErrorSS Error ab(n – 1)MS Error Note: SS B(A ) = SS B + SS AB and a(b – 1) = (b – 1) + (a - 1)(b – 1)

Example: A company has a = 4 plants for producing paper. Each plant has 6 machines for producing the paper. The company is interested in how paper strength (Y) differs from plant to plant and from machine to machine within plant. Also we have n = 5 measurements of paper strength for each of the 24 machines

The Data

Anova Table Treating Factors (Plant, Machine) as crossed

Anova Table: Two factor experiment B(machine) nested in A (plant)

ANOVA Table for 3 factors crossed EffectSSdf ASS A (a – 1) BSS B (b – 1) CSS C (c – 1) ABSS AB (a – 1) (b – 1) ACSS AC (a – 1) (c – 1) BCSS BC (b – 1) (c – 1) ABCSS ABC (a – 1) (b – 1) (c – 1) ErrorSS Error abc(n – 1)

ANOVA Table for 3 nested factors B nested in A, C nested in B EffectSSdf ASS A (a – 1) B(A)B(A)SS B(A) a(b – 1) C(AB)SS C(AB) ab(c – 1) ErrorSS Error abc(n – 1) Note: SS B(A) = SS B + SS AB and a(b – 1) = (b – 1) + (a – 1)(b –1) Also SS C(AB) = SS C + SS AC + SS BC + SS ABC and ab(c – 1) = (c – 1) + (a – 1)(c –1) + (b – 1)(c –1) + (a – 1)(b –1)(c –1)

Also in nested designs Factors may be fixed effect factors or random effect factors Levels of the factor are a fixed set of levels Levels of the factor are chosen at random from a population of levels This effects the divisor in the F ratio for testing the effect

Other experimental designs Randomized Block design Latin Square design Repeated Measures design

The Randomized Block Design

Suppose a researcher is interested in how several treatments affect a continuous response variable (Y). The treatments may be the levels of a single factor or they may be the combinations of levels of several factors. Suppose we have available to us a total of N = nt experimental units to which we are going to apply the different treatments.

The Completely Randomized (CR) design randomly divides the experimental units into t groups of size n and randomly assigns a treatment to each group.

The Randomized Block Design divides the group of experimental units into n homogeneous groups of size t. These homogeneous groups are called blocks. The treatments are then randomly assigned to the experimental units in each block - one treatment to a unit in each block.

Example 1: Suppose we are interested in how weight gain (Y) in rats is affected by Source of protein (Beef, Cereal, and Pork) and by Level of Protein (High or Low). There are a total of t = 3  2 = 6 treatment combinations of the two factors (Beef -High Protein, Cereal-High Protein, Pork-High Protein, Beef -Low Protein, Cereal-Low Protein, and Pork-Low Protein).

Suppose we have available to us a total of N = 60 experimental rats to which we are going to apply the different diets based on the t = 6 treatment combinations. Prior to the experimentation the rats were divided into n = 10 homogeneous groups of size 6. The grouping was based on factors that had previously been ignored (Example - Initial weight size, appetite size etc.) Within each of the 10 blocks a rat is randomly assigned a treatment combination (diet).

The weight gain after a fixed period is measured for each of the test animals and is tabulated on the next slide:

Randomized Block Design

Example 2: The following experiment is interested in comparing the effect four different chemicals (A, B, C and D) in producing water resistance (y) in textiles. A strip of material, randomly selected from each bolt, is cut into four pieces (samples) the pieces are randomly assigned to receive one of the four chemical treatments.

This process is replicated three times producing a Randomized Block (RB) design. Moisture resistance (y) were measured for each of the samples. (Low readings indicate low moisture penetration). The data is given in the diagram and table on the next slide.

Diagram: Blocks (Bolt Samples)

Table Blocks (Bolt Samples) Chemical123 A B C D

The Model for a randomized Block Experiment i = 1,2,…, tj = 1,2,…, b y ij = the observation in the j th block receiving the i th treatment  = overall mean  i = the effect of the i th treatment  j = the effect of the j th Block  ij = random error

The Anova Table for a randomized Block Experiment SourceS.S.d.f.M.S.Fp-value TreatSS T t-1MS T MS T /MS E BlockSS B n-1MS B MS B /MS E ErrorSS E (t-1)(b-1)MS E

A randomized block experiment is assumed to be a two-factor experiment. The factors are blocks and treatments. The is one observation per cell. It is assumed that there is no interaction between blocks and treatments. The degrees of freedom for the interaction is used to estimate error.

The Anova Table for Diet Experiment

The Anova Table forTextile Experiment

If the treatments are defined in terms of two or more factors, the treatment Sum of Squares can be split (partitioned) into: –Main Effects –Interactions

The Anova Table for Diet Experiment terms for the main effects and interactions between Level of Protein and Source of Protein

Repeated Measures Designs

In a Repeated Measures Design We have experimental units that may be grouped according to one or several factors (the grouping factors) Then on each experimental unit we have not a single measurement but a group of measurements (the repeated measures) The repeated measures may be taken at combinations of levels of one or several factors (The repeated measures factors)

Example In the following study the experimenter was interested in how the level of a certain enzyme changed in cardiac patients after open heart surgery. The enzyme was measured immediately after surgery (Day 0), one day (Day 1), two days (Day 2) and one week (Day 7) after surgery for n = 15 cardiac surgical patients.

The data is given in the table below. Table: The enzyme levels -immediately after surgery (Day 0), one day (Day 1),two days (Day 2) and one week (Day 7) after surgery

The subjects are not grouped (single group). There is one repeated measures factor - Time – with levels –Day 0, –Day 1, –Day 2, –Day 7 This design is the same as a randomized block design with –Blocks = subjects

The Anova Table for Enzyme Experiment The Subject Source of variability is modelling the variability between subjects The ERROR Source of variability is modelling the variability within subjects

Example : (Repeated Measures Design - Grouping Factor) In the following study, similar to example 3, the experimenter was interested in how the level of a certain enzyme changed in cardiac patients after open heart surgery. In addition the experimenter was interested in how two drug treatments (A and B) would also effect the level of the enzyme.

The 24 patients were randomly divided into three groups of n= 8 patients. The first group of patients were left untreated as a control group while the second and third group were given drug treatments A and B respectively. Again the enzyme was measured immediately after surgery (Day 0), one day (Day 1), two days (Day 2) and one week (Day 7) after surgery for each of the cardiac surgical patients in the study.

Table: The enzyme levels - immediately after surgery (Day 0), one day (Day 1),two days (Day 2) and one week (Day 7) after surgery for three treatment groups (control, Drug A, Drug B)

The subjects are grouped by treatment –control, –Drug A, –Drug B There is one repeated measures factor - Time – with levels –Day 0, –Day 1, –Day 2, –Day 7

The Anova Table There are two sources of Error in a repeated measures design: The between subject error – Error 1 and the within subject error – Error 2

Tables of means DrugDay 0Day 1Day 2Day 7Overall Control A B Overall