Incomplete Block Designs

Slides:



Advertisements
Similar presentations
Introductory Mathematics & Statistics for Business
Advertisements

Multiple-choice question
Incomplete Block Designs. Randomized Block Design We want to compare t treatments Group the N = bt experimental units into b homogeneous blocks of size.
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture #19 Analysis of Designs with Random Factor Levels.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
STA305 week 101 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Topic 6: Introduction to Hypothesis Testing
Multiple regression analysis
Analysis of Variance: Inferences about 2 or More Means
Pengujian Hipotesis Nilai Tengah Pertemuan 19 Matakuliah: I0134/Metode Statistika Tahun: 2007.
Lecture 9: One Way ANOVA Between Subjects
Chapter 11 Multiple Regression.
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Inferences About Process Quality
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Analysis of Variance & Multivariate Analysis of Variance
1 Topic 11 – ANOVA II Balanced Two-Way ANOVA (Chapter 19)
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
PROBABILITY & STATISTICAL INFERENCE LECTURE 6 MSc in Computing (Data Analytics)
Analysis of Covariance ANOVA is a class of statistics developed to evaluate controlled experiments. Experimental control, random selection of subjects,
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Two-way ANOVA Introduction to Factorial Designs and their Analysis.
STA305 week21 The One-Factor Model Statistical model is used to describe data. It is an equation that shows the dependence of the response variable upon.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
1 Chapter 3 Multiple Linear Regression Multiple Regression Models Suppose that the yield in pounds of conversion in a chemical process depends.
Randomized Block Design (Kirk, chapter 7) BUSI 6480 Lecture 6.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
Discussion 3 1/20/2014. Outline How to fill out the table in the appendix in HW3 What does the Model statement do in SAS Proc GLM ( please download lab.
Testing Hypotheses about Differences among Several Means.
STA305 week 51 Two-Factor Fixed Effects Model The model usually used for this design includes effects for both factors A and B. In addition, it includes.
Design Of Experiments With Several Factors
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
1 Experimental Statistics - week 14 Multiple Regression – miscellaneous topics.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
Control of Experimental Error Blocking - –A block is a group of homogeneous experimental units –Maximize the variation among blocks in order to minimize.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
1 Experimental Statistics - week 9 Chapter 17: Models with Random Effects Chapter 18: Repeated Measures.
Experimental Statistics - week 3
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.
Two-Factor Study with Random Effects In some experiments the levels of both factors A & B are chosen at random from a larger set of possible factor levels.
Simple and multiple regression analysis in matrix form Least square Beta estimation Beta Simple linear regression Multiple regression with two predictors.
1 Chapter 5.8 What if We Have More Than Two Samples?
Experimental Designs The objective of Experimental design is to reduce the magnitude of random error resulting in more powerful tests to detect experimental.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Design Lecture: week3 HSTS212.
Incomplete Block Designs
Comparing Three or More Means
Statistics Analysis of Variance.
Statistics for Business and Economics (13e)
Chapter 9 Hypothesis Testing.
Joanna Romaniuk Quanticate, Warsaw, Poland
One-Way Analysis of Variance
The American Statistician (1990) Vol. 44, pp
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments in each block. Any blocked experiment which has fewer than a units per block is called an incomplete block design. STA305 week11

Balanced Incomplete Block Design (BIBD) If all treatment comparisons are of equal importance and if we wish to estimate each treatment effect with the same precision, then the design needs to be balanced: (1) Each block contains the same number of units. (2) Each treatment occur the same number of times in total (3) Each pair of treatments occurs together the same number of times in total.   A design that satisfies these conditions is called Balanced Incomplete Block Design. STA305 week11

Example  Consider a study which we wish to compares 4 treatments: A, B, C, and D. Only 2 treatments can be used in each block and suppose there are 6 blocks available. The following is a balanced incomplete block design (BIBD): This design is balanced with respect to each of the 3 points above. STA305 week11

So this would not be considered a balanced incomplete block design. Consider the following design, which also uses 6 blocks, each with 2 units Although this design is balanced with regard to the first 2 items, the same 2 treatments always appear together. So this would not be considered a balanced incomplete block design. Could this study be designed using 5 blocks? 7 blocks? STA305 week11

Number of Experimental Units Required We will use the following notation: a - number of treatments, b - number of blocks, k - number of units per block , r - total number of times each treatment occurs The total number of units required in the study is therefore N = bk. However, the total number of units required can also be expressed as N = ar. So in a balanced design, we must have bk = ar. Using combinatorics, we can show that the total number of times that each pair of treatments appears together is where λ must be an integer. NOTE: designs that satisfy the last two conditions may not exist for a particular choice of a, b, k, r. STA305 week11

Example A study was designed to compare 5 treatments. Although the study will use blocks to control for a nuisance factor, there are only 3 experimental units within each block. Are there values of b (# of blocks) and r (# of times each treatment is used) that would lead to a BIBD?   STA305 week11

Solution STA305 week11

Statistical Model for BIBD The model equation for the BIBD is the same as for the randomized complete block design, that is Yij = μ+τi +βj+ εij. μ is the overall mean, τi is the effect of the i-th treatment, βj is the effect of the j-th block and εij are i.i.d N(0, σ2) - the residual or random error term. STA305 week11

Sums of Squares The total sum of squares is calculated in the usual manner: In decomposing total sum of squares, we use an adjusted treatment sum of squares to account for the fact that each treatment did not occur within each block. So treatment sum of squares sums over different blocks for each treatment. STA305 week11

Additional Notation Let Ti denote the sum over all experimental units which received treatment i. Since each treatment does not appear in each block, the treatment total must be adjusted to take this into account. Let Bi be the sum over all observations in the blocks in which treatment i occurred. Then the adjusted treatment total is Qi = Ti - Bi / k If we were interested in testing for the equality of block effects, we would also need to compute an adjusted block total. STA305 week11

The treatment sum of squares adjusted for blocks is The sum of squares for the blocking factor (not adjusted for treatments) is The error sum of squares is obtained by subtraction: SSE = SST - SSTr (adj) - SSBl   STA305 week11

Degrees of Freedom Since the same number of parameters must still be estimated in this design (compared to the randomized complete block design), the degrees of freedom will be the same. Since there are a treatments, the degrees of freedom for treatment is a-1. Similarly, the degrees of freedom for blocks is b-1. The total degrees of freedom is N-1, where N = bk = ar STA305 week11

Testing for Treatment Effects In order to test for treatment effects, we start by calculating the necessary sums of squares and constructing the ANOVA Table. It is given below   Note that the mean square for blocks is not computed; if we wanted to test for block effects we would have to compute an adjusted sum of squares for blocks. STA305 week11

We use the test statistic: Fobs = MSTr (adj) / MSE To test the hypotheses H0: τ1 = τ2 = ... = τa = 0 Ha: not all τi = 0 We use the test statistic: Fobs = MSTr (adj) / MSE The corresponding P-value = P(F(a-1, N-a-b+1 > Fobs ) Small values of the p-value would cause us to reject the hypothesis that there is no treatment effect. STA305 week11

Example A researcher planned and carried out a study to examine the inter-rater reliability of a scale for measuring depression. Six examiners at a particular institution were the primary focus of the study. However, since the outcome of the assessment was likely to be influenced by the patient, it was decided to block on patients. It was not feasible to have all 6 examiners question every subject; each subject was to be examined by 3 examiners. In order to achieve balance it is necessary to have ar = bk, which in this case is 6r = 3b It is also necessary for λ = r(k-1)/(a-1) to be an integer; in this case λ = r(2/5). The smallest value of r for which λ will be an integer is 5, and this requires that the number of blocks, b, be 10. STA305 week11

A balanced incomplete block design was used, and the following data were collected: Do these data provide any evidence that the examiners differ with respect to the mean depression scores that they assign to patients? STA305 week11

Solution STA305 week11

Contrasts As in other experiments, the researchers may be interested in comparisons between subgroups of treatment means. Contrasts can be used to test hypotheses. Tests should be based on the adjusted treatment effects: To test the hypothesis H0 : c1μ1 + c2μ2 + · · · + caμa = 0 Ha : c1μ1 + c2μ2 + · · · + caμa ≠ 0 We use Fobs = SSC / MSE, where STA305 week11

Least Squares Estimates of the Model Parameter In deriving the least squares estimators, we will use nij, an indicator for treatment I occurring in block j as follow: Start by forming the sum of the squared deviations of the observed values from the fitted values… STA305 week11

Analysis of BIBD Using SAS In the designs we have examined up to now, the order in which factors were specified in the “model” statement was not important. However, for the BIBD case the order is important. Refer to the Example on slide 15-16. The manner in which data are input for the BIBD is similar to that for the RBC design:   data example ; input patient examiner score ; cards ; 1 1 10 1 2 14 1 3 10 2 1 3 2 2 3 2 4 1 etc; run ; STA305 week11

proc glm data = example ; class patient examiner ; The glm procedure is still the appropriate choice for this design, but is important that the block factor be entered into the model before the treatment factor proc glm data = example ; class patient examiner ; model score = patient examiner ; title1 'block before factor' ; run ; Care is needed in reading the SAS output, since Type I SS is used for the block factor (unadjusted) while Type III SS is used for the treatment factor (adjusted). The outputs is given below: STA305 week11

STA305 week11

The error SS is read from the output and it is 139.22222222. The total SS is read from the SAS output, and in this case it is 1156.66666667. The error SS is read from the output and it is 139.22222222. The unadjusted block SS is taken from the column containing the Type I SS, and in this case is 982.00. The adjusted treatment SS is taken from the column containing the Type III SS, and is 35.44444444. The correct value to use for the F-test is the one associated with the Type III SS for treatment. For the purposes of comparison, the output generated by SAS when the treatment factor is entered into the model before blocks is given below STA305 week11

STA305 week11

proc glm data = example ; class patient examiner ; From this output we can get the adjusted SS for the factor (examiner), but we cannot get the unadjusted SS for the blocks (patients). Because of the unbalanced nature of this design, least squares treatment means should be used rather than simple averages. To obtain least squares means for the treatment factor, just add one line to the SAS code as follows: proc glm data = example ; class patient examiner ; model score = patient examiner ; lsmeans examiner ; title1 'block before factor' ; run ; the output generated by the “lsmeans” statement is given below: STA305 week11

Compare the LS means to the treatment averages that do not take into account block effects: STA305 week11

Contrasts in SAS Contrasts can also be tested using SAS. Suppose that examiner 1 was more recently hired than the other examiners and it is of interest to test the hypothesis that the mean for examiner 1 is equal to the mean for the others. In other words, test H0 : μ1 = (μ2 + μ3 + μ4 + μ5 + μ6)/5 Ha : μ1 ≠ (μ2 + μ3 + μ4 + μ5 + μ6)/5 To do this test is SAS, it is still important that the block factor is entered into the model before the treatment factor. Add the following statement to the glm code given above contrast ‘1st vs others’ examiner -1 0.2 0.2 0.2 0.2 0.2 ; STA305 week11

An addition line of output will be produced giving the following information STA305 week11

Additive Models and Alternatives Models used for randomized complete block design, BIBD, Latin square & Graeco-Latin square design are known as additive models. Consider the model used for the randomized complete block design: Yij = μ+τi +βj+ εij. In this model, treatment i always causes the expected response to increase/decrease τi units from the overall mean μ. The effect of block j is to cause a change of βj units, regardless of which treatment is applied. The effect of treatment i and block j together cause an increase or decrease from the overall mean by τi +βj. In other words, the effects of the 2 factors are additive. This model is not always appropriate for a randomized complete block design. Generally, an additive model may not be appropriate for a particular design. STA305 week11

Multiplicative Effects In some cases, the effect of one factor might be amplified by the presence of another factor. A multiplicative model might be appropriate in this situation: Yij = μ×τi×βj×εij. Taking logarithms results in an additive model: ln{Yij} = ln{μ}+ln{τi}+ln{βj}+ln{εij}. This can be written as STA305 week11

Interaction In some cases, there may be an element of additivity, although the additive model may not be adequate. For instance, consider a randomized complete block design where treatment i causes an extreme reaction in a particular block, but the other treatments do not. Furthermore, suppose that in the other blocks treatment i does not cause an extreme reaction. In this case, there is an interaction between treatments and blocks. The additive model would need to be modified to include an interaction term.   STA305 week11