The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest.

Slides:



Advertisements
Similar presentations
Analysis by design Statistics is involved in the analysis of data generated from an experiment. It is essential to spend time and effort in advance to.
Advertisements

Incomplete Block Designs. Randomized Block Design We want to compare t treatments Group the N = bt experimental units into b homogeneous blocks of size.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
1 Design of Engineering Experiments Part 3 – The Blocking Principle Text Reference, Chapter 4 Blocking and nuisance factors The randomized complete block.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
STA305 week 101 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job.
STA305 week 31 Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for.
1 Multifactor ANOVA. 2 What We Will Learn Two-factor ANOVA K ij =1 Two-factor ANOVA K ij =1 –Interaction –Tukey’s with multiple comparisons –Concept of.
C82MST Statistical Methods 2 - Lecture 7 1 Overview of Lecture Advantages and disadvantages of within subjects designs One-way within subjects ANOVA Two-way.
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
Chapter 4 Multiple Regression.
Evaluating Hypotheses
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
13-1 Designing Engineering Experiments Every experiment involves a sequence of activities: Conjecture – the original hypothesis that motivates the.
Incomplete Block Designs
Experimental Evaluation
Inferences About Process Quality
Analysis of Variance & Multivariate Analysis of Variance
13 Design and Analysis of Single-Factor Experiments:
Biostatistics-Lecture 9 Experimental designs Ruibin Xi Peking University School of Mathematical Sciences.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Fundamentals of Data Analysis Lecture 7 ANOVA. Program for today F Analysis of variance; F One factor design; F Many factors design; F Latin square scheme.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
QNT 531 Advanced Problems in Statistics and Research Methods
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
STA291 Statistical Methods Lecture 31. Analyzing a Design in One Factor – The One-Way Analysis of Variance Consider an experiment with a single factor.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
STA305 week21 The One-Factor Model Statistical model is used to describe data. It is an equation that shows the dependence of the response variable upon.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Experimental Design and Analysis of Variance Chapter 10.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Experimental Design If a process is in statistical control but has poor capability it will often be necessary to reduce variability. Experimental design.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Discussion 3 1/20/2014. Outline How to fill out the table in the appendix in HW3 What does the Model statement do in SAS Proc GLM ( please download lab.
Testing Hypotheses about Differences among Several Means.
STA305 week 51 Two-Factor Fixed Effects Model The model usually used for this design includes effects for both factors A and B. In addition, it includes.
The Completely Randomized Design (§8.3)
Design Of Experiments With Several Factors
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Control of Experimental Error Blocking - –A block is a group of homogeneous experimental units –Maximize the variation among blocks in order to minimize.
1 Overview of Experimental Design. 2 3 Examples of Experimental Designs.
1 Introduction to Mixed Linear Models in Microarray Experiments 2/1/2011 Copyright © 2011 Dan Nettleton.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
Chapter 13 Design of Experiments. Introduction “Listening” or passive statistical tools: control charts. “Conversational” or active tools: Experimental.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
One-Way Analysis of Variance Recapitulation Recapitulation 1. Comparing differences among three or more subsamples requires a different statistical test.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 9 Review.
Factorial Experiments Analysis of Variance Experimental Design.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
1 Topic 14 – Experimental Design Crossover Nested Factors Repeated Measures.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.
Two-Factor Study with Random Effects In some experiments the levels of both factors A & B are chosen at random from a larger set of possible factor levels.
Simple and multiple regression analysis in matrix form Least square Beta estimation Beta Simple linear regression Multiple regression with two predictors.
1 Chapter 5.8 What if We Have More Than Two Samples?
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Stats Methods at IC Lecture 3: Regression.
Design Lecture: week3 HSTS212.
Statistical Data Analysis - Lecture10 26/03/03
Topics Randomized complete block design (RCBD) Latin square designs
What are their purposes? What kinds?
ENM 310 Design of Experiments and Regression Analysis Chapter 3
Chapter 10 – Part II Analysis of Variance
14 Design of Experiments with Several Factors CHAPTER OUTLINE
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

The Mixed Effects Model - Introduction In many situations, one of the factors of interest will have its levels chosen because they are of specific interest to the researcher. On the other hand, there may be a second factor of interest for which it is important to generalize to all possible levels; in this case the levels of the second factor might be chosen at random. This type of experiment is referred to as a mixed study design because one of the factors has fixed levels while the other has random levels. 1STA305 Week 9

The Mixed Effects Model Suppose that in a given study, a levels of factor A have been chosen because they are of particular interest. Further, suppose that b levels of factor B have been chosen at random from all possible levels of this factor. A total of abr experimental units will be used to conduct the study, r units will be randomly allocated to each of the ab experimental conditions. The form of the statistical model that we will study is identical to that for 2 factor studies with either fixed or random effects - the difference is in the assumptions about the factor levels and the interactions. The model equation is Y ijk = μ + α i + β j + γ ij + ε ijk As with the fixed and random effects models, we parameterize the model in such a way that μ is the overall mean of all of the responses: i.e. 2STA305 Week 9

Assumptions of the Mixed Effects Model As in the fixed effects model, the factor A has fixed effects and we therefore require that Since the levels of factor B have been chosen at random, we require instead that Since one of the factors is random, the interactions must be random as well however, since one of the factors is fixed, the sum over that component will be 0. Together these yield 2 constraints on the γ ij The factor (a-1)/a is for convenience in expressing EMS only. STA305 Week 93

STA305 week 54 Sums of Squares The observed variation in the data is measure in the same manner as for the fixed effects and the random effect case. In other words, the total variation in the data is measured by The sums of squares and the degrees of freedom for the other sources of variation are also the same as in the 2-factor study with fixed or random effects model. The only difference is in the expected mean squares.

STA305 week 55 Expected Mean Squares The expected mean squares are as follows:

Hypothesis Testing As in all of the other experimental designs that we have looked at, the motivation for the test statistics is derived from the EMS. As in the case of both the fixed and random effects models, the test for interactions will be made by comparing MS A×B to MS E. The test for the fixed factor, factor A, will be made by comparing MS A to MS A×B. The test for the random effect, factor B, will be made by comparing MS B to MS E. STA305 Week 96

The ANOVA Table It is useful to add the expected mean squares to the table in order to remember which ratios to form for the F-tests. The ANOVA table is given below: STA305 week 67

Estimating the Model Parameters The effect for the levels of the fixed factor can be estimated as in the fixed effects model. That is, In the mixed model, however, confidence intervals for the effects of the levels of the fixed factors are constructed using MS A×B as the variance estimate. That is, a CI for the effect of the ith level of factor A is: Orthogonal contrasts can also be used to make inferences about the levels of factor A. The mixed effects model also contains components of variation and these can be estimated as follows: STA305 week 68

Random & Mixed Effects Using SAS – Example Background: the goal of this study is to investigate the capacity of a measurement system. Design: 10 parts are randomly selected; 2 operators are randomly selected to measure each part 3 times. The statements required to conduct analysis in SAS are as follows: proc glm data = measurement ; class part operator ; Model measure = part | operator ; Random part operator ; Test h=part e=part*operator ; Test h=operator e=part*operator ; run ; STA305 Week 99

10

STA305 Week 911

STA305 Week 912

Suppose that parts were fixed and operators were random. The SAS code would be as follows: proc glm data = measurement ; class part operator ; model measure = part | operator ; random part operator ; test h=part e=part*operator ; run ; The ANOVA would look the same as above. The fixed factor “part” would be tested against the interaction. The (random) factor “operator” and “part×operator” would be tested against error term that can be read from the ANOVA table. STA305 Week 913

Three-Factor Fixed Effects Design Suppose that in a particular experiment, there are 3 factors that are of interest to the researcher. Assume that there are a levels of Factor A, b levels of Factor B, and c levels of Factor C. In this case, the researcher must also be concerned with interactions between all 3 factors: A×B, A×C, B×C, and A×B×C. The model that we will use in this case is Y ijkl = μ+α i +β j +_γ k +(αβ) ij +(αγ) ik +(βγ) jk +(αβγ) ijk + ε ijkl. In this notation, the interaction terms are denoted by, for instance, (αβ) ij. This notation is used to avoid introducing more Greek letters, and does not mean that the interaction between α i and β j is α i β j STA305 Week 914

Model Assumptions The assumptions about the parameters are similar to those for the 2- factor fixed effects model. We assume the following: STA305 Week 915

Sums of Squares and ANOVA Table STA305 Week 916

Blocking - Introduction In general, the goal of experimental design is to minimization haphazard variability and to be able to see differences between treatments. In some situations, a variable might have an impact on the response, however, this variable is not the focus of the study and we generally wish to exclude it from the design. Such variables are called nuisance factors. The purpose of randomization is to average out the impact of these nuisance factors. In some cases, the nuisance factors may be both unknown and uncontrollable, in which case randomization is especially useful. STA305 Week 917

In other cases, factors which influence response might be known, but possibly uncontrollable. Although such factors cannot be included in the design, we can at least observe their value. The analysis can then be adjusted to compensate for the effect of these variables using an analysis of covariance (to be discussed later in the course). In other situations, a nuisance factor may be both known and controllable, in which case, we can reduce risk of haphazard error by including this factor in design of the experiment. The type of designs, called blocked designs can be used to reduce variability of experimental error in such cases. STA305 Week 918

Example Fleet manager wishes to consider 4 brands of tires to determine which has least tread wear after 20,000 miles. Since there are 4 brands of tires to test the study should ideally include at least 4 cars. Denote tire brands by T 1, T 2, T 3, T 4 and the cars by C 1, C 2, C 3, C 4. One possible way to design the study is to randomly decide which car gets which type of tire. This car would then have 4 tires of this type STA305 Week 919

However, if there is a difference between cars with respect to the wear they cause on the tires, then this design will not allow us to detect a difference between brands. Although differences between cars are not of primary interest, they need to be taken into account. One possible way around this is to randomly assign the 16 tires (4 of each type) to the 4 cars. The following allocation of tires to cars might result from such a randomization: STA305 Week 920

However, the goal of the design was to eliminate the confounding of tire effects with car differences but this goal has not been met here. For example, brand T 1 isn’t used on car C 3, brand T 2 is not used on car C 1, and brand T 4 is not used on car C 2. So we need to ensure that there is no confounding and that random error does include differences between cars. This could be accomplished by restricting randomization so that each car must have one tire of each brand. That is, randomize the location of tires within each car. An example of such a randomization scheme is as follows: This design is known as a randomized complete block design. STA305 Week 921

Randomized Complete Block Design A randomized complete block design is a restricted randomization. Experimental units are first organized into homogeneous groups called blocks. Treatments are then randomly allocated within each block. In the example above, cars were contributing to variation but were not of primary interest. The fact that each car requires 4 tires means that the 4 tires on one car form a natural blocking unit. The purpose of blocking is to ensure that experimental units within a block are as homogeneous as possible with respect to the response variable. Units in different blocks are more heterogeneous. STA305 Week 922

Advantages & Disadvantages Using blocks allows us to control a factor not of primary interest. However, it requires that there be enough experimental units to ensure that each treatment can be used within each block. Further, it requires the researcher to assume that there is no interaction between blocks and treatments. Since block effects must be estimated in addition to treatment effects, the degrees of freedom available for estimating error are reduced. STA305 Week 923

Special Case: Paired t-test The simplest example of a randomized complete block design is a paired t-test. In this case there are 2 treatments to be studied, each treatment is applied to each experimental unit. For example, twins might be randomly allocated to one of 2 treatments. Or 2 treatments might be randomly allocated to left and right eyes, lungs, kidneys, hands, etc. STA305 Week 924

General Case: Two or More Treatments Consider the case where there is one factor which will be studied for its effect on the response variable. Suppose that the number of levels of that factor is a. Further, suppose that it is known that there is a nuisance factor which can be controlled, and that this factor will be used to form blocks. Let b denote the number of blocks to be used in the experiment. The order in which the treatments will be allocated within blocks is randomized. The total number of experimental units required to conduct this experiment is N = ab. STA305 Week 925

The Model We will use the following statistical model to express the response in terms of the treatment and block effects: Y ij = μ+τ i +β j + ε ij. Where: μ is the overall mean τ i is the effect of the i-th treatment β j is the effect of the j-th block and ε ij is the residual or random error term. It is possible that either or both of the treatments and blocks could be randomly chosen. But, for now we assume that both are fixed. STA305 Week 926

Assumptions As before, we will assume that ε ij ~ N(0, σ 2 ) and that ε ij are independent of each other. Treatment and block effects are defined as deviations from the overall mean. Therefore, we require that STA305 Week 927

Sources of Variation When considered as a whole, the data from all treatment groups and all blocks will contain a certain amount of variability. Some of the variability might be due the fact that the treatments have different effects on the response. Similarly, some of the variability might be due to the fact that blocks are quite heterogeneous with regard to the response. Finally, even if there were no treatment or block differences, there would still be chance variation. The total sum of squares is a measure of the overall variability in the sample, and it can be decomposed to allow us to determine how much variability is due to each source… STA305 Week 928