Two-Factor Full Factorial Designs

Slides:

Advertisements

Similar presentations

ANOVA Two Factor Models Two Factor Models. 2 Factor Experiments Two factors can either independently or together interact to affect the average response.

Advertisements

Topic 12: Multiple Linear Regression

Copyright 2004 David J. Lilja1 Comparing Two Alternatives Use confidence intervals for Before-and-after comparisons Noncorresponding measurements.

 Population multiple regression model  Data for multiple regression  Multiple linear regression model  Confidence intervals and significance tests.

Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.

ANOVA: Analysis of Variation

Chapter 11 Analysis of Variance

k r Factorial Designs with Replications r replications of 2 k Experiments –2 k r observations. –Allows estimation of experimental errors Model:

1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.

Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.

Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.

For Discussion Today (when the alarm goes off) Survey your proceedings for just one paper in which factorial design has been used or, if none, one in which.

One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.

QNT 531 Advanced Problems in Statistics and Research Methods

CPE 619 Simple Linear Regression Models Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama.

Simple Linear Regression Models

© 1998, Geoff Kuenning Linear Regression Models What is a (good) model? Estimating model parameters Allocating variation Confidence intervals for regressions.

© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.

1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 13 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.

1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.

1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.

CHAPTER 14 MULTIPLE REGRESSION

Lecture 8 Page 1 CS 239, Spring 2007 Experiment Design CS 239 Experimental Methodologies for System Software Peter Reiher May 1, 2007.

CHAPTER 12 Analysis of Variance Tests

Chapter 10 Analysis of Variance.

Lecture 10 Page 1 CS 239, Spring 2007 Experiment Designs for Categorical Parameters CS 239 Experimental Methodologies for System Software Peter Reiher.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.

Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.

One-Way Analysis of Variance

Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.

Lecture 9-1 Analysis of Variance

STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.

CPE 619 Two-Factor Full Factorial Design With Replications Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The.

CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.

Experiment Design Overview Number of factors 1 2 k levels 2:min/max n - cat num regression models2k2k repl interactions & errors 2 k-p weak interactions.

Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.

Chapter 11 Analysis of Variance. 11.1: The Completely Randomized Design: One-Way Analysis of Variance vocabulary –completely randomized –groups –factors.

Chapter 4 Analysis of Variance

1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 

Linear Regression Models Andy Wang CIS Computer Systems Performance Analysis.

Topic 27: Strategies of Analysis. Outline Strategy for analysis of two-way studies –Interaction is not significant –Interaction is significant What if.

Multiple Linear Regression

CHAPTER 3 Analysis of Variance (ANOVA) PART 3 = TWO-WAY ANOVA WITH REPLICATION (FACTORIAL EXPERIMENT) MADAM SITI AISYAH ZAKARIA EQT 271 SEM /2015.

McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.

F73DA2 INTRODUCTORY DATA ANALYSIS ANALYSIS OF VARIANCE.

DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.

CHAPTER 3 Analysis of Variance (ANOVA) PART 3 = TWO-WAY ANOVA WITH REPLICATION (FACTORIAL EXPERIMENT)

ANOVA: Analysis of Variation

Chapter 11 Analysis of Variance

ANOVA: Analysis of Variation

ANOVA: Analysis of Variation

MADAM SITI AISYAH ZAKARIA

ANOVA: Analysis of Variation

Statistics for Managers Using Microsoft Excel 3rd Edition

Two-Way Analysis of Variance Chapter 11.

Factorial Experiments

CHAPTER 4 Analysis of Variance (ANOVA)

Chapter 10: Analysis of Variance: Comparing More Than Two Means

ANalysis Of VAriance (ANOVA)

Linear Regression Models

CHAPTER 29: Multiple Regression*

Chapter 12 Inference on the Least-squares Regression Line; ANOVA

Chapter 11 Analysis of Variance

CHAPTER 14 MULTIPLE REGRESSION

Replicated Binary Designs

Reasoning in Psychology Using Statistics

Reasoning in Psychology Using Statistics

One-Factor Experiments

For Discussion Today Survey your proceedings for just one paper in which factorial design has been used or, if none, one in which it could have been used.

F test for Lack of Fit The lack of fit test..

Presentation transcript:

Two-Factor Full Factorial Designs Andy Wang CIS 5930-03 Computer Systems Performance Analysis

Two-Factor Design Without Replications Used when only two parameters, but multiple levels for each Test all combinations of levels of the two parameters One replication per combination For factors A and B with a and b levels, ab experiments required

When to Use This Design? System has two important factors Factors are categorical More than two levels for at least one factor Examples: Performance of different CPUs under different workloads Characteristics of different compilers for different benchmarks Effects of different reconciliation topologies and workloads on a replicated file system

When to Avoid This Design? Systems with more than two important factors Use general factorial design Non-categorical variables Use regression Only two levels Use 22 designs

Model For This Design yij is observation m is mean response j is effect of factor A at level j i is effect of factor B at level i eij is error term Sums of j’s and i’s are both zero

Assumptions of the Model Factors are additive Errors are additive Typical assumptions about errors: Distributed independently of factor levels Normally distributed Remember to check these assumptions!

Computing Effects Need to figure out , j, and i Arrange observations in two-dimensional matrix b rows, a columns Compute effects such that error has zero mean Sum of error terms across all rows and columns is zero

Two-Factor Full Factorial Example Want to expand functionality of a file system to allow automatic compression Examine three choices: Library substitution of file system calls New VFS Stackable layers Three different benchmarks Metric: response time

Data for Example Library VFS Layers Compilation Benchmark 94.3 89.5 96.2 Email Benchmark 224.9 231.8 247.2 Web Server Benchmark 733.5 702.1 797.4

Computing  Averaging the jth column, By assumption, error terms add to zero Also, the j’s add to zero, so Averaging rows produces Averaging everything produces

Parameters Are:

Calculating Parameters for the Example  = grand mean = 357.4 j = (-6.5, -16.3, 22.8) i = (-264.1, -122.8, 386.9) So, for example, the model predicts that the email benchmark using a special-purpose VFS will take 357.4 - 16.3 -122.8 = 218.3 seconds

Estimating Experimental Errors Similar to estimation of errors in previous designs Take difference between model’s predictions and observations Calculate Sum of Squared Errors Then allocate variation

Allocating Variation Use same kind of procedure as on other models SSY = SS0 + SSA + SSB + SSE SST = SSY - SS0 Can then divide total variation between SSA, SSB, and SSE

Calculating SS0, SSA, SSB Recall that a and b are number of levels for the factors

Allocation of Variation for Example SSE = 2512 SSY = 1,858,390 SS0 = 1,149,827 SSA = 2489 SSB = 703,561 SST = 708,562 Percent variation due to A: 0.35% Percent variation due to B: 99.3% Percent variation due to errors: 0.35%

Analysis of Variation Again, similar to previous models, with slight modifications As before, use an ANOVA procedure Need extra row for second factor Minor changes in degrees of freedom End steps are the same Compare F-computed to F-table Compare for each factor

Analysis of Variation for Our Example MSE = SSE/[(a-1)(b-1)] = 2512/[(2)(2)] = 628 MSA = SSA/(a-1) = 2489/2 = 1244 MSB = SSB/(b-1) = 703,561/2 = 351,780 F-computed for A = MSA/MSE = 1.98 F-computed for B = MSB/MSE = 560 95% F-table value for A & B is 6.94 So A is not significant, but B is

Checking Our Results with Visual Tests As always, check if assumptions made in the analysis are correct Use residuals vs. predicted and quantile-quantile plots

Residuals Vs. Predicted Response for Example

What Does the Chart Reveal? Do we or don’t we see a trend in errors? Clearly they’re higher at highest level of the predictors But is that alone enough to call a trend? Perhaps not, but we should take a close look at both factors to see if there’s reason to look further Maybe take results with a grain of salt

Quantile-Quantile Plot for Example

Confidence Intervals for Effects Need to determine standard deviation for data as a whole Then can derive standard deviations for effects Use different degrees of freedom for each Complete table in Jain, pg. 351

Standard Deviations for Example se = sqrt(MSE) = 25.0 Standard deviation of : Standard deviation of j - Standard deviation of i -

Calculating Confidence Intervals for Example Only file system alternatives shown here Use 95% level, 4 degrees of freedom CI for library solution: (-39,26) CI for VFS solution: (-49,16) CI for layered solution: (-10,55) So none of the solutions are significantly different than mean at 95%

Looking a Little Closer Do zero CI’s mean that none of the alternatives for adding functionality are different? Use contrasts to check (see section 20.7) T-test should work as well

White Slide