A Really Bad Graph. For Discussion Today Project Proposal 1.Statement of hypothesis 2.Workload decisions 3.Metrics to be used 4.Method.

Slides:



Advertisements
Similar presentations
Analysis of Variance (ANOVA)
Advertisements

Factorial ANOVA More than one categorical explanatory variable.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
2  How to compare the difference on >2 groups on one or more variables  If it is only one variable, we could compare three groups with multiple ttests:
Experimental Design, Response Surface Analysis, and Optimization
Friday, May 14, 2004 ISYS3015 Analytical Methods for IS Professionals School of IT, The University of Sydney 1 Factorial Designs Week 9 Lecture 2.
Using process knowledge to identify uncontrolled variables and control variables as inputs for Process Improvement 1.
Statistics for Managers Using Microsoft® Excel 5th Edition
Statistics CSE 807.
Chapter 11 Analysis of Variance
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 14 1 MER301: Engineering Reliability LECTURE 14: Chapter 7: Design of Engineering.
Statistics for Managers Using Microsoft® Excel 5th Edition
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Introduction to Experiment Design Shiv Kalyanaraman Rensselaer Polytechnic Institute
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
k r Factorial Designs with Replications r replications of 2 k Experiments –2 k r observations. –Allows estimation of experimental errors Model:
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Introduction to Experiment Design Shiv Kalyanaraman Rensselaer Polytechnic Institute
Inferences About Process Quality
Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.
Statistics Design of Experiment.
For Discussion Today (when the alarm goes off) Survey your proceedings for just one paper in which factorial design has been used or, if none, one in which.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
QNT 531 Advanced Problems in Statistics and Research Methods
CPE 619 2k-p Factorial Design
Ratio Games and Designing Experiments Andy Wang CIS Computer Systems Performance Analysis.
© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs.
Lecture 8 Page 1 CS 239, Spring 2007 Experiment Design CS 239 Experimental Methodologies for System Software Peter Reiher May 1, 2007.
Introduction to Experimental Design
CHAPTER 12 Analysis of Variance Tests
Lecture 10 Page 1 CS 239, Spring 2007 Experiment Designs for Categorical Parameters CS 239 Experimental Methodologies for System Software Peter Reiher.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
ANALYSIS OF VARIANCE (ANOVA) BCT 2053 CHAPTER 5. CONTENT 5.1 Introduction to ANOVA 5.2 One-Way ANOVA 5.3 Two-Way ANOVA.
MSE-415: B. Hawrylo Chapter 13 – Robust Design What is robust design/process/product?: A robust product (process) is one that performs as intended even.
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
CPE 619 Experimental Design Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in Huntsville.
Question paper 1997.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Experiment Design Overview Number of factors 1 2 k levels 2:min/max n - cat num regression models2k2k repl interactions & errors 2 k-p weak interactions.
Chapter 11 Analysis of Variance. 11.1: The Completely Randomized Design: One-Way Analysis of Variance vocabulary –completely randomized –groups –factors.
1 Simulation Scenarios. 2 Computer Based Experiments Systematically planning and conducting scientific studies that change experimental variables together.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
CHAPTER 3 Analysis of Variance (ANOVA) PART 3 = TWO-WAY ANOVA WITH REPLICATION (FACTORIAL EXPERIMENT) MADAM SITI AISYAH ZAKARIA EQT 271 SEM /2015.
© 1998, Geoff Kuenning Comparison Methodology Meaning of a sample Confidence intervals Making decisions and comparing alternatives Special considerations.
ChE 551 Lecture 04 Statistical Tests Of Rate Equations 1.
Engineering Statistics Design of Engineering Experiments.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
CHAPTER 3 Analysis of Variance (ANOVA) PART 3 = TWO-WAY ANOVA WITH REPLICATION (FACTORIAL EXPERIMENT)
Stats Methods at IC Lecture 3: Regression.
15 Inferential Statistics.
Analysis of Variance l Chapter 8 l 8.1 One way ANOVA
Common Mistakes in Performance Evaluation The Art of Computer Systems Performance Analysis By Raj Jain Adel Nadjaran Toosi.
Two-Way Analysis of Variance Chapter 11.
DESIGN OF EXPERIMENTS.
Ratio Games and Designing Experiments
i) Two way ANOVA without replication
Two-Factor Full Factorial Designs
IE-432 Design Of Industrial Experiments
Correlation and Regression
Chapter 11 Analysis of Variance
Discrete Event Simulation - 4
Replicated Binary Designs
One-Factor Experiments
DESIGN OF EXPERIMENTS by R. C. Baker
CS533 Modeling and Performance Evalua ion of Network and Computer Systems Experimental Design (Chapters 16-17)
Presentation transcript:

A Really Bad Graph

For Discussion Today Project Proposal 1.Statement of hypothesis 2.Workload decisions 3.Metrics to be used 4.Method

© 1998, Geoff Kuenning Designing Experiments Introduction 2 k factorial designs 2 k r factorial designs 2 k-p fractional factorial designs One-factor experiments Two-factor full factorial design without replications Two-factor full factorial design with replications General full factorial designs with k factors © 1998, Geoff Kuenning

Introduction To Experiment Design You know your metrics You know your factors You know your levels You’ve got your instrumentation and test loads Now what? © 1998, Geoff Kuenning

Goals in Experiment Design Obtain maximum information With minimum work –Typically meaning minimum number of experiments More experiments aren’t better if you’re the one who has to perform them Well-designed experiments are also easier to analyze © 1998, Geoff Kuenning

Experimental Replications The system under study will be run with varying levels of different factors, potentially with differing workloads A run with a particular set of levels and other inputs is a replication Often, you need to do multiple replications with a single set of levels and other inputs –For statistical validation © 1998, Geoff Kuenning

Interacting Factors Some factors have effects completely independent of each other –Double the factor’s level, halve the response, regardless of other factors But the effects of some factors depends on the values of other factors –Interacting factors Presence of interacting factors complicates experimental design © 1998, Geoff Kuenning

Basic Problem in Designing Experiments You have chosen some number of factors They may or may not interact How can you design an experiment that captures the full range of the levels? –With minimum amount of work Which combination or combinations of the levels of the factors do you measure? © 1998, Geoff Kuenning

Common Mistakes in Experimentation Ignoring experimental error Uncontrolled parameters Not isolating effects of different factors One-factor-at-a-time experiment designs Interactions ignored Designs require too many experiments © 1998, Geoff Kuenning

Types of Experimental Designs Simple designs Full factorial design Fractional factorial design

Experimental Design ( l 1,0, l 1,1, …, l 1,n1-1 ) x ( l 2,0, l 2,1, …, l 2,n2-1 ) x … x ( l k,0, l k,1, …, l k,nk-1 ) k different factors, each factor with n i levels r replications Factor 1 Factor k Factor 2

© 1998, Geoff Kuenning Simple Designs Vary one factor at a time For k factors with i th factor having n i levels - Assumes factors don’t interact Usually more effort than required Don’t use it, usually

Simple Designs ( l 1,0, l 1,1, …, l 1,n-1 ) x ( l 2,0, l 2,1, …, l 2,n-1 ) x … x ( l k,0, l k,1, …, l k,n-1 ) Factor 1 Factor k Factor 2 fix vary

Simple Designs ( l 1,0, l 1,1, …, l 1,n-1 ) x ( l 2,0, l 2,1, …, l 2,n-1 ) x … x ( l k,0, l k,1, …, l k,n-1 ) Factor 1 Factor k Factor 2

© 1998, Geoff Kuenning Full Factorial Designs For k factors with i th factor having n i levels - Test every possible combination of factors’ levels Captures full information about interaction A hell of a lot of work, though

Full Factorial Designs ( l 1,0, l 1,1, …, l 1,n-1 ) x ( l 2,0, l 2,1, …, l 2,n-1 ) x … x ( l k,0, l k,1, …, l k,n-1 ) Factor 1 Factor k Factor 2

© 1998, Geoff Kuenning Reducing the Work in Full Factorial Designs Reduce number of levels per factor –Generally a good choice –Especially if you know which factors are most important - use more levels for them Reduce the number of factors –But don’t drop important ones Use fractional factorial designs

© 1998, Geoff Kuenning Fractional Factorial Designs Only measure some combination of the levels of the factors Must design carefully to best capture any possible interactions Less work, but more chance of inaccuracy Especially useful if some factors are known not to interact

( l 1,0, l 1,1, …, l 1,n-1 ) x ( l 2,0, l 2,1, …, l 2,n-1 ) x … x ( l k,0, l k,1, …, l k,n-1 ) Fractional Factorial Designs Factor 1 Factor k Factor 2

© 1998, Geoff Kuenning 2 k Factorial Designs Used to determine the effect of k factors –Each with two alternatives or levels Often used as a preliminary to a larger performance study –Each factor measured at its maximum and minimum level –Perhaps offering insight on importance and interaction of various factors

© 1998, Geoff Kuenning Unidirectional Effects Effects that only increase as the level of a factor increases –Or visa versa If this characteristic is known to apply, a 2 k factorial design at minimum and maximum levels is useful Shows whether the factor has a significant effect

© 1998, Geoff Kuenning 2 2 Factorial Designs Two factors with two levels each Simplest kind of factorial experiment design Concepts developed here generalize A form of regression can be easily used here Simplest to show with an example

© 1998, Geoff Kuenning 2 2 Factorial Design Example The Time Warp Operating System Designed to run discrete event simulations in parallel Using an optimistic method Goal is fastest possible completion of a given simulation Usually quality is expressed in terms of speedup Here, the simpler metric of runtime is used

© 1998, Geoff Kuenning Factors and Levels for Time Warp Example First factor - number of nodes used to run the simulation –Vary between 8 and 64 Second factor - whether or not dynamic load management is used –To migrate work from node to node as load in the simulation changes Other factors exists, but ignore them for now

© 1998, Geoff Kuenning Defining Variables for the 2 2 Factorial TW Example if 8 nodes if 64 nodes if no dynamic load management if dynamic load management used

© 1998, Geoff Kuenning Sample Data For Example Single runs of one benchmark simulation DLM (+1) NO DLM (-1) 8 Nodes (-1)64 Nodes (+1)

© 1998, Geoff Kuenning Regression Model for Example y = q 0 + q A x A + q B x B + q AB x A x B Note this is a nonlinear model 820 = q 0 -q A - q B + q AB 217 = q 0 +q A - q B - q AB 776 = q 0 -q A + q B - q AB 197 = q 0 +q A + q B + q AB

© 1998, Geoff Kuenning Regression Model, Con’t 4 equations in 4 unknowns Another way to look at it shown in this table - ExperimentABy 1-1-1y y y y 4

© 1998, Geoff Kuenning Solving the Equations q 0 = 1/4( ) = q A = 1/4( ) = q B = 1/4( ) = -16 q AB = 1/4( ) = 6 So, y = x A - 16x B + 6x A x B

© 1998, Geoff Kuenning The Sign Table Method Another way of looking at the problem in a tabular form IABAB y Total Total/4

© 1998, Geoff Kuenning Allocation of Variation for 2 2 Model Calculate the sample variance of y Numerator is the SST - total variation SST = 2 2 q A q B q AB 2 We can use this to explain what causes the variation in y

© 1998, Geoff Kuenning Terms in the SST 2 2 q A 2 is part of variation explained by the effect of A - SSA 2 2 q B 2 is part of variation explained by the effect of B - SSB 2 2 qA B 2 is part of variation explained by the effect of the interaction of A and B - SSAB SST = SSA + SSB + SSAB

© 1998, Geoff Kuenning Variations in Our Example SST = SSA = SSB = 1024 SSAB = 144 We can now calculate the fraction of the total variation caused by each effect

© 1998, Geoff Kuenning Fractions of Variation in Our Example Fraction explained by A is 99.67% Fraction explained by B is 0.29% Fraction explained by the interaction of A and B is 0.04% So almost all the variation comes from the number of nodes So if you want to run faster, apply more nodes, don’t turn on dynamic load management

© 1998, Geoff Kuenning General 2 k Factorial Designs Used to explain the effects of k factors, each with two alternatives or levels 2 2 factorial designs are a special case Methods developed there extend to the more general case But many more possible interactions between pairs (and trios, etc.) of factors

For Discussion Tuesday March 25 Survey your proceedings for just one paper in which factorial design has been used or, if none, one in which it could have been used effectively. © 2003, Carla Ellis