Main Effects Screening: A Distributed Continuous Quality Assurance Process for Monitoring Performance Degradation in Evolving Software Systems Cemal Yilmaz,

Slides:



Advertisements
Similar presentations
AI Pathfinding Representing the Search Space
Advertisements

G. Alonso, D. Kossmann Systems Group
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Best-First Search: Agendas
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
1 Experimental Methodology H Experimental methods can be used to: – demonstrate that a new concept, technique, or algorithm is feasible –demonstrate that.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 11: Monitoring Server Performance.
Chapter 8 Two-Level Fractional Factorial Designs
CHAPTER 7: SAMPLING DISTRIBUTIONS. 2 POPULATION AND SAMPLING DISTRIBUTIONS Population Distribution Sampling Distribution.
Chapter 7 Blocking and Confounding in the 2k Factorial Design
Chapter 28 Design of Experiments (DOE). Objectives Define basic design of experiments (DOE) terminology. Apply DOE principles. Plan, organize, and evaluate.
Fractional factorial Chapter 8 Hand outs. Initial Problem analysis Eyeball, statistics, few graphs Note what problems are and what direction would be.
Introduction : ‘Skoll: Distributed Continuous Quality Assurance’ Morimichi Nishigaki.
Thick v Thin Access Points Lab Last Update Copyright 2014 Kenneth M. Chipps Ph.D.
Hands-On Microsoft Windows Server 2008 Chapter 11 Server and Network Monitoring.
Selecting and Implementing An Embedded Database System Presented by Jeff Webb March 2005 Article written by Michael Olson IEEE Software, 2000.
Factorial Experiments
9.6 Counting Principles Permutations Combinations.
Fractional Factorial Experiments (Continued) The concept of design resolution is a useful way to categorize fractional factorial designs. The higher the.
Chapter 8Design and Analysis of Experiments 8E 2012 Montgomery 1 Design of Engineering Experiments The 2 k-p Fractional Factorial Design Text reference,
Chapter 3: Screening Designs
DOX 6E Montgomery1 Design of Engineering Experiments Part 7 – The 2 k-p Fractional Factorial Design Text reference, Chapter 8 Motivation for fractional.
Computer Science Open Research Questions Adversary models –Define/Formalize adversary models Need to incorporate characteristics of new technologies and.
Dr. Gary Blau, Sean HanMonday, Aug 13, 2007 Statistical Design of Experiments SECTION V SCREENING.
1 Confounding In an unreplicated 2 K there are 2 K treatment combinations. Consider 3 factors at 2 levels each: 8 t.c’s If each requires 2 hours to run,
Statistical Analysis Professor Lynne Stokes Department of Statistical Science Lecture 14 Sequential Experimentation, Screening Designs, Fold-Over Designs.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
Dr. Gary Blau, Sean HanMonday, Aug 13, 2007 Statistical Design of Experiments SECTION IV FACTORIAL EXPERIMENTATION.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
AN EXTENDED OPENMP TARGETING ON THE HYBRID ARCHITECTURE OF SMP-CLUSTER Author : Y. Zhao 、 C. Hu 、 S. Wang 、 S. Zhang Source : Proceedings of the 2nd IASTED.
The Essentials of 2-Level Design of Experiments Part II: The Essentials of Fractional Factorial Designs The Essentials of 2-Level Design of Experiments.
1 Chapter 3: Screening Designs 3.1 Fractional Factorial Designs 3.2 Blocking with Screening Designs.
Fractional Factorial Design Full Factorial Disadvantages Full Factorial Disadvantages –Costly (Degrees of freedom wasted on estimating higher order terms)
TRIANGLES, PROBABILITY, AND AMAZEMENT A CONNECTED EXPERIENCE FOR THE CLASSROOM JIM RAHN
Statistical Analysis Professor Lynne Stokes
Geo597 Geostatistics Ch9 Random Function Models.
Sums of Squares. Sums of squares Besides the unweighted means solution, sums of squares can be calculated in various ways depending on the situation and.
Lecture 9 Page 1 CS 239, Spring 2007 More Experiment Design CS 239 Experimental Methodologies for System Software Peter Reiher May 8, 2007.
SOFTWARE DESIGN AND ARCHITECTURE LECTURE 05. Review Software design methods Design Paradigms Typical Design Trade-offs.
IE341 Midterm. 1. The effects of a 2 x 2 fixed effects factorial design are: A effect = 20 B effect = 10 AB effect = 16 = 35 (a) Write the fitted regression.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 11: Monitoring Server Performance.
1 Resolution III Designs Designs with main effects aliased with two- factor interactions Used for screening (5 – 7 variables in 8 runs, variables.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
Iperf Quick Mode Ajay Tirumala & Les Cottrell. Sep 12, 2002 Iperf Quick Mode at LBL – Les Cottrell & Ajay Tirumala Iperf QUICK Mode Problem – Current.
1 The 2 k-p Fractional Factorial Design Text reference, Chapter 8 Motivation for fractional factorials is obvious; as the number of factors becomes large.
The American University in Cairo Interdisciplinary Engineering Program ENGR 592: Probability & Statistics 2 k Factorial & Central Composite Designs Presented.
II.3 Screening Designs in Eight Runs: Other Screening Designs in 8 runs  In addition to 5 factors in 8 runs, Resolution III designs can be used to study.
CISC Machine Learning for Solving Systems Problems Presented by: Suman Chander B Dept of Computer & Information Sciences University of Delaware Automatic.
Fractional Factorial Designs Andy Wang CIS 5930 Computer Systems Performance Analysis.
Experimentation in Computer Science (Part 2). Experimentation in Software Engineering --- Outline  Empirical Strategies  Measurement  Experiment Process.
1 Modeling change Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy
The SALT Development Committee Recommendations Regarding New Instruments (Matt Bershady) Rob Fesen Janusz Kaluzny Wolfram Kollatschney Darragh O’Donoghue.
L. M. LyeDOE Course1 Design and Analysis of Multi-Factored Experiments Fractional Factorials Not Based on the Powers of 2 – Irregular Designs.
1 Statistics 262: Intermediate Biostatistics Mixed models; Modeling change.
ETM U k factorials Recall our example from last time … Estimate the effects Determine significant effects Develop regression model Examine.
Designs for Experiments with More Than One Factor When the experimenter is interested in the effect of multiple factors on a response a factorial design.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
LECTURE 11: LINEAR MODEL SELECTION PT. 1 March SDS 293 Machine Learning.
1 Chapter 8 Two-level Fractional Factorial Designs.
Algebra. Factorials are noted by the expression n! and reads “n factorial”
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
Tom Van Steenkiste Supervisor: Predrag Buncic

IET 603 Minitab Chapter 12 Christopher Smith
Tools for Performance, Load Testing, Stress Testing Using Telerik Test Studio Pavel Pankov QA Lead Automated Testing Team Telerik QA Academy.
Rule Induction for Classification Using
Simulation based verification: coverage
Fractional Factorial Design
Design matrix Run A B C D E
Presentation transcript:

Main Effects Screening: A Distributed Continuous Quality Assurance Process for Monitoring Performance Degradation in Evolving Software Systems Cemal Yilmaz, Arvind S. Krishnaz, Atif Memon, Adam Porter, Douglas C. Schmidt, Aniruddha Gokhale, Balachandran Natarajan Presented By: Walaa El-Din M. Moustafa

Objective  As software systems change, developers often run regression tests to detect unintended functional side effects.  QA efforts can be confounded by the enormous configuration space.  Time and resource constraints severely limit the number of configurations that can be examined.

Main Effects Screening  Screening Designs are Highly economical Can reveal important low order effects that strongly affect performance.

Low Order Effects  First-, second-, or third-order effects.  n th -order effect is an effect caused by the simultaneous interaction of n factors.

Low Order Effects Example  For certain web server applications: A 1 st -order effect might be that performance slows considerably when logging is turned on and another might be that it also slows when few server threads are used. A 2 nd order effect involves the interaction of two options, e.g., web server performance may slow down when caching is turned off and the server performs blocking reads.

Factorial Designs  Full factorial design involving k binary factors. Such a design exhaustively tests all combinations.  Fractional factorial designs. These designs use only a carefully selected fraction (such as 1/2 or 1/4) of a full factorial design.

Fractional Factorial Designs  Save money  Do so by giving up the ability to measure some higher-order effects.

Fractional Factorial Design Example  To show how screening designs are computed, we present a hypothetical example of a performance-intensive software system: 4 binary configuration options, A through D. No inter-option constraints. The full configuration space therefore has 2 4 = 16 configurations.

Fractional Factorial Design Example  8 configurations.  Will focus only on 1st-order effects.  This design is referred to as a design, where 4 refers to the total number of options we will examine and the -1 (2 -1 = 1/2) indicates the fraction of the full factorial over which we will collect data.

Fractional Factorial Design Example  We can extend the design and estimate the effect of option D without going to a 2 4 full factorial design.

Fractional Factorial Design Example  Design Generator: Specifies the aliasing patterns used to build the design For this example, we select the design generator D = ABC.

Fractional Factorial Design Example

 The design we described above is a resolution IV design.  In resolution R designs, no effects involving i factors are aliased with effects involving less than R - i factors.

Fractional Factorial Design Example  Run the experiment  For binary options (with settings - or +), the main effect of option A, ME(A), is ME(A) = z(A-) - z(A+) where z(A-) and z(A+) are the mean values of the observed data over all runs where option A is (-) and where option A is (+), respectively.  If desired, 2 nd order effects can be calculated in a similar way. The interaction effect of option A and B, INT(A, B) is: = ½ (ME(B|A+) - ME(B|A-)) = ½ (ME(A|B+) - ME(A|B-))

The Experiment  ACE v5.4 + TAO v1.4 + CIAO v0.4.  14 binary run-time options 2 14 = 16,384 different configurations.  For each task measure: Message latency. Throughput between the client and the server.

The Experiment

The Full Data Set

Evaluating Screening Designs  Can screening designs correctly identify the important options discovered during the analysis of the full data set?

Evaluating Screening Designs  Scr32 is a IV with design generators F = ABC, G = ABD, H = ACD, I = BCD, J = ABE, K = ACE, L = BCE, M = ADE, N = BDE.  Scr64 is a IV with design generators G = ABC, H = ABD, I = ABE, J = ACDE, K = ABF, L = ACDF, M = ACEF, N = ADEF.  Scr128 is a IV with design generators H = ABC, I = ABDE, J = ABDF, K = ACEF, L = ACDG, M = ABEFG, N = BCDEFG.

Evaluating Screening Designs

 Screening designs can detect important options at a large fraction of the cost of exhaustive testing.  The smaller the effect, the larger the run size needed to identify it.

Estimating Performance with Screening Suites  The estimates are generated by examining all combinations of the most important options, while defaulting the settings of the unimportant options  Make the estimates based on benchmarking either 4 (all combinations of options B and J) or 32 (all combinations of options B, J, C, I, and F) configurations.

Estimating Performance with Screening Suites

 The distributions of the top-5 and top-2 screening suites closely track the overall performance data.

Screening Suites vs. Random Sampling

 These graphs suggest the obvious weakness of random sampling.

Dealing with Evolving Systems  Detect performance degradations in evolving systems quickly.

Dealing with Evolving Systems  Developer records and problem reports indicate that problems were noticed on 12/14/03.

Higher­Order Effects  Resolution VI design  Increased the run size to 2,048 to capture the second-order effects.

Higher­Order Effects

 The important interaction effects involve only options that are already considered important by themselves  The screening design correctly identifies the 5 most important pairwise interactions at 1/8 th the cost of exhaustive testing.

Thank you