Rescaling Reliability Bounds for a New Operational Profile Peter G Bishop Adelard, Drysdale Building, Northampton Square,

Slides:



Advertisements
Similar presentations
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Advertisements

Integration of sensory modalities
1 Finite-Length Scaling and Error Floors Abdelaziz Amraoui Andrea Montanari Ruediger Urbanke Tom Richardson.
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.
A Randomized Linear-Time Algorithm to Find Minimum Spanning Trees David R. Karger David R. Karger Philip N. Klein Philip N. Klein Robert E. Tarjan.
Boundary Value Testing A type of “Black box” functional testing –The program is viewed as a mathematical “function” –The program takes inputs and maps.
Topic 6: Introduction to Hypothesis Testing
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Simulation Where real stuff starts. ToC 1.What, transience, stationarity 2.How, discrete event, recurrence 3.Accuracy of output 4.Monte Carlo 5.Random.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Chapter 7 Sampling and Sampling Distributions
Point and Confidence Interval Estimation of a Population Proportion, p
An Experimental Evaluation on Reliability Features of N-Version Programming Xia Cai, Michael R. Lyu and Mladen A. Vouk ISSRE’2005.
Efficient Estimation of Emission Probabilities in profile HMM By Virpi Ahola et al Reviewed By Alok Datar.
Michael Ernst, page 1 Improving Test Suites via Operational Abstraction Michael Ernst MIT Lab for Computer Science Joint.
1 EE 616 Computer Aided Analysis of Electronic Networks Lecture 9 Instructor: Dr. J. A. Starzyk, Professor School of EECS Ohio University Athens, OH,
Discrete Probability Distributions
Part III: Inference Topic 6 Sampling and Sampling Distributions
DAST 2005 Week 4 – Some Helpful Material Randomized Quick Sort & Lower bound & General remarks…
Multiple Regression and Correlation Analysis
Lecture Slides Elementary Statistics Twelfth Edition
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Use of Quantile Functions in Data Analysis. In general, Quantile Functions (sometimes referred to as Inverse Density Functions or Percent Point Functions)
Regression and Correlation Methods Judy Zhong Ph.D.
ECE355 Fall 2004Software Reliability1 ECE-355 Tutorial Jie Lian.
Models for Software Reliability N. El Kadri SEG3202.
Software Reliability Model Deterministic Models Probabilistic Models Halstead’s software metric McCabe’s cyclomatic complexity metrix Error seeding Failure.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 12-1 Chapter 12 Simple Linear Regression Statistics for Managers Using.
Path selection criteria Tor Stålhane & ‘Wande Daramola.
by B. Zadrozny and C. Elkan
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
Software Reliability SEG3202 N. El Kadri.
Modeling and Simulation CS 313
What is Software Testing? And Why is it So Hard J. Whittaker paper (IEEE Software – Jan/Feb 2000) Summarized by F. Tsui.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Random Sampling, Point Estimation and Maximum Likelihood.
Name: Angelica F. White WEMBA10. Teach students how to make sound decisions and recommendations that are based on reliable quantitative information During.
Basic Probability (Chapter 2, W.J.Decoursey, 2003) Objectives: -Define probability and its relationship to relative frequency of an event. -Learn the basic.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
EQT 373 Chapter 3 Simple Linear Regression. EQT 373 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value.
Statistical Applications Binominal and Poisson’s Probability distributions E ( x ) =  =  xf ( x )
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Example: Bioassay experiment Problem statement –Observations: At each level of dose, 5 animals are tested, and number of death are observed.
Software Reliabilty1 Software Reliability Advanced Software Engineering COM360 University of Sunderland © 1998.
ME Mechanical and Thermal Systems Lab Fall 2011 Chapter 3: Assessing and Presenting Experimental Data Professor: Sam Kassegne, PhD, PE.
Lecture 10: Correlation and Regression Model.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
1 / 12 Michael Beer, Vladik Kreinovich COMPARING INTERVALS AND MOMENTS FOR THE QUANTIFICATION OF COARSE INFORMATION M. Beer University of Liverpool V.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
Path selection criteria Tor Stålhane & Wande Daramola.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Chapter 15 Running Time Analysis. Topics Orders of Magnitude and Big-Oh Notation Running Time Analysis of Algorithms –Counting Statements –Evaluating.
Prediction of Interconnect Net-Degree Distribution Based on Rent’s Rule Tao Wan and Malgorzata Chrzanowska- Jeske Department of Electrical and Computer.
Statistical Modelling
Software Testing.
Data Mining: Concepts and Techniques
Discussions on Software Reliability
Input Space Partition Testing CS 4501 / 6501 Software Testing
12 Inferential Analysis.
Chapter 10 Verification and Validation of Simulation Models
Software Reliability Models.
Statistical Methods For Engineers
Software Testing (Lecture 11-a)
Reasoning in Psychology Using Statistics
Integration of sensory modalities
12 Inferential Analysis.
Simple Linear Regression
LECTURE 09: BAYESIAN LEARNING
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

Rescaling Reliability Bounds for a New Operational Profile Peter G Bishop Adelard, Drysdale Building, Northampton Square, London EC1V 0HB

2 Outline of Talk l Original reliability bound theory (same op. profile) l Extended theory (different operational profile) l Implications of the theory l Experimental evaluation

3 Original Theory Operational profile (I) Defect Observed defect failure frequency    D 1 D 2 D 3 Input Domain

4 Theory assumptions the operational profile is invariant, i.e.  s are constant over time l when a failure occurs the associated defect is immediately and perfectly corrected removal of a defect does not affect the s of the remaining defects

5 Basic idea Given some test interval t : Defects with large s will be removed already Defects with small s will remain - but have little affect on program reliability So there must be an “worst case” for a defect that maximises the program failure rate after t

6 Worst-case bound Original paper showed that, given the assumptions, max failure /unit time for a defect i is:  i | t  1/et (where t is the test time) So if there are N faults in the program the failure rate at time t is bounded by:  | t  N/et

7 Bound is independent of t Probablity of failure  | t =0.1 =0.01 = /et

8 Refinement for discrete tests l For for a discrete sequence of T tests the result is:  |T  N (T/T+1) T /(T+1)  N/(eT) (conservative approx.) l So it is conservative to use original equation.

9 Limitations Assumes operational profile I is constant hence s are constant l But we know that in practice the profile changes. l So the reliability bound does not apply if the operational profile changes l (e.g. from system test to actual use) l but will “settle back” in long term if new profile stable l New theory gives a means for “rescaling” the reliability bound for a different profile

10 Additional assumptions l Each defect is localised to a single code “block” The operational profile I can be characterised by the distribution of code block executions Q in the program { q(1), q(2), … } The failure rate of defect in block,  i  q(i) l There is a constant probability of a fault existing in any line of executable code.

11 Rescaling for known defect l For a defect i in code block j, the re-scaled bound would be: where q’(j) is the new execution rate and q(j) is the old execution rate.

12 Probability of defect in block We do not know which block contains defect i, but we assume that the chance of being in j is: L(j)/L where L(j) is the length of the code block, and L is the total length of the executable code.

13 Re-scaled bound l Taking the average over all blocks: l So the “scale factor” relative to the original bound is: l Also true if there are N faults rather than 1   L jL jq ( jq)( )( )

14 Theory predictions - Fair testing l If q  L of blocks “dominated” by decision branch, scale factor unchanged by any other profile l Applies to any acyclic graph, l And subgraphs with fixed iteration loops Segment j L(j). q(j) q ’ (j) L(j)q ’ (j) q(j) Root Branch Branch Sum 110 S =Sum/L1

15 l Use of “unbalanced” test profile can be very sensitive to changes in profile l Factor can be less than 1 if under-tested blocks avoided, e.g. Q’={1,1,0} gives S = 0.19 Unfair testing Segment j L(j) q(j) Root Branch Branch Sum S =Sum/L7.5 q ’ (j) L q ’ /q

16 Limits to fair test approach l Fair test apportionment does not work for variable loops, recursion and subroutines l Even if we identify a fair test profile, it may be infeasible to execute Decisions not independent (shared variable)

17 Maximum scale factor l If we know max. possible execution rates for each block, can estimate a “maximum scale factor”:  ( q(k) max / q(k) ) (L(k) / L) l Where k relates to a worst case “thread” through the graph. Hard to identify this thread, but easier to compute a more pessimistic factor:  ( q(j) max / q(j) ) (L(j) / L) where j includes all blocks. l No knowledge of the new profile is needed

18 Including module tests l Can combine module tests and system tests, composite scale factor is: where x(j) are the total executions under module testing l Module tests can “fill in” uncovered segments that would make the test profile “unbalanced” Tjxjq jq L jL /)()( )()(   

19 Experimental evaluation l Use programs with known set of defects l PODS l simple reactor trip application (<1000 code lines) l simple structure, fixed loops l PREPRO l ~ code lines l parses input description file of indefinite length l recursive - max execution unknown l Similar results - will only discuss PODS here

20 PODS evaluation l Measure Q for different test profiles l Uniform, Normal, Inverse normal - “bathtub” Measure defect failure rates (i) under all profiles Predict residual failure rate:  (i) exp(- (i)T) Compute failure rate for new profile:  ’ (i) exp(- (i)T) Compare with scaled bound:  ( L(j)/L)(q ’ (j)/q(j))N/eT

21 Variation in q(j)

22 Predicted scale factors Operational profile Test profile uniforminv-normal normal uniform inv-normal normal l Note the predicted reduction in bound

23 Maximum scale factor Test profileMax scale-up factor uniform6.6 inv-normal10.0 normal1059 l 2-5 times worst than bound with a known profile l Can be over-pessimistic l But could indicate relative sensitivity to change

24 “Unfair” Normal test profile N/et bound Scaled bound Max bound

25 “Fairer” Uniform test profile N/et bound Scaled bound Max bound

26 PREPRO l Similar results l changes in failure rates are within the scaled bounds l But could not compute a maximum bound l program is recursive l so no upper bound on the execution of program code blocks

27 Summary Theory suggests: l Can rescale bound (knowing Q and Q’) l Can include module test execution information l Can compute max scale up (knowing Q and Q max ) l For some program structures can identify a totally "fair" test profile - bound insensitive to change The experimental evaluations appear to be consistent with the predictions of the theory

28 Conclusions l Could affect approach to testing: l “fairer” test profiles rather than realistic profiles l integrated module and system test strategy l Could improve reliability bound prediction for new environment l Could assess sensitivity to profile change l e.g. by computing maximum scale factor l But based on quite strong assumptions, need to: l validate assumptions l assess impact of assumption violation l evaluate on more examples

29