IV&V Facility 1 Software Reliability Corroboration Bojan Cukic, Erdogan Gunel, Harshinder Singh, Lan Guo West Virginia University Carol Smidts University.

Slides:

Advertisements

Similar presentations

Chapter 2 The Process of Experimentation

Advertisements

Chapter 4 Quality Assurance in Context

INFERENCE: SIGNIFICANCE TESTS ABOUT HYPOTHESES Chapter 9.

1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.

1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.

Software Quality Assurance (SQA). Recap SQA goal, attributes and metrics SQA plan Formal Technical Review (FTR) Statistical SQA – Six Sigma – Identifying.

SE 450 Software Processes & Product Metrics Reliability: An Introduction.

Elementary hypothesis testing

Swami NatarajanJune 17, 2015 RIT Software Engineering Reliability Engineering.

SE 450 Software Processes & Product Metrics Reliability Engineering.

Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.

SQM - 1DCS - ANULECTURE Software Quality Management Software Quality Management Processes V & V of Critical Software & Systems Ian Hirst.

Swami NatarajanJuly 14, 2015 RIT Software Engineering Reliability: Introduction.

West Virginia University A Bayesian Approach to Reliability Predication of Component Based Systems H. Singh, V. Cortellessa, B. Cukic, E. Gunel, V. Bharadwaj.

Software causes many failures - significant mission risk Hard to quantify effects on system risk of: software defects software development practices software.

Change Request Management

Capability Maturity Model

Chapter 22. Software Reliability Engineering (SRE)

Software Reliability Growth. Three Questions Frequently Asked Just Prior to Release 1.Is this version of software ready for release (however “ready” is.

©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 24 Slide 1 Critical Systems Validation 1.

Section 2: Science as a Process

Confidence Intervals and Hypothesis Testing - II

1 STATISTICAL HYPOTHESES AND THEIR VERIFICATION Kazimieras Pukėnas.

Achieving Better Reliability With Software Reliability Engineering Russel D’Souza Russel D’Souza.

SAS 03/ GSFC/SATC-ERAU-DoC Fault Tree Analysis Application for Safety and Reliability Massood Towhidnejad Embry-Riddle University Dolores Wallace & Al.

N By: Md Rezaul Huda Reza n

Independent Verification and Validation (IV&V) Techniques for Object Oriented Software Systems SAS meeting July 2003.

1 Validation & Verification Chapter VALIDATION & VERIFICATION Very Difficult Very Important Conceptually distinct, but performed simultaneously.

VTT-STUK assessment method for safety evaluation of safety-critical computer based systems - application in BE-SECBS project.

Verification and Validation Overview References: Shach, Object Oriented and Classical Software Engineering Pressman, Software Engineering: a Practitioner’s.

MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 19.

1 Department of Electrical and Computer Engineering University of Virginia Software Quality & Safety Assessment Using Bayesian Belief Networks Joanne Bechta.

Scientific Inquiry Mr. Wai-Pan Chan Scientific Inquiry Research & Exploratory Investigation Scientific inquiry is a way to investigate things, events.

1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.

Generic Approaches to Model Validation Presented at Growth Model User’s Group August 10, 2005 David K. Walters.

Introduction Osborn. Daubert is a benchmark!!!: Daubert (1993)- Judges are the “gatekeepers” of scientific evidence. Must determine if the science is.

Slide 1V&V 10/2002 Software Quality Assurance Dr. Linda H. Rosenberg Assistant Director For Information Sciences Goddard Space Flight Center, NASA

West Virginia University Towards Practical Software Reliability Assessment for IV&V Projects B. Cukic, E. Gunel, H. Singh, V. Cortellessa Department of.

Chapter 1 Measurement, Statistics, and Research. What is Measurement? Measurement is the process of comparing a value to a standard Measurement is the.

SINTEF Telecom and Informatics EuroSPI’99 Workshop on Data Analysis Popular Pitfalls of Data Analysis Tore Dybå, M.Sc. Research Scientist, SINTEF.

Statistical Hypotheses & Hypothesis Testing. Statistical Hypotheses There are two types of statistical hypotheses. Null Hypothesis The null hypothesis,

Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.

1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 

Estimating Component Availability by Dempster-Shafer Belief Networks Estimating Component Availability by Dempster-Shafer Belief Networks Lan Guo Lane.

Verification and Validation Assuring that a software system meets a user's needs.

IV&V Facility 26SEP071 Validation Workshop Dr. Butch Caffall Director, NASA IV&V Facility 26SEP07.

Chapter 10 Verification and Validation of Simulation Models

Uncertainty Management in Rule-based Expert Systems

1 Report on results of Discriminant Analysis experiment. 27 June 2002 Norman F. Schneidewind, PhD Naval Postgraduate School 2822 Racoon Trail Pebble Beach,

©2010 John Wiley and Sons Chapter 2 Research Methods in Human-Computer Interaction Chapter 2- Experimental Research.

Research Heaven, West Virginia A Framework for Early Reliability Assessment Bojan Cukic, Erdogan Gunel, Harshinder Singh, Lan Guo, Dejan Desovski West.

CSE SW Metrics and Quality Engineering Copyright © , Dennis J. Frailey, All Rights Reserved CSE8314M13 8/20/2001Slide 1 SMU CSE 8314 /

Copyright , Dennis J. Frailey CSE Software Measurement and Quality Engineering CSE8314 M00 - Version 7.09 SMU CSE 8314 Software Measurement.

Review on Test-Based Approach of Software Reliability November 22 nd, 2010 Nuclear I&C and Information Engineering LabKAIST Bo Gyung Kim.

Establishing by the laboratory of the functional requirements for uncertainty of measurements of each examination procedure Ioannis Sitaras.

Calculation of Software Failure Probability and Test Case Selection February 14, 2007 Kim, Sung Ho.

Change Request Management

Leacock, Warrican and Rose (2009)

Achieving High Software Reliability

Verification and Validation Overview

Software Independent Verification and Validation (IV&V)

Quality Risk Management

Chapter 10 Verification and Validation of Simulation Models

Critical Systems Validation

When we free ourselves of desire,

Critical Systems Validation

Capability Maturity Model

Rai University , November 2014

Capability Maturity Model

Presentation transcript:

IV&V Facility 1 Software Reliability Corroboration Bojan Cukic, Erdogan Gunel, Harshinder Singh, Lan Guo West Virginia University Carol Smidts University of Maryland (WVU UI: Integrating Formal Methods and Testing in a Quantitative Software Reliability Assessment Framework 2002)

IV&V Facility 2 Overview Introduction and motivation. Software reliability assessment and NASA IV&V. Bayesian hypothesis testing approach. A methodology for formulating priors. Case study Accounting for severities and risks. Summary

IV&V Facility 3 Introduction Improvement of software V&V practices, especially for high assurance systems. Quantification of the effects of V&V activities is always desirable. Is software reliability quantification practical for safety/mission critical systems? –Time and cost considerations may limit the appeal. Reliability growth applicable only to integration testing, the tail end of V&V. Estimation of operational usage profiles is rare.

IV&V Facility 4 Is SRE Impractical for NASA IV&V? Most IV&V techniques are qualitative in nature. Mature software reliability estimation methods based exclusively on operational (system) testing. Neglects the investment made in other IV&V techniques –Requirements readings, inspections, problem reports and tracking, unit level tests… Req Design Code Test (Verification & Validation) Unit IntegrationAcceptance Life cycle long IV&V Implementation Tradional SW Rel. Assessment

IV&V Facility 5 Regulatory Viewpoint Regulatory view: DO178-B (software considerations in airborne systems and equipment certification) “… methods for estimating the post-verification probabilities of software errors were examined. The goal was to develop numerical requirements for such probabilities for software in computer-based airborne systems of equipment. The conclusion reached, however, was that currently available methods do not provide results in which the confidence can be placed to the level required for this purpose... If the applicant proposes to use software reliability models for certification credit, rationale for the model should be included in the plan for software aspects of certification, and agreed with by the certification authority.”

IV&V Facility 6 Contribution Develop software reliability assessment methods that build on: –Stable and mature development environments. –Lifecycle long IV&V activities. –Utilize all relevant available information. –Qualitative (formal and informal) IV&V methods? Strengthening the case for IV&V all across NASA enterprise.

IV&V Facility 7 Assessment vs. Corroboration Current thinking –Software reliability “tested into” the product through the integration and acceptance testing. Our thinking –Why “waste” the results of all the qualitative IV&V activities. –Testing should corroborate that the life-cycle long IV&V techniques are giving the “usual” results, that the project follows usual quality patterns.

IV&V Facility 8 Reliability Assessment (No Prior Assumptions) P(    Required testing effort (N), from random sampling: Number of failure free test cases as a function of the required failure rate, with C=0.99 Value of  Number of Tests , , , ,605,167 Required testing effort not realistic.

IV&V Facility 9 Bayesian Inference Allows for the inclusion of a subjective probability of failure. Subjective estimate based on observed behavior, reflects beliefs. Hypothesis on the event occurrence probability is combined with new evidence, which may change the degree of belief..

IV&V Facility 10 Bayesian Estimation (Non Ignorance Priors) Needs the following assumption: –The system has achieved desired reliability prior to acceptance testing. –This “guess” should be “reasonably accurate.” Use random tests (operational profile) to corroborate assumed system failure probability. How many failure free random tests U should be performed?

IV&V Facility 11 Benefits What if corroboration testing is not failure free? –Keep adjusting the target number of tests [Littlewood 97, 98]

IV&V Facility 12 Bayesian Hypothesis Testing (BHT) Problem of Bayesian estimation: –Categorical assumption that the program meets required reliability. BHT makes this a probability statement, P(H 0 ). Corroboration testing now looks for the evidence in favor of the hypothesized reliability. H o :   o alternative hypothesis.

IV&V Facility 13 The number of corroboration tests according to BHT theory.  P(H o ) n o n 1 n

IV&V Facility 14 Formulating Priors Formulation of prior beliefs is the most important research issue. Historical data on failure occurrences under the same V&V regime. –Historical data on failure occurrence reduction related to the application of specific verification techniques (very few studies). –Process effectiveness measures [Smidts 98]. –Transforming fault density into failure intensity [Smidts 01]. –Represent the application of a specific verification method by an appropriate number of random tests [Miller et. al. 94].

IV&V Facility 15 Can This Be Done? Is it realistic to expect software developers to hypothesize on the operational reliability? Experiment (Smidts et. al.). –A panel of experts ranked 32 measures related to software reliability. Ranks normalized to a [0, 1] range. –Highly ranked measures: Failure rate (0.98), test coverage (0.90), fault density (0.73). –Low ranked measures: Mutation testing(0.48), function point analysis (0.00), bugs per line of code (Gaffney estimate, 0.00).

IV&V Facility 16 Controlled Experiment A company contracted to develop a program (smart card based access control system, PACS). –Controlled requirements document (NSA specs). Five software engineering measures monitored: –Defect density, test coverage, requirements traceability, function points, Gaffney. Each measure can used within a reliability prediction system (RPS). Accurate RPS. –Defect density, test coverage and requirements traceability. Inaccurate RPS: function points and Gaffney.

IV&V Facility 17 Software Reliability Corroboration Accurate predictors are adequate for the corroboration approach. –A weighted linear combination of the three measures (RPS) gives a very accurate reliability prediction. –Low levels of trust in the prediction accuracy. No experience in repeatability. Low value of P(H 0 ) still requires substantial but realistic reliability corroboration effort.

IV&V Facility 18 Accounting for Failure Severities Not all the failures encountered in corroboration testing are equally important. Instead of counting generic failures, test failures stratified according to their severity. If a high severity failure encountered in corroboration testing, strong evidence in favor of the alternative hypothesis. Allows for the tolerance towards low severity failures.

IV&V Facility 19 Approach Recap Software quality Measures (SQM) Reliability Prediction Systems (RPS) RPS Combination Techniques SW Reliability Corroboration Testing SQM1 SQM3 SQM2 SQM4 SQM6 SQM5 SQMi SQMj RPS1RPS2RPSkRPSm... RPS Combination (Experience, Learning, Dempster-Schafer…) BHT software reliability corroboration Null Hypothesis, H 0 Alternative Hypothesis, H a Software Development Lifecycle Software Reliability Measure

IV&V Facility 20 Status and Perspectives Software reliability corroboration allows: –Inclusion of IV&V quality measures and activities into the reliability assessment. –A significant reduction in the number of (corroboration) tests. –Software reliability of safety/mission critical systems can be assessed with a reasonable effort. Research directions. –Sound formulation of prior beliefs from IV&V. –Further experimentation (other measures, repetition). –Can prior beliefs be based on the “formality” of the IV&V methods (formal methods)?