1 Introducing Bayesian Nets in AgenaRisk An example based on Software Defect Prediction.

Slides:



Advertisements
Similar presentations
Reviewing Risk Measurement Concepts First Affirmative Financial Network, LLC R. Kevin OKeefe, CIMA.
Advertisements

If you are viewing this slideshow within a browser window, select File/Save as… from the toolbar and save the slideshow to your computer, then open it.
MCMC estimation in MlwiN
New Developments in Bayesian Network Software (AgenaRisk)
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Statistical Analysis and Data Interpretation What is significant for the athlete, the statistician and team doctor? important Will Hopkins
Modelling unknown errors as random variables Thomas Svensson, SP Technical Research Institute of Sweden, a statistician working with Chalmers and FCC in.
Hypothesis Testing making decisions using sample data.
1. Profile Decision-making and risk assessment under uncertainty Special expertise on software project risk assessment Novel applications of causal models.
Software Measurement: Uncertainty and Causal Modeling Koosha Golmohammadi
1 Software Maintenance and Evolution CSSE 575: Session 8, Part 3 Predicting Bugs Steve Chenoweth Office Phone: (812) Cell: (937)
Conceptual thinking about the unknown Uncertainty: expected value, sensitivity analysis, and the value of information.
Lecture 6. Hypothesis tests for the population mean  Similar arguments to those used to develop the idea of a confidence interval allow us to test the.
Stoimen Stoimenov QA Engineer QA Engineer SitefinityLeads,SitefinityTeam6 Telerik QA Academy Telerik QA Academy.
CHAPTER 19: Two-Sample Problems
Handouts Software Testing and Quality Assurance Theory and Practice Chapter 11 System Test Design
Monté Carlo Simulation MGS 3100 – Chapter 9. Simulation Defined A computer-based model used to run experiments on a real system.  Typically done on a.
1.3 ORGANIZATIONAL PLANNING & DECISION MAKING INTRODUCTION TO DECISION TREES (HIGHER LEVEL CONTENT)
S Neuendorf 2004 Prediction of Software Defects SASQAG March 2004 by Steve Neuendorf.
Software Reliability Growth. Three Questions Frequently Asked Just Prior to Release 1.Is this version of software ready for release (however “ready” is.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 24 Slide 1 Critical Systems Validation 1.
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
Calibration and Model Discrepancy Tony O’Hagan, MUCM, Sheffield.
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Software Estimation and Function Point Analysis Presented by Craig Myers MBA 731 November 12, 2007.
Risk Analysis and Technical Analysis Tanveer Singh Chandok (Director of Mentorship)
Decision Making Under Uncertainty and Risk 1 By Isuru Manawadu B.Sc in Accounting Sp. (USJP), ACA, AFM
West Virginia University Towards Practical Software Reliability Assessment for IV&V Projects B. Cukic, E. Gunel, H. Singh, V. Cortellessa Department of.
CHAPTER 17: Tests of Significance: The Basics
Fast Simulators for Assessment and Propagation of Model Uncertainty* Jim Berger, M.J. Bayarri, German Molina June 20, 2001 SAMO 2001, Madrid *Project of.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Bayesian statistics Probabilities for everything.
Why Risk Models Should be Parameterised William Marsh, Risk Assessment and Decision Analysis Research Group.
This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
BSBPMG504A Manage Project Costs 7.1 Estimate Costs Adapted from PMBOK 4 th Edition InitiationPlanning ExecutionClose Monitor Control The process of developing.
Copyright , Dennis J. Frailey CSE7315 – Software Project Management CSE7315 M16 - Version 8.01 SMU CSE 7315 Planning and Managing a Software Project.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Mistakes, Errors and Defects. 12/7/2015Mistakes, Errors, Defects, Copyright M. Smith, ECE, University of Calgary, Canada 2 Basic Concepts  You are building.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
SOFTWARE METRICS Software Metrics :Roadmap Norman E Fenton and Martin Neil Presented by Santhosh Kumar Grandai.
MATH 643 Bayesian Statistics. 2 Discrete Case n There are 3 suspects in a murder case –Based on available information, the police think the following.
WATERFALL DEVELOPMENT MODEL. Waterfall model is LINEAR development lifecycle. This means each phase must be completed before moving onto the next!!! WHAT.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Using Bayesian Nets to Predict Software Defects in Arbitrary Software Lifecycles Martin Neil Agena Ltd London, UK Web:
Slide 1 SPIN 23 February 2006 Norman Fenton Agena Ltd and Queen Mary University of London Improved Software Defect Prediction.
CSE SW Measurement and Quality Engineering Copyright © , Dennis J. Frailey, All Rights Reserved CSE8314M14 version 3.09Slide 1 SMU CSE.
CSE SW Metrics and Quality Engineering Copyright © , Dennis J. Frailey, All Rights Reserved CSE8314M13 8/20/2001Slide 1 SMU CSE 8314 /
Copyright , Dennis J. Frailey CSE Software Measurement and Quality Engineering CSE8314 M00 - Version 7.09 SMU CSE 8314 Software Measurement.
Application of Maximum Entropy Principle to software failure prediction Wu Ji Software Engineering Institute BeiHang University.
What’s the Point (Estimate)? Casualty Loss Reserve Seminar September 12-13, 2005 Roger M. Hayne, FCAS, MAAA.
MODELING AND SIMULATION CS 313 Simulation Examples 1.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Capital Budgeting and Risk Pertemuan Matakuliah: A0774/Information Technology Capital Budgeting Tahun: 2009.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 4: Estimating parameters with confidence.
PH101 lab 1 uncertainty analysis statistics. Sciencing So you have an idea. This idea must be testable... or it is not science So we test it. How good.
SUR-2250 Error Theory.
MSA / Gage Capability (GR&R)
What’s Reasonable About a Range?
Product reliability Measuring
of Heritage and New Hardware For Launch Vehicle Reliability Models
Modeling and Simulation CS 313
Critical Systems Validation
Software Reliability Models.
Critical Systems Validation
Distributed Generation Adoption Decision Making Under Uncertainty
Summary.
Presentation transcript:

1 Introducing Bayesian Nets in AgenaRisk An example based on Software Defect Prediction

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

A Bayesian Net for predicting air traffic incidents

A Detailed Example What follows is a demo of a simplified version of a Bayesian net model to provide more accurate predictions of software defects Many organisations worldwide have now used models based around this one

Predicting software defects Operational defects The number of operational defects (i.e. those found by customers) is what we are really interested in predicting

Residual Defects Operational defects We know this is clearly dependent on the number of residual defects. Predicting software defects

Residual Defects Operational defectsOperational usage But it is also critically dependent on the amount of operational usage. If you do not use the system you will find no defects irrespective of the number there. Predicting software defects

Residual Defects Defects IntroducedOperational defectsOperational usage Predicting software defects The number of residual defects is determined by the number you introduce during development….

Residual Defects Defects found and fixed Defects IntroducedOperational defectsOperational usage Predicting software defects …minus the number you successfully find and fix

Residual Defects Defects found and fixed Defects IntroducedOperational defectsOperational usage Obviously defects found and fixed is dependent on the number introduced Predicting software defects

Residual Defects Problem complexity Defects found and fixed Defects IntroducedOperational defectsOperational usage The number introduced is influenced by problem complexity… Predicting software defects

Residual Defects Problem complexity Defects found and fixed Defects Introduced Design process quality Operational defectsOperational usage ….and design process quality Predicting software defects

Residual DefectsTesting Effort Problem complexity Defects found and fixed Defects Introduced Design process quality Operational defectsOperational usage Finally, how many defects you find is influenced not just by the number there to find but also by the amount of testing effort Predicting software defects

A Model in action Here is that very simple model with the probability distributions shown

A Model in action We are looking at an individual software component in a system

A Model in action The prior probability distributions represent our uncertainty before we enter any specific information about this component.

A Model in action So the component is just as likely to have very high complexity as very low

A Model in action and the number of defects found and fixed in testing is in a wide range where the median value is about 20.

A Model in action As we enter observations about the component the probability distributions update

Here we have entered the observation that this component had 0 defects found and fixed in testing

Note how the other distributions changed.

The model is doing forward inference to predict defects in operation…..

..and backwards inference to make deductions about design process quality.

but actually the most likely explanation is very low testing quality.

…and lower than average complexity.

But if we find out that the complexity is actually high…..

then the expected number of operational defects increases

and we become even more convinced of the inadequate testing

So far we have made no observation about operational usage.

If, in fact, the operational usage is high…

Then we have an example of a component with no defects in test..

…but probably many defects in operation.

But suppose we find out that the test quality was very high.

Then we completely revise out beliefs

We are now pretty convinced that the module will be fault free in operation

…And the ‘explanation’ is that the design process is likely to be very high quality

A Model in action we reset the model and this time use the model to argue backwards

A Model in action Suppose we know that this is a critical component that has a requirement for 0 defects in operation…

The model looks for explanations for such a state of affairs.

The most obvious way to achieve such a result is to not use the component much.

But if we know it will be subject to high usage…

Then the model adjusts the beliefs about the other uncertain variables.

A combination of lower than average complexity…..

…Higher than average design quality…..

and much higher than average testing quality …..

But suppose we cannot assume our testing is anything other than average…

Then better design quality …..

..and lower complexity are needed …..

But if complexity is very high …..

…Then we are left with a very skewed distribution for design process quality.

What the model is saying is that, if these are the true requirements for the component then you are very unlikely to achieve them unless you have a very good design process

Making better decisions That was a simplified version of model produced for Philips Helped Philips make critical decisions about when to release software for electronic components 95% accuracy in defect prediction – much better than can be achieved by traditional statistical methods

Model Implementation In AgenaRisk