1. Profile Decision-making and risk assessment under uncertainty Special expertise on software project risk assessment Novel applications of causal models.

Slides:



Advertisements
Similar presentations
1 Introducing Bayesian Nets in AgenaRisk An example based on Software Defect Prediction.
Advertisements

CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Cadence Design Systems, Inc. Why Interconnect Prediction Doesn’t Work.
Modelling unknown errors as random variables Thomas Svensson, SP Technical Research Institute of Sweden, a statistician working with Chalmers and FCC in.
Model Assessment, Selection and Averaging
Software Measurement: Uncertainty and Causal Modeling Koosha Golmohammadi
Review: What influences confidence intervals?
1 © 1998 HRL Laboratories, LLC. All Rights Reserved Construction of Bayesian Networks for Diagnostics K. Wojtek Przytula: HRL Laboratories & Don Thompson:
Business Intelligence Michael Gross Tina Larsell Chad Anderson.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 7: Expert Systems and Artificial Intelligence Decision Support.
CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.
Statistics for Managers Using Microsoft® Excel 7th Edition
Annex I: Methods & Tools prepared by some members of the ICH Q9 EWG for example only; not an official policy/guidance July 2006, slide 1 ICH Q9 QUALITY.
Handouts Software Testing and Quality Assurance Theory and Practice Chapter 11 System Test Design
Monté Carlo Simulation MGS 3100 – Chapter 9. Simulation Defined A computer-based model used to run experiments on a real system.  Typically done on a.
1 CHAPTER M4 Cost Behavior © 2007 Pearson Custom Publishing.
S Neuendorf 2004 Prediction of Software Defects SASQAG March 2004 by Steve Neuendorf.
Software Reliability Growth. Three Questions Frequently Asked Just Prior to Release 1.Is this version of software ready for release (however “ready” is.
1 Forecasting Field Defect Rates Using a Combined Time-based and Metrics-based Approach: a Case Study of OpenBSD Paul Luo Li Jim Herbsleb Mary Shaw Carnegie.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 24 Slide 1 Critical Systems Validation 1.
National Institute of Economic and Social Research “Consensus estimates of forecast uncertainty: working out how little we know ” James Mitchell NIESR.
Software Estimation and Function Point Analysis Presented by Craig Myers MBA 731 November 12, 2007.
Chapter 6 : Software Metrics
Guide Jouni Tuomisto, Mikko Pohjola - National Institute for Health and Welfare - Department of Environmental Health – Finland Introduction: The world.
Decision Making Under Uncertainty and Risk 1 By Isuru Manawadu B.Sc in Accounting Sp. (USJP), ACA, AFM
TAIC-PART - Cumberland Lodge, UK – 29 Aug 2006 MOTOROLA and the Stylized M Logo are registered in the US Patent & Trademark Office. All other product or.
West Virginia University Towards Practical Software Reliability Assessment for IV&V Projects B. Cukic, E. Gunel, H. Singh, V. Cortellessa Department of.
CHAPTER 17: Tests of Significance: The Basics
Introduction to Bayesian Networks
Fast Simulators for Assessment and Propagation of Model Uncertainty* Jim Berger, M.J. Bayarri, German Molina June 20, 2001 SAMO 2001, Madrid *Project of.
SINTEF Telecom and Informatics EuroSPI’99 Workshop on Data Analysis Popular Pitfalls of Data Analysis Tore Dybå, M.Sc. Research Scientist, SINTEF.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Bayesian statistics Probabilities for everything.
Why Risk Models Should be Parameterised William Marsh, Risk Assessment and Decision Analysis Research Group.
This material is approved for public release. Distribution is limited by the Software Engineering Institute to attendees. Sponsored by the U.S. Department.
BSBPMG504A Manage Project Costs 7.1 Estimate Costs Adapted from PMBOK 4 th Edition InitiationPlanning ExecutionClose Monitor Control The process of developing.
Copyright , Dennis J. Frailey CSE7315 – Software Project Management CSE7315 M16 - Version 8.01 SMU CSE 7315 Planning and Managing a Software Project.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 5 Probability Theory (continued) Introduction to Bayesian Networks.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
SOFTWARE METRICS Software Metrics :Roadmap Norman E Fenton and Martin Neil Presented by Santhosh Kumar Grandai.
MATH 643 Bayesian Statistics. 2 Discrete Case n There are 3 suspects in a murder case –Based on available information, the police think the following.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Using Bayesian Nets to Predict Software Defects in Arbitrary Software Lifecycles Martin Neil Agena Ltd London, UK Web:
Slide 1 SPIN 23 February 2006 Norman Fenton Agena Ltd and Queen Mary University of London Improved Software Defect Prediction.
CSE SW Metrics and Quality Engineering Copyright © , Dennis J. Frailey, All Rights Reserved CSE8314M13 8/20/2001Slide 1 SMU CSE 8314 /
2007 Annual Meeting ● Vancouver IP 40 LTD & STD Pricing ● Andrew Ryan 2007 Annual Meeting ● Vancouver IP 40 LTD & STD Pricing ● Andrew Ryan Canadian Institute.
Copyright , Dennis J. Frailey CSE Software Measurement and Quality Engineering CSE8314 M00 - Version 7.09 SMU CSE 8314 Software Measurement.
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Capital Budgeting and Risk Pertemuan Matakuliah: A0774/Information Technology Capital Budgeting Tahun: 2009.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 4: Estimating parameters with confidence.
Modeling of Core Protection Calculator System Software February 28, 2005 Kim, Sung Ho Kim, Sung Ho.
Direct method of standardization of indices. Average Values n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the.
Demographic Analysis Migration: Estimation Using Residual Methods -
Evaluation and Control
MSA / Gage Capability (GR&R)
Measures of Dispersion
What’s Reasonable About a Range?
Bayesian statistics So far we have thought of probabilities as the long term “success frequency”: #successes / #trails → P(success). In Bayesian statistics.
Product reliability Measuring
of Heritage and New Hardware For Launch Vehicle Reliability Models
Session II: Reserve Ranges Who Does What
Critical Systems Validation
Software Reliability Models.
Critical Systems Validation
Significance Tests: The Basics
Significance Tests: The Basics
Presentation transcript:

1

Profile Decision-making and risk assessment under uncertainty Special expertise on software project risk assessment Novel applications of causal models (Bayesian networks) Research enables non-experts to build and use models to solve real-world risk assessment problems

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

Typical Applications Predicting reliability of critical systems Software defect prediction Aircraft accident traffic risk Warranty return rates of electronic parts Operational risk in financial institutions Hazards in petrochemical industry

A Bayesian Net for predicting air traffic incidents

A Detailed Example What follows is a demo of a simplified version of a Bayesian net model to provide more accurate predictions of software defects Many organisations worldwide have now used models based around this one

Predicting software defects Operational defects The number of operational defects (i.e. those found by customers) is what we are really interested in predicting

Residual Defects Operational defects We know this is clearly dependent on the number of residual defects. Predicting software defects

Residual Defects Operational defectsOperational usage But it is also critically dependent on the amount of operational usage. If you do not use the system you will find no defects irrespective of the number there. Predicting software defects

Residual Defects Defects IntroducedOperational defectsOperational usage Predicting software defects The number of residual defects is determined by the number you introduce during development….

Residual Defects Defects found and fixed Defects IntroducedOperational defectsOperational usage Predicting software defects …minus the number you successfully find and fix

Residual Defects Defects found and fixed Defects IntroducedOperational defectsOperational usage Obviously defects found and fixed is dependent on the number introduced Predicting software defects

Residual Defects Problem complexity Defects found and fixed Defects IntroducedOperational defectsOperational usage The number introduced is influenced by problem complexity… Predicting software defects

Residual Defects Problem complexity Defects found and fixed Defects Introduced Design process quality Operational defectsOperational usage ….and design process quality Predicting software defects

Residual DefectsTesting Effort Problem complexity Defects found and fixed Defects Introduced Design process quality Operational defectsOperational usage Finally, how many defects you find is influenced not just by the number there to find but also by the amount of testing effort Predicting software defects

A Model in action Here is that very simple model with the probability distributions shown

A Model in action We are looking at an individual software component in a system

A Model in action The prior probability distributions represent our uncertainty before we enter any specific information about this component.

A Model in action So the component is just as likely to have very high complexity as very low

A Model in action and the number of defects found and fixed in testing is in a wide range where the median value is about 20.

A Model in action As we enter observations about the component the probability distributions update

Here we have entered the observation that this component had 0 defects found and fixed in testing

Note how the other distributions changed.

The model is doing forward inference to predict defects in operation…..

..and backwards inference to make deductions about design process quality.

but actually the most likely explanation is very low testing quality.

…and lower than average complexity.

But if we find out that the complexity is actually high…..

then the expected number of operational defects increases

and we become even more convinced of the inadequate testing

So far we have made no observation about operational usage.

If, in fact, the operational usage is high…

Then we have an example of a component with no defects in test..

…but probably many defects in operation.

But suppose we find out that the test quality was very high.

Then we completely revise out beliefs

We are now pretty convinced that the module will be fault free in operation

…And the ‘explanation’ is that the design process is likely to be very high quality

A Model in action we reset the model and this time use the model to argue backwards

A Model in action Suppose we know that this is a critical component that has a requirement for 0 defects in operation…

The model looks for explanations for such a state of affairs.

The most obvious way to achieve such a result is to not use the component much.

But if we know it will be subject to high usage…

Then the model adjusts the beliefs about the other uncertain variables.

A combination of lower than average complexity…..

…Higher than average design quality…..

and much higher than average testing quality …..

But suppose we cannot assume our testing is anything other than average…

Then better design quality …..

..and lower complexity are needed …..

But if complexity is very high …..

…Then we are left with a very skewed distribution for design process quality.

What the model is saying is that, if these are the true requirements for the component then you are very unlikely to achieve them unless you have a very good design process

Making better decisions That was a simplified version of model produced for Philips Helped Philips make critical decisions about when to release software for electronic components 95% accuracy in defect prediction – much better than can be achieved by traditional statistical methods

Current Major EPSRC Projects eXdecide: Quantified Risk Assessment and Decision Support for Agile Software Projects DyFUSION: Towards a Novel Universal tool for Modelling and Reasoning under Uncertainty

Exploitation and Tool support Through spinout company Agena Ltd

In Summary RADAR is providing improved risk assessment for a wide range of critical applications Based on novel extensions of causal models At the leading edge for modelling uncertainty