Quantification of the non- parametric continuous BBNs with expert judgment Iwona Jagielska Msc. Applied Mathematics.

Slides:



Advertisements
Similar presentations
Copula Representation of Joint Risk Driver Distribution
Advertisements

A core course on Modeling kees van Overveld Week-by-week summary.
Statistics for Improving the Efficiency of Public Administration Daniel Peña Universidad Carlos III Madrid, Spain NTTS 2009 Brussels.
How do we generate the statistics of a function of a random variable? – Why is the method called “Monte Carlo?” How do we use the uniform random number.
1 8. Numerical methods for reliability computations Objectives Learn how to approximate failure probability using Level I, Level II and Level III methods.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
STAT 497 APPLIED TIME SERIES ANALYSIS
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
QA-2 FRM-GARP Sep-2001 Zvi Wiener Quantitative Analysis 2.
FRM Zvi Wiener Following P. Jorion, Financial Risk Manager Handbook Financial Risk Management.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Evaluating Hypotheses
Ch. 6 The Normal Distribution
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
MACHINE LEARNING 6. Multivariate Methods 1. Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2 Motivating Example  Loan.
Correlations and Copulas Chapter 10 Risk Management and Financial Institutions 2e, Chapter 10, Copyright © John C. Hull
Lehrstuhl für Informatik 2 Gabriella Kókai: Maschine Learning 1 Evaluating Hypotheses.
Lecture II-2: Probability Review
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
1 10. Joint Moments and Joint Characteristic Functions Following section 6, in this section we shall introduce various parameters to compactly represent.
Modern Navigation Thomas Herring
NIPRL Chapter 2. Random Variables 2.1 Discrete Random Variables 2.2 Continuous Random Variables 2.3 The Expectation of a Random Variable 2.4 The Variance.
Chapter 9 Numerical Integration Numerical Integration Application: Normal Distributions Copyright © The McGraw-Hill Companies, Inc. Permission required.
Nonparametric or Distribution-free Tests
Lecture 7: Simulations.
Random variables Petter Mostad Repetition Sample space, set theory, events, probability Conditional probability, Bayes theorem, independence,
Short Resume of Statistical Terms Fall 2013 By Yaohang Li, Ph.D.
Percentile Approximation Via Orthogonal Polynomials Hyung-Tae Ha Supervisor : Prof. Serge B. Provost.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 13 Oct 14, 2005 Nanjing University of Science & Technology.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
1 Institute of Engineering Mechanics Leopold-Franzens University Innsbruck, Austria, EU H.J. Pradlwarter and G.I. Schuëller Confidence.
Chapter 14 Monte Carlo Simulation Introduction Find several parameters Parameter follow the specific probability distribution Generate parameter.
PARAMETRIC STATISTICAL INFERENCE
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 Part 4 Curve Fitting.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
The Examination of Residuals. Examination of Residuals The fitting of models to data is done using an iterative approach. The first step is to fit a simple.
Danila Filipponi Simonetta Cozzi ISTAT, Italy Outlier Identification Procedures for Contingency Tables in Longitudinal Data Roma,8-11 July 2008.
An Efficient Sequential Design for Sensitivity Experiments Yubin Tian School of Science, Beijing Institute of Technology.
1 Nonparametric Statistical Techniques Chapter 17.
Chapter 13 Multiple Regression
POOLED DATA DISTRIBUTIONS GRAPHICAL AND STATISTICAL TOOLS FOR EXAMINING COMPARISON REFERENCE VALUES Alan Steele, Ken Hill, and Rob Douglas National Research.
Improved Cross Entropy Method For Estimation Presented by: Alex & Yanna.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Sampling and Sampling Distributions Basic Business Statistics 11 th Edition.
STOCHASTIC HYDROLOGY Stochastic Simulation of Bivariate Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
Introducing Error Co-variances in the ARM Variational Analysis Minghua Zhang (Stony Brook University/SUNY) and Shaocheng Xie (Lawrence Livermore National.
Joint Moments and Joint Characteristic Functions.
FORECASTING METHODS OF NON- STATIONARY STOCHASTIC PROCESSES THAT USE EXTERNAL CRITERIA Igor V. Kononenko, Anton N. Repin National Technical University.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Introduction A probability distribution is obtained when probability values are assigned to all possible numerical values of a random variable. It may.
Chapter Two Copyright © 2006 McGraw-Hill/Irwin The Marketing Research Process.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Stats 242.3(02) Statistical Theory and Methodology.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
Chapter 6 – Continuous Probability Distribution Introduction A probability distribution is obtained when probability values are assigned to all possible.
Estimating standard error using bootstrap
Deep Feedforward Networks
Probability Theory and Parameter Estimation I
Assoc. Prof. Dr. Peerapol Yuvapoositanon
Rutgers Intelligent Transportation Systems (RITS) Laboratory
Professor S K Dubey,VSM Amity School of Business
Hidden Markov Models Part 2: Algorithms
Parametric Methods Berlin Chen, 2005 References:
Chapter 2. Random Variables
Chapter 4. Supplementary Questions
Presentation transcript:

Quantification of the non- parametric continuous BBNs with expert judgment Iwona Jagielska Msc. Applied Mathematics

Outline of the presentation I PART 1. Introduction II PART 2. Method of eliciting conditional rank correlations 3. Comparison of algorithms to calculate multivariate normal probabilities 4. Presentation of elicitation software UniExp III PART 5. Building the Maintenance Performance Model Model variables Dependence relation 6. Results 7. Conclusions and recommendations I PART 1. Introduction II PART 2. Method of eliciting conditional rank correlations 3. Comparison of algorithms to calculate multivariate normal probabilities 4. Presentation of elicitation software UniExp III PART 5. Building the Maintenance Performance Model Model variables Dependence relation 6. Results 7. Conclusions and recommendations

CATS – casual model for Air Transport Safety – motivation and purpose - three sectors of human performance ATC Model, Flight Crew Performance Model Maintenance Performance Model 1. Introduction

2. Way of assessing dependence relations Conditional Rank correlations  Conditional probabilities of exceedence Why normal copula? - Advantages known relation between partial and rank correlation equal conditional and partial correlations possess zero independent property - Disadvantages no analytical form for multivariate cumulative distribution function P 1 = P ( F X4 (X 4 ) > q | F X3 (X 3 ) > q ) “Suppose that the variable X 3 was observed above its q th quantile. What is the probability that also X 4 will be observed above its q th quantile? “

2. Way of assessing dependence relations To see the conditional probability as a function of rank correlation we integrate bivariate normal density over the given region. we can calculate relationship between rank correlation and conditional probability

P 2 = P ( F X4 (X 4 ) > q | F X3 (X 3 ) > q, F X2 (X 2 ) > q ) “ Suppose that not only variable X 3 but also X 2 was observed above their q th quantile. What is the probability that also X 4 will be observed above its q th quantile? ” To find the conditional probability we integrate trivariate normal density over the given region with covariance matrix. 2. Way of assessing dependence relations We assess the higher order conditional rank correlation in the similar way.

Proposed numerical integration methods: Algorithm I and II – by Genz - first we apply transformation to simplify integration region - later randomized quasi Monte Carlo method is used - different choice of quasi points - in algorithm I we specify number of points; algorithm II assign number of points, s.t. the requested accuracy is provided Algorithm III and IV - based on successive subdivisions of integration region, where each subdivision is used to provide a better approximation of the integrand - polynomial rule is used to approximate integrand on each subregions - error estimate – difference between two polynomial rules of different order - algorithm IV may involve some simplification routines (change of variables) 3.1. Algorithms to calculate multivariate normal probabilities probabilities

T Ai – time of calculation for algorithm i P Ai – probability obtained by algorithm i E Ai – estimated error of approximation provided by algorithm i 700, 1500 – number of quasi random point in Alg I; requested accuracy for Alg II 3.2. NumericalComparison 3.2. Numerical Comparison dimension = 4, determinant = dimension = 7, determinant = 0.489

Algorithms III and IV are unpractical for large scale applications since they require long time for numerical calculations - time for hypercube [0.5, inf] 7 is more than 700seconds - when the procedure of subdivision of integration region is applied, algorithm do not provide the total error In Algorithm I user needs to specify number of quasi random points used to calculation; there is no control of provided error of estimation; time of calculation depends on the number of points, not of covariance matrix Time of calculation for Algorithm II is sometimes grater than for Algorithm I; time depends on covariance matrix; number of quasi random points depends on requested accuracy of solution At this moment Algorithm II is used in the software UniExp as the most accurate one; Algorithm I also has future potencial for implementation. 3. NumericalComparison – brief summary 3. Numerical Comparison – brief summary

4. Software elicitation tool - UniExp 1 Step – input of nodes and connections

4. Software elicitation tool - UniExp 2 Step – elicitation of conditional rank correlations

4. Software elicitation tool - UniExp Values of Rank Correlations can be found in RankCorrelationValues.txt file

5. Maintenance Performance Model

Elicitation with single expert we asked – 4 questions about marginal distributions – classical method of expert judgment 7 questions about conditional probabilities of exceedance 5. Maintenance Performance Model – dependence relation All variables are negatively correlated with variable human error

5. Maintenance Performance Model At the bottom of each histogram the expectation and standard derivation are shown. Unconditional expected value of human error is 0.266/10000

6. Maintenance Performance Model - conditioning Number of years of experience = 3 expected value of human error increases 0.266/ > 0.309/10000

6. Maintenance Performance Model - conditioning Requiring at least 6 hours of sleep provides decrease of expected human error from 0.266/10000 to 0.152/10000 Moreover E(HE|WorkCond=1,Alert=6) = 0.248/10000 while E(HE|WorkCond=1)=0.398/10000

7. Conclusions and recommendations Calculation of multivariate normal probabilities is not an easy task in case of high dimension; there is still need to develop more fast (and also accurate) algorithm for higher dimension Include Algorithm I in UniExp software; together with making UniExp to worked outside the Matlab environment Combining experts opinion to obtain better results Collect data describing to marginal distribution in Maintenance Performance Model Discover other possible influential factors in Maintenance Performance Model Any other propositions?

Questions ???

Additional Slides

A1. Covariance matrixes used in numerical tests Dimension 4 determinant = determinant =

A1. Covariance matrixes used in numerical tests Dimension 7 determinant = determinant =

A2. Determinant of covariance matrix as the measure of spread from distribution of spread from distribution Dimension 2 determinant = 1

A2. Determinant of covariance matrix as the measure of spread of distribution of spread of distribution Dimension 2 determinant = 0.51,  =0.714

A2. Determinant of covariance matrix as the measure of spread of distribution of spread of distribution Dimension 2 determinant = ,  =0.141

3.2. NumericalComparison 3.2. Numerical Comparison Test 1 – identity covariance matrix dimension = 4 dimension = 7 T Ai – time of calculation for algorithm i P Ai – probability obtained by algorithm i E Ai – estimated error of approximation provided by algorithm i 700, 1500 – number of quasi random point in Alg I; requested accuracy for Alg II

Test 2 – covariance matrix with determinant  0.5 dimension = 4, determinant = dimension = 7, determinant = NumericalComparison 3. Numerical Comparison

Test 3 – covariance matrix with determinant  0.1 dimension = 4, determinant = dimension = 7, determinant = NumericalComparison 3. Numerical Comparison

-Motivation – - part of CATS model - build to describe the causal factors influencing the maintenance crew Methodology - non-parametric BBNQuantification - Nodes – variables which can influence the human performance among the maintenance crew; marginal distribution – data or Classical Method (Expert Judgment) - Conditional rank correlations – obtained from experts through the dependence probabilities of exceedance 5. Maintenance Performance Model

5. Maintenance Performance Model – model variables VariableDefinition Source of marginal distribution 1. Job 1. Job Trainings average number of training per year Expert judgment 2. Alertnessaverage number of hours an aircraft mechanic sleeps of per dayData 3. Communicationcurrent information transfer procedure in use, distinguishing: 1. only paper notes, 2. paper notes with oral feedback Expert judgment 4. Experienceaverage number of years a person worked as aircraft mechanicData 6. Aircraft Generationaircraft generation in scale from 1 to 4 where 4 is the most recent generation Data 5. Working Conditions average number of maintenance operations needed to be performed 1.out-side / 2. inside the hangar per 10,000 maintenance operations Expert Judgment 7. Human Error number of maintenance human errors that might lead to hazardous situations per 10,000 maintenance tasks Expert Judgment